Kepler_(microarchitecture)

Kepler
Launched	April 3, 2012
Designed by	Nvidia
Manufactured by	TSMC;
Fabrication process	TSMC 28 nm
Product Series
Desktop	GeForce 600 series ; GeForce 700 series;
Professional/workstation	Quadro K;
Server/datacenter	Tesla K;
Specifications
L1 cache	16 KB (per SM)
L2 cache	Up to 512 KB
Memory support	GDDR5
PCIe support	PCIe 2.0 ; PCIe 3.0
Supported Graphics APIs
DirectX	DirectX 12 Ultimate (Feature Level 11_0)
Shader Model	Shader Model 6.5
Vulkan	Vulkan 1.2
Media Engine
Encode codecs	H.264
Decode codecs	H.264; H.265;
Encoder(s) supported	NVENC
Display outputs	DVI ; DisplayPort 1.2 ; HDMI 1.4a
History
Predecessor	Fermi
Successor	Maxwell

Kepler (microarchitecture)

GPU microarchitecture by Nvidia

Kepler is the codename for a GPU microarchitecture developed by Nvidia, first introduced at retail in April 2012,^[1] as the successor to the Fermi microarchitecture. Kepler was Nvidia's first microarchitecture to focus on energy efficiency. Most GeForce 600 series, most GeForce 700 series, and some GeForce 800M series GPUs were based on Kepler, all manufactured in 28 nm. Kepler found use in the GK20A, the GPU component of the Tegra K1 SoC, and in the Quadro Kxxx series, the Quadro NVS 510, and Tesla computing modules.

Quick Facts Launched, Designed by ...

Portrait of Johannes Kepler, eponym of architecture

Kepler was followed by the Maxwell microarchitecture and used alongside Maxwell in the GeForce 700 series and GeForce 800M series.

The architecture is named after Johannes Kepler, a German mathematician and key figure in the 17th century scientific revolution.

Features

The GK Series GPU contains features from both the older Fermi and newer Kepler generations. Kepler based members add the following standard features:

PCI Express 3.0 interface
DisplayPort 1.2
HDMI 1.4a 4K x 2K video output
PureVideo VP5 hardware video acceleration (up to 4K x 2K H.264 decode)
Hardware H.265 decoding^[8]
Hardware H.264 encoding acceleration block (NVENC)
Support for up to 4 independent 2D displays, or 3 stereoscopic/3D displays (NV Surround)
Next Generation Streaming Multiprocessor (SMX)
Polymorph-Engine 2.0
Simplified Instruction Scheduler
Bindless Textures
CUDA Compute Capability 3.0 to 3.5
GPU Boost (Upgraded to 2.0 on GK110)
TXAA Support
Manufactured by TSMC on a 28 nm process
New Shuffle Instructions
Dynamic Parallelism
Hyper-Q (Hyper-Q's MPI functionality reserve for Tesla only)
Grid Management Unit
Nvidia GPUDirect (GPU Direct's RDMA functionality reserve for Tesla only)

Share this article:

This article uses material from the Wikipedia article Kepler_(microarchitecture), and is written by contributors. Text is available under a CC BY-SA 4.0 International License; additional terms may apply. Images, videos and audio are available under their respective licenses.

[1] [1]
Mujtaba, Hassan (18 February 2012). "Nvidia Expected to launch Eight New 28nm Kepler GPU's in April 2012".

[2] [2]
"Inside Kepler" (PDF). Retrieved 2015-09-19.

[gtx680-nvidia-paper-3] [3]
"Introducing The GeForce GTX 680 GPU". Nvidia. March 22, 2012. Retrieved 2015-09-19.

[4] [4]
"Nvidia's Next Generation CUDA Compute Architecture: Kepler TM GK110" (PDF). Nvidia.

[anandtech-GTX680-review-5] [5]
Smith, Ryan (March 22, 2012). "Nvidia GeForce GTX 680 Review: Retaking The Performance Crown". AnandTech. Retrieved November 25, 2012.

[6] [6]
"Efficiency Through Hyper-Q, Dynamic Parallelism, & More". Nvidia. November 12, 2012. Retrieved 2015-09-19.

[7] [7]
"GeForce GTX 770 | Specifications | GeForce". Nvidia. Retrieved 2022-06-07.

[8] [8]
https://bluesky-soft.com/en/dxvac/deviceInfo/decoder/nvidia.doc

[9] [9]
"GeForce 680 (Kepler) Whitepaper" (PDF). Nvidia. Retrieved March 22, 2024.

[10] [10]
"Nvidia Kepler GK210/110 Architecture White Paper" (PDF). Nvidia. Retrieved 22 March 2024.

[anandtech-GK110-preview-11] [11]
Smith, Ryan (November 12, 2012). "Nvidia Launches Tesla K20 & K20X: GK110 Arrives At Last". AnandTech. Retrieved September 19, 2015.

[nvidia-12] [12]
"Nvidia Kepler GK110 Architecture Whitepaper" (PDF). Nvidia. Retrieved 2015-09-19.

[13] [13]
"Nvidia Launches First GeForce GPUs Based on Next-Generation Kepler Architecture". Nvidia. March 22, 2012. Archived from the original on June 14, 2013.

[14] [14]
Edward, James (November 22, 2012). "Nvidia claims partially support DirectX 11.1". TechNews. Archived from the original on June 28, 2015. Retrieved 2015-09-19.

[Nvidia/D3D11.1-15] [15]
"Nvidia Doesn't Fully Support DirectX 11.1 with Kepler GPUs, But… (Web Archive Link)". BSN. Archived from the original on December 29, 2012.

[16] [16]
"D3D_FEATURE_LEVEL enumeration (Windows)". MSDN. Retrieved 2015-09-19.

[17] [17]
Moreton, Henry (March 20, 2014). "DirectX 12: A Major Stride for Gaming". Nvidia. Retrieved 2015-09-19.

[18] [18]
"Nvidia GPUDirect". Nvidia Developer. October 6, 2015. Retrieved February 5, 2019.

[Tom’s_Hardware-19] [19]
Angelini, Chris (March 22, 2012). "Benchmark Results: NVEnc And MediaEspresso 6.5". Tom’s Hardware. Retrieved September 19, 2015.

[20] [20]
Angelini, Chris (November 7, 2013). "Nvidia GeForce GTX 780 Ti Review: GK110, Fully Unlocked". Tom's Hardware. p. 1. Retrieved December 6, 2015. The card's driver deliberately operates GK110's FP64 units at 1/8 of the GPU's clock rate. When you multiply that by the 3:1 ratio of single- to double-precision CUDA cores, you get a 1/24 rate

[21] [21]
Smith, Ryan (13 September 2012). "The Nvidia GeForce GTX 660 Review: GK106 Fills Out The Kepler Family". AnandTech. p. 1. Retrieved 6 December 2015.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

[20]

[21]

		GK104	GK106	GK107	GK110
Variant(s)		GK104-200-A2 GK104-300-A2 GK104-325-A2 GK104-400-A2 GK104-425-A2 GK104-850-A2	GK106-240-A1 GK107-400-A1	GK107-300-A2 GK107-301-A2 GK107-320-A2 GK107-400-A2 GK107-425-A2 GK107-450-A2 GK107-810-A2	GK110-300-A1 GK110-400-A1 GK110-425-B1 GK110-885-A1
Release date		Apr 3, 2012	Sep 6, 2012	Sep 6, 2012	Nov 12, 2012
Cores	CUDA Cores	1536	960	384	2880
	TMUs	128	80	32	240
	ROPs	32	24	16	48
Streaming Multiprocessors		8	5	2	15
GPCs		4	3	1	5
Cache	L1	128 KB	80 KB	32 KB	240 KB
Cache	L2	512 KB	512 KB	256 KB	1.5 MB
Memory interface		256-bit	192-bit	192-bit	384-bit
Die size		294 mm²	221 mm²	118 mm²	561 mm²
Transistor count		3.54 bn.	2.54 bn.	1.27 bn.	7.08 bn.
Transistor density		12.0 MTr/mm²	11.5 MTr/mm²	10.8 MTr/mm²	12.6 MTr/mm²
Package socket		BGA 1745	BGA 1425	BGA 908	BGA 2152
Products
Consumer	Desktop	GTX 660 GTX 660 Ti GTX 670 GTX 680 GTX 690 GTX 760 GTX 760 Ti GTX 770	GTX 650 GTX 650 Ti GTX 660 GTX 750 Ti	GT 630 GTX 650 GT 720 GT 730 GT 740 GT 1030	GTX 780 GTX Titan
Consumer	Mobile	GTX 670MX GTX 675MX GTX 680M GTX 680MX GTX 775M GTX 780M GTX 860M GTX 870M GTX 880M	GTX 765M GTX 770M	GT 640M GTX 640M LE GT 645M GT 650M GTX 660M GT 740M GT 745M GT 750M GT 755M GTX 810M GTX 820M	—
Workstation	Desktop	Quadro K4200 Quadro K5000	Quadro K4000 Quadro K5000	Quadro K410 Quadro K420 Quadro K600 Quadro K2000 Quadro K2000D	Quadro K5200 Quadro K6000
Workstation	Mobile	Quadro K3000M Quadro K3100M Quadro K4000M Quadro K4100M Quadro K5000M Quadro K5100M	—	Quadro K100M Quadro K200M Quadro K500M Quadro K1000M Quadro K1100M Quadro K2000M	—

Kepler_(microarchitecture)

Kepler (microarchitecture)

Overview

Features

Next Generation Streaming Multiprocessor (SMX)

Simplified Instruction Scheduler

GPU Boost

Microsoft Direct3D Support

Next Microsoft Direct3D Support

TXAA Support

Shuffle Instructions

Hyper-Q

Dynamic Parallelism

Grid Management Unit

Nvidia GPUDirect

Video decompression/compression

NVDEC

NVENC

Performance

Kepler dies

See also

References

Share this article:

Product Series
Launched	April 3, 2012 (2012-04-03)
Designed by	Nvidia
Manufactured by	TSMC
Fabrication process	TSMC 28 nm
Desktop	GeForce 600 series GeForce 700 series
Professional/workstation	Quadro K
Server/datacenter	Tesla K
Specifications
L1 cache	16 KB (per SM)
L2 cache	Up to 512 KB
Memory support	GDDR5
PCIe support	PCIe 2.0 PCIe 3.0
Supported Graphics APIs
DirectX	DirectX 12 Ultimate (Feature Level 11_0)
Shader Model	Shader Model 6.5
Vulkan	Vulkan 1.2
Media Engine
Encode codecs	H.264
Decode codecs	H.264 H.265
Encoder(s) supported	NVENC
Display outputs	DVI DisplayPort 1.2 HDMI 1.4a
History
Predecessor	Fermi
Successor	Maxwell