NVIDIA Tesla P100 SXM2

Graphics Processor
GP100
Cores
3584
TMUs
224
ROPs
96
Memory Size
16 GB
Memory Type
HBM2
Bus Width
4096 bit
The Tesla P100 SXM2 is a professional graphics card by NVIDIA, launched in April 2016. Built on the 16 nm process and based on the GP100 graphics processor in its GP100-890-A1 variant, the card supports DirectX 12. The GP100 is a large chip with a die area of 610 mm² and 15,300 million transistors. It features 3584 shading units, 224 texture mapping units, and 96 ROPs. NVIDIA has paired 16 GB of HBM2 memory with the Tesla P100 SXM2, connected through a 4096-bit memory interface. The GPU runs at a base frequency of 1328 MHz, which can boost up to 1480 MHz; the memory runs at 715 MHz.
Its power draw is rated at 300 W maximum. This device has no display connectivity, as it is not designed to have monitors connected to it. The Tesla P100 SXM2 connects to the rest of the system through a PCI-Express 3.0 x16 interface.

Graphics Processor

GPU Name
GP100
GPU Variant
GP100-890-A1
Architecture
Pascal
Foundry
TSMC
Process Size
16 nm
Transistors
15,300 million
Die Size
610 mm²

Graphics Card

Release Date
Apr 5th, 2016
Generation
Tesla (Pxx)
Production
Active
Bus Interface
PCIe 3.0 x16

Relative Performance

1%
GeForce 210
1%
GeForce 9400 GT
2%
Radeon HD 4550
2%
Radeon HD 5450
3%
Radeon HD 6450
3%
GeForce GT 520
3%
GeForce GT 220
4%
GeForce GT 430
4%
Radeon HD 5570
5%
Radeon HD 4670
5%
GeForce GT 440
5%
GeForce GT 240
6%
GeForce 9600 GT
6%
Radeon HD 5670
7%
GeForce GT 640
7%
GeForce 9800 GT
7%
Radeon HD 6670
8%
Radeon HD 4830
8%
Radeon HD 4770
9%
Radeon HD 4850
9%
GeForce GTS 250
9%
GeForce GTS 450
9%
Radeon HD 5750
11%
Radeon HD 7750
11%
Radeon HD 5770
11%
GeForce GTX 260
11%
Radeon HD 4870
11%
Radeon Vega 8
11%
GeForce GTX 550 Ti
11%
GeForce GTX 650
12%
Radeon HD 5830
12%
Radeon HD 6790
13%
Radeon HD 4890
13%
GeForce GTX 460
13%
Radeon RX Vega 11
13%
GeForce GT 1030
14%
GeForce GTX 275
14%
Radeon HD 7770 GHz Edition
14%
GeForce GTX 280
14%
GeForce GTX 465
15%
Radeon HD 6850
15%
GeForce GTX 285
16%
Radeon HD 5850
16%
GeForce GTX 650 Ti
16%
Radeon HD 7790
17%
Radeon RX 550
17%
GeForce GTX 470
17%
Radeon HD 6870
19%
Radeon HD 5870
19%
GeForce GTX 560 Ti
19%
Radeon HD 4870 X2
20%
GeForce GTX 750 Ti
20%
Radeon HD 6950
21%
GeForce GTX 295
21%
GeForce GTX 650 Ti Boost
21%
Radeon HD 7850
21%
GeForce GTX 480
22%
Radeon HD 6970
23%
Radeon R7 265
23%
Radeon RX 460
23%
Radeon R7 370
23%
GeForce GTX 660
23%
GeForce GTX 570
24%
Radeon RX 560
25%
GeForce GTX 950
25%
Radeon HD 7870 GHz Edition
26%
Radeon R9 270X
26%
GeForce GTX 660 Ti
26%
GeForce GTX 580
27%
GeForce GTX 1050
27%
Radeon HD 7950
27%
Radeon HD 5970
28%
GeForce GTX 760
30%
GeForce GTX 670
31%
GeForce GTX 960
32%
Radeon R9 380
32%
Radeon R9 285
32%
Radeon HD 7970
32%
GeForce GTX 680
33%
GeForce GTX 1050 Ti
34%
GeForce GTX 770
35%
Radeon HD 6990
36%
Radeon R9 280X
36%
GeForce GTX 590
36%
Radeon HD 7970 GHz Edition
40%
GeForce GTX 780
42%
GeForce GTX 1650
44%
Radeon RX 470
45%
Radeon R9 290
47%
Radeon RX 570
48%
Radeon R9 390
48%
Radeon R9 290X
49%
GeForce GTX 970
49%
GeForce GTX TITAN
50%
GeForce GTX 780 Ti
51%
Radeon R9 390X
51%
Radeon HD 7990
51%
Radeon RX 480
52%
GeForce GTX 690
53%
GeForce GTX 1060 6 GB
54%
Radeon RX 5500 OEM
54%
Radeon RX 580
55%
Radeon RX 5500 XT
55%
GeForce GTX 980
56%
GeForce GTX 1650 SUPER
56%
Radeon R9 FURY
59%
Radeon RX 590
62%
Radeon R9 295X2
62%
GeForce GTX 1660
63%
Radeon R9 FURY X
64%
GeForce GTX 980 Ti
66%
GeForce GTX TITAN X
70%
GeForce GTX 1660 SUPER
71%
GeForce GTX 1660 Ti
71%
GeForce GTX 1070
76%
Radeon RX Vega 56
80%
GeForce GTX 1070 Ti
83%
Radeon RX Vega 64
84%
GeForce RTX 2060
84%
GeForce GTX 1080
87%
Radeon RX 5700
94%
GeForce RTX 2060 SUPER
98%
GeForce RTX 2070
100%
Radeon RX 5700 XT
100%
Tesla P100 SXM2
101%
Radeon VII
104%
TITAN X Pascal
107%
GeForce GTX 1080 Ti
108%
GeForce RTX 2070 SUPER
115%
GeForce RTX 2080
122%
GeForce RTX 2080 SUPER
134%
GeForce RTX 2080 Ti
178%
GeForce RTX 3080
195%
GeForce RTX 3090
Based on TPU review data: "Performance Summary" at 1920x1080, 4K for 2080 Ti and faster.
Performance estimated based on architecture, shader count and clocks.
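
The percentages in the chart are normalized so that the Tesla P100 SXM2 sits at 100%, which means any two entries can be compared by dividing their values. The sketch below illustrates that arithmetic with a few values copied from the chart; the dictionary and the `speed_ratio` helper are hypothetical names introduced here, and because the chart mixes 1920x1080 and 4K data (and this card's position is an estimate), the resulting ratios should be read as rough guidance only.

```python
# Relative performance values copied from the chart above
# (percent, with the Tesla P100 SXM2 as the 100% reference).
RELATIVE_PERFORMANCE = {
    "GeForce GTX 1080": 84,
    "Tesla P100 SXM2": 100,
    "GeForce RTX 2080 Ti": 134,
    "GeForce RTX 3090": 195,
}


def speed_ratio(card: str, baseline: str) -> float:
    """Return how many times faster `card` is than `baseline` per the chart."""
    return RELATIVE_PERFORMANCE[card] / RELATIVE_PERFORMANCE[baseline]


# Compared with the reference card itself, the ratio is just percent / 100.
print(round(speed_ratio("GeForce RTX 3090", "Tesla P100 SXM2"), 2))   # 1.95
# Any other pair works the same way: 195 / 84.
print(round(speed_ratio("GeForce RTX 3090", "GeForce GTX 1080"), 2))  # 2.32
```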

Clock Speeds

Base Clock
1328 MHz
Boost Clock
1480 MHz
Memory Clock
715 MHz (1430 Mbps effective)

Memory

Memory Size
16 GB
Memory Type
HBM2
Memory Bus
4096 bit
Bandwidth
732.2 GB/s
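
The bandwidth figure follows from the bus width and the effective data rate listed above. As a minimal sketch, assuming the usual convention that peak bandwidth equals bytes per transfer times transfers per second, and that the 1430 Mbps effective rate is the 715 MHz memory clock doubled, the calculation looks like this (`memory_bandwidth_gbs` is a hypothetical helper name):

```python
def memory_bandwidth_gbs(bus_width_bits: int, effective_rate_mbps: float) -> float:
    """Peak memory bandwidth in GB/s (1 GB = 10**9 bytes)."""
    bytes_per_transfer = bus_width_bits / 8           # 4096 bit -> 512 bytes
    transfers_per_second = effective_rate_mbps * 1e6  # per-pin data rate
    return bytes_per_transfer * transfers_per_second / 1e9


# Assumption: two transfers per memory clock, so 715 MHz -> 1430 Mbps.
effective_rate_mbps = 715 * 2
print(round(memory_bandwidth_gbs(4096, effective_rate_mbps), 1))  # 732.2
```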

Render Config

Shading Units
3584
TMUs
224
ROPs
96
SM Count
56
L1 Cache
24 KB (per SM)
L2 Cache
4 MB
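
The totals above can be broken down per streaming multiprocessor (SM) by simple division. The sketch below assumes the units are distributed evenly across the 56 SMs, which the whole-number results are consistent with; `per_sm` is a hypothetical helper introduced for illustration.

```python
SM_COUNT = 56
SHADING_UNITS = 3584
TMUS = 224
L1_KB_PER_SM = 24


def per_sm(total_units: int, sm_count: int) -> int:
    """Units per SM, assuming an even split across all SMs."""
    if total_units % sm_count != 0:
        raise ValueError("units do not divide evenly across SMs")
    return total_units // sm_count


print(per_sm(SHADING_UNITS, SM_COUNT))  # 64 shading units per SM
print(per_sm(TMUS, SM_COUNT))           # 4 TMUs per SM
print(SM_COUNT * L1_KB_PER_SM)          # 1344 KB of L1 cache across the chip
```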

Theoretical Performance

Pixel Rate
142.1 GPixel/s
Texture Rate
331.5 GTexel/s
FP16 (half) performance
21.22 TFLOPS (2:1)
FP32 (float) performance
10.61 TFLOPS
FP64 (double) performance
5.304 TFLOPS (1:2)
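
These theoretical figures can be reproduced from the unit counts and the boost clock. The sketch below assumes the common conventions that each ROP writes one pixel per clock, each TMU samples one texel per clock, and each shading unit completes one fused multiply-add (counted as 2 floating-point operations) per clock; the function names are hypothetical, and the FP16 and FP64 values simply apply the 2:1 and 1:2 ratios listed above to the FP32 result.

```python
BOOST_CLOCK_MHZ = 1480
SHADING_UNITS = 3584
TMUS = 224
ROPS = 96


def pixel_rate_gpixels(rops: int, clock_mhz: float) -> float:
    """Peak pixel fill rate in GPixel/s (one pixel per ROP per clock)."""
    return rops * clock_mhz / 1000


def texture_rate_gtexels(tmus: int, clock_mhz: float) -> float:
    """Peak texture fill rate in GTexel/s (one texel per TMU per clock)."""
    return tmus * clock_mhz / 1000


def fp32_tflops(shading_units: int, clock_mhz: float) -> float:
    """Peak FP32 throughput in TFLOPS (one FMA = 2 FLOPs per unit per clock)."""
    return shading_units * 2 * clock_mhz / 1e6


fp32 = fp32_tflops(SHADING_UNITS, BOOST_CLOCK_MHZ)
print(round(pixel_rate_gpixels(ROPS, BOOST_CLOCK_MHZ), 1))    # 142.1
print(round(texture_rate_gtexels(TMUS, BOOST_CLOCK_MHZ), 1))  # 331.5
print(round(fp32, 2))      # 10.61
print(round(fp32 * 2, 2))  # 21.22 (FP16 at 2:1)
print(round(fp32 / 2, 3))  # 5.304 (FP64 at 1:2)
```

All five results match the listed values, which suggests the page derives them from the boost clock rather than the base clock.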

Board Design

TDP
300 W
Suggested PSU
700 W
Outputs
No outputs
Power Connectors
None

Graphics Features

DirectX
12 (12_1)
OpenGL
4.6
OpenCL
1.2
Vulkan
1.2
CUDA Compute Capability
6.0
Shader Model
6.4