Report an Error

NVIDIA Tesla V100 SXM2 32 GB

GV100
Graphics Processor
5120
Cores
320
TMUs
128
ROPs
32 GB
Memory Size
HBM2
Memory Type
4096 bit
Bus Width
NVIDIA Tesla V100 SXM2 32 GB Photo NVIDIA GV100 Photo
The Tesla V100 SXM2 32 GB is a professional graphics card by NVIDIA, launched in March 2018. Built on the 12 nm process, and based on the GV100 graphics processor, the card supports DirectX 12.0. The GV100 graphics processor is a large chip with a die area of 815 mm² and 21,100 million transistors. It features 5120 shading units, 320 texture mapping units and 128 ROPs. Also included are 640 tensor cores which help improve the speed of machine learning applications. NVIDIA has placed 32,768 MB HBM2 memory on the card, which are connected using a 4096-bit memory interface. The GPU is operating at a frequency of 1290 MHz, which can be boosted up to 1530 MHz, memory is running at 876 MHz.
Being a dual-slot card, the NVIDIA Tesla V100 SXM2 32 GB does not require any additional power connector, its power draw is rated at 250 W maximum. This device has no display connectivity, as it is not designed to have monitors connected to it. Tesla V100 SXM2 32 GB is connected to the rest of the system using a PCI-Express 3.0 x16 interface.

Graphics Processor

GPU Name
GV100
Architecture
Volta
Foundry
TSMC
Process Size
12 nm
Transistors
21,100 million
Die Size
815 mm²

Graphics Card

Release Date
Mar 27th, 2018
Generation
Tesla
(Vxx)
Production
Active
Bus Interface
PCIe 3.0 x16

Relative Performance

1%
1%
1%
1%
2%
2%
2%
3%
3%
3%
3%
4%
4%
4%
5%
5%
5%
5%
5%
6%
6%
6%
6%
7%
7%
7%
7%
7%
8%
8%
8%
8%
9%
9%
9%
9%
9%
9%
9%
9%
10%
10%
11%
11%
11%
11%
11%
12%
13%
13%
13%
13%
13%
14%
14%
14%
14%
15%
15%
16%
16%
16%
16%
17%
17%
17%
18%
18%
18%
18%
18%
18%
19%
20%
21%
21%
21%
22%
22%
22%
23%
24%
24%
24%
24%
27%
30%
30%
32%
32%
33%
33%
34%
34%
34%
34%
34%
35%
37%
37%
38%
38%
42%
42%
43%
46%
51%
53%
57%
57%
60%
67%
70%
72%
78%
89%
100%
Tesla V100 SXM2 32 GB
Based on TPU review data: "Performance Summary" at 1920x1080
Performance estimated based on architecture, shader count and clocks.

Clock Speeds

GPU Clock
1290 MHz
Boost Clock
1530 MHz
Memory Clock
876 MHz
1752 MHz effective

Memory

Memory Size
32 GB
Memory Type
HBM2
Memory Bus
4096 bit
Bandwidth
897.0 GB/s

Render Config

Shading Units
5120
TMUs
320
ROPs
128
SM Count
80
Tensor Cores
640

Theoretical Performance

Pixel Rate
195.8 GPixel/s
Texture Rate
489.6 GTexel/s
FP16 (half) performance
31,334 GFLOPS (2:1)
FP32 (float) performance
15,667 GFLOPS
FP64 (double) performance
7,834 GFLOPS (1:2)

Board Design

Slot Width
Dual-slot
TDP
250 W
Outputs
No outputs
Power Connectors
None

Graphics Features

DirectX
12.0 (12_1)
OpenGL
4.6
OpenCL
2.0
Vulkan
1.1.82
CUDA
7.0
Shader Model
6.1

Card Notes

Boost Clock:
Deep Learning: 125.338TFLOPs

GV100 GPU Notes

PureVideo HD: VP9
VDPAU: Feature Set I