NVIDIA A100 SXM4 40 GB
- Graphics Processor
- GA100
- Cores
- 6912
- TMUs
- 432
- ROPs
- 160
- Memory Size
- 40 GB
- Memory Type
- HBM2e
- Bus Width
- 5120 bit
The A100 SXM4 40 GB is a professional graphics card by NVIDIA, launched on May 14th, 2020. It is built on the 7 nm process and based on the GA100 graphics processor. The card does not support DirectX 11 or DirectX 12, so it cannot run games that require those APIs. The GA100 graphics processor is a large chip with a die area of 826 mm² and 54,200 million transistors. It features 6912 shading units, 432 texture mapping units, and 160 ROPs. Also included are 432 tensor cores, which help improve the speed of machine learning applications. NVIDIA has paired 40 GB of HBM2e memory with the A100 SXM4 40 GB, connected using a 5120-bit memory interface. The GPU operates at a base frequency of 1095 MHz, which can be boosted up to 1410 MHz; the memory runs at 1215 MHz.
Its power draw is rated at 400 W maximum. This device has no display connectivity, as it is not designed to have monitors connected to it. The A100 SXM4 40 GB is connected to the rest of the system using a PCI-Express 4.0 x16 interface.
Graphics Processor
- GPU Name
- GA100
- Architecture
- Ampere
- Foundry
- TSMC
- Process Size
- 7 nm
- Transistors
- 54,200 million
- Die Size
- 826 mm²
Graphics Card
- Release Date
- May 14th, 2020
- Generation
- Tesla (Axx)
- Production
- Active
- Bus Interface
- PCIe 4.0 x16
Clock Speeds
- Base Clock
- 1095 MHz
- Boost Clock
- 1410 MHz
- Memory Clock
- 1215 MHz (2.4 Gbps effective)
Memory
- Memory Size
- 40 GB
- Memory Type
- HBM2e
- Memory Bus
- 5120 bit
- Bandwidth
- 1,555 GB/s
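The bandwidth figure follows from the bus width and memory clock listed above. A minimal sketch, assuming the memory transfers data twice per clock (double data rate, which matches the roughly 2.4 Gbps effective rate shown for the 1215 MHz memory clock); the function name `memory_bandwidth_gbs` is hypothetical and used only for illustration:

```python
def memory_bandwidth_gbs(bus_width_bits: int,
                         memory_clock_mhz: float,
                         transfers_per_clock: int = 2) -> float:
    """Return peak memory bandwidth in GB/s (decimal gigabytes).

    Assumption: double data rate, i.e. two transfers per memory clock.
    """
    bytes_per_transfer = bus_width_bits / 8
    transfers_per_second = memory_clock_mhz * 1e6 * transfers_per_clock
    return bytes_per_transfer * transfers_per_second / 1e9


# Values taken from the Memory and Clock Speeds sections.
bandwidth = memory_bandwidth_gbs(bus_width_bits=5120, memory_clock_mhz=1215)
print(f"{bandwidth:.1f} GB/s")
```

With the listed 5120-bit bus and 1215 MHz clock, this gives about 1,555 GB/s, in line with the table.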
Render Config
- Shading Units
- 6912
- TMUs
- 432
- ROPs
- 160
- SM Count
- 108
- Tensor Cores
- 432
- L1 Cache
- 192 KB (per SM)
- L2 Cache
- 40 MB
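The unit counts above are consistent with a fixed number of units per SM. The sketch below checks this, assuming 64 FP32 shading units, 4 TMUs, and 4 tensor cores per SM; these per-SM figures are not stated on this page and are taken as assumptions about the GA100 layout:

```python
SM_COUNT = 108  # from the Render Config section

# Assumed per-SM resources (not listed on this page).
SHADING_UNITS_PER_SM = 64
TMUS_PER_SM = 4
TENSOR_CORES_PER_SM = 4

shading_units = SM_COUNT * SHADING_UNITS_PER_SM
tmus = SM_COUNT * TMUS_PER_SM
tensor_cores = SM_COUNT * TENSOR_CORES_PER_SM

print(shading_units, tmus, tensor_cores)
```

Under those assumptions the totals come out to 6912 shading units, 432 TMUs, and 432 tensor cores, matching the listed values.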
Theoretical Performance
- Pixel Rate
- 225.6 GPixel/s
- Texture Rate
- 609.1 GTexel/s
- FP16 (half) performance
- 77.97 TFLOPS (4:1)
- FP32 (float) performance
- 19.49 TFLOPS
- FP64 (double) performance
- 9.746 TFLOPS (1:2)
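These theoretical figures can be reproduced from the unit counts and the boost clock. A rough sketch, assuming each ROP outputs one pixel and each TMU one texel per clock, and each shading unit performs one fused multiply-add (counted as 2 FLOPs) per clock; the helper names are hypothetical, and the 4:1 and 1:2 ratios are the ones given in the list above:

```python
BOOST_CLOCK_MHZ = 1410
SHADING_UNITS = 6912
TMUS = 432
ROPS = 160


def fill_rate_g(units: int, clock_mhz: float) -> float:
    """Giga-results per second, assuming one result per unit per clock."""
    return units * clock_mhz / 1000


def fp32_tflops(shading_units: int, clock_mhz: float) -> float:
    """FP32 TFLOPS, assuming one FMA (2 FLOPs) per shading unit per clock."""
    return shading_units * 2 * clock_mhz / 1e6


pixel_rate = fill_rate_g(ROPS, BOOST_CLOCK_MHZ)    # GPixel/s
texture_rate = fill_rate_g(TMUS, BOOST_CLOCK_MHZ)  # GTexel/s
fp32 = fp32_tflops(SHADING_UNITS, BOOST_CLOCK_MHZ)
fp16 = fp32 * 4  # 4:1 ratio relative to FP32, as listed
fp64 = fp32 / 2  # 1:2 ratio relative to FP32, as listed

print(f"Pixel rate:   {pixel_rate:.1f} GPixel/s")
print(f"Texture rate: {texture_rate:.1f} GTexel/s")
print(f"FP16: {fp16:.2f} TFLOPS, FP32: {fp32:.2f} TFLOPS, FP64: {fp64:.3f} TFLOPS")
```

These are peak rates at the boost clock; sustained throughput in real workloads depends on clocks, memory behavior, and the instruction mix.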
Board Design
- Slot Width
- IGP
- TDP
- 400 W
- Suggested PSU
- 800 W
- Outputs
- No outputs
- Power Connectors
- None
Graphics Features
- DirectX
- N/A
- OpenGL
- N/A
- OpenCL
- 3.0
- Vulkan
- N/A
- CUDA (compute capability)
- 8.0
- Shader Model
- N/A