Report an Error

NVIDIA A40 PCIe

Graphics Processor
GA102
Cores
10752
TMUs
336
ROPs
112
Memory Size
48 GB
Memory Type
GDDR6
Bus Width
384 bit
GPU Chip
GPU
Rear
Rear
The A40 PCIe is a professional graphics card by NVIDIA, launched on October 5th, 2020. Built on the 8 nm process, and based on the GA102 graphics processor, the card supports DirectX 12 Ultimate. The GA102 graphics processor is a large chip with a die area of 628 mm² and 28,300 million transistors. It features 10752 shading units, 336 texture mapping units, and 112 ROPs. Also included are 336 tensor cores which help improve the speed of machine learning applications. The card also has 84 raytracing acceleration cores. NVIDIA has paired 48 GB GDDR6 memory with the A40 PCIe, which are connected using a 384-bit memory interface. The GPU is operating at a frequency of 1305 MHz, which can be boosted up to 1740 MHz, memory is running at 1812 MHz (14.5 Gbps effective).
Being a dual-slot card, the NVIDIA A40 PCIe draws power from an 8-pin EPS power connector, with power draw rated at 300 W maximum. Display outputs include: 3x DisplayPort 1.4a. A40 PCIe is connected to the rest of the system using a PCI-Express 4.0 x16 interface. The card measures 267 mm in length, 111 mm in width, and features a dual-slot cooling solution.

Graphics Processor

GPU Name
GA102
Architecture
Ampere
Foundry
Samsung
Process Size
8 nm
Transistors
28,300 million
Density
45.1M / mm²
Die Size
628 mm²
Chip Package
BGA-3328

Graphics Card

Release Date
Oct 5th, 2020
Availability
2021
Generation
Tesla Ampere
(Axx)
Predecessor
Tesla Turing
Successor
Tesla Ada
Production
Active
Bus Interface
PCIe 4.0 x16

Relative Performance

GeForce 210
0%
GeForce 9400 GT
1%
Radeon HD 4550
1%
Radeon HD 5450
1%
Radeon HD 6450
1%
GeForce GT 520
1%
GeForce GT 220
1%
GeForce GT 430
2%
Radeon HD 5570
2%
Radeon HD 4670
2%
GeForce GT 440
2%
GeForce GT 240
2%
GeForce 9600 GT
3%
Radeon HD 5670
3%
GeForce GT 640
3%
GeForce 9800 GT
3%
Radeon HD 6670
3%
Radeon HD 4830
3%
Radeon HD 4770
3%
Radeon HD 4850
4%
GeForce GTS 250
4%
GeForce GTS 450
4%
Radeon HD 5750
4%
Radeon HD 7750
4%
Radeon HD 5770
5%
GeForce GTX 260
5%
Radeon HD 4870
5%
Radeon Vega 8
5%
GeForce GTX 550 Ti
5%
GeForce GTX 650
5%
Radeon HD 5830
5%
Radeon HD 6790
5%
Radeon HD 4890
5%
GeForce GTX 460
6%
Radeon RX Vega 11
6%
GeForce GT 1030
6%
GeForce GTX 275
6%
Radeon HD 7770 GHz Edition
6%
GeForce GTX 280
6%
GeForce GTX 465
6%
Radeon HD 6850
6%
GeForce GTX 285
6%
Radeon HD 5850
7%
GeForce GTX 650 Ti
7%
Radeon HD 7790
7%
Radeon RX 550
7%
GeForce GTX 470
7%
Radeon HD 6870
7%
Radeon HD 5870
8%
GeForce GTX 560 Ti
8%
Radeon HD 4870 X2
8%
GeForce GTX 750 Ti
8%
Radeon HD 6950
8%
GeForce GTX 295
9%
GeForce GTX 650 Ti Boost
9%
Radeon HD 7850
9%
GeForce GTX 480
9%
Radeon HD 6970
9%
Radeon R7 265
10%
Radeon RX 460
10%
Radeon R7 370
10%
GeForce GTX 660
10%
GeForce GTX 570
10%
Radeon RX 560
10%
GeForce GTX 950
11%
Radeon HD 7870 GHz Edition
11%
Radeon R9 270X
11%
GeForce GTX 660 Ti
11%
GeForce GTX 580
11%
GeForce GTX 1050
11%
Radeon HD 7950
11%
GeForce GTX 1630
12%
Radeon HD 5970
12%
GeForce GTX 760
12%
GeForce GTX 670
13%
GeForce GTX 960
13%
Radeon R9 380
13%
Radeon R9 285
13%
Radeon HD 7970
14%
GeForce GTX 680
14%
GeForce GTX 1050 Ti
14%
GeForce GTX 770
14%
Radeon HD 6990
15%
Radeon R9 280X
15%
GeForce GTX 590
15%
Radeon HD 7970 GHz Edition
15%
GeForce GTX 780
17%
Arc A380
17%
Radeon RX 6400
17%
GeForce GTX 1650
18%
Radeon RX 470
19%
Radeon R9 290
19%
Radeon RX 570
20%
Radeon R9 390
20%
Radeon R9 290X
20%
GeForce GTX 970
21%
GeForce GTX TITAN
21%
GeForce GTX 780 Ti
21%
Radeon R9 390X
22%
Radeon HD 7990
22%
Radeon RX 480
22%
GeForce GTX 690
22%
GeForce GTX 1060 6 GB
22%
Radeon RX 5500 OEM
23%
Radeon RX 580
23%
Radeon RX 5500 XT
23%
Radeon RX 6500 XT
23%
GeForce GTX 980
24%
GeForce GTX 1650 SUPER
24%
Radeon R9 FURY
24%
Radeon RX 590
25%
Radeon R9 295X2
26%
GeForce GTX 1660
26%
Radeon R9 FURY X
27%
GeForce GTX 980 Ti
27%
GeForce GTX TITAN X
28%
GeForce GTX 1660 SUPER
29%
GeForce GTX 1070
30%
GeForce GTX 1660 Ti
30%
GeForce RTX 3050 8 GB
31%
Radeon RX Vega 56
32%
GeForce GTX 1070 Ti
34%
Radeon RX 5600 XT
35%
Radeon RX Vega 64
35%
GeForce GTX 1080
36%
GeForce RTX 2060
36%
Radeon RX 5700
37%
Arc A580
37%
Radeon RX 6600
39%
GeForce RTX 2060 SUPER
40%
GeForce RTX 2070
41%
Radeon RX 5700 XT
41%
GeForce RTX 3060 12 GB
41%
Arc A750
43%
Radeon VII
43%
TITAN X Pascal
44%
Radeon RX 6600 XT
45%
GeForce GTX 1080 Ti
45%
Arc A770
46%
GeForce RTX 2070 SUPER
46%
Radeon RX 6650 XT
48%
Radeon RX 7600
49%
GeForce RTX 4060
49%
GeForce RTX 2080
49%
Radeon RX 7600 XT
50%
GeForce RTX 2080 SUPER
52%
GeForce RTX 3060 Ti
53%
Radeon RX 6700 XT
55%
Radeon RX 6750 XT
58%
GeForce RTX 4060 Ti 8 GB
59%
GeForce RTX 2080 Ti
59%
GeForce RTX 3070
62%
GeForce RTX 3070 Ti
66%
Radeon RX 7700 XT
67%
Radeon RX 6800
68%
GeForce RTX 4070
76%
Radeon RX 6800 XT
78%
GeForce RTX 3080
81%
Radeon RX 7800 XT
82%
Radeon RX 6900 XT
84%
GeForce RTX 4070 SUPER
88%
Radeon RX 7900 GRE
90%
GeForce RTX 3080 Ti
90%
Radeon RX 6950 XT
91%
GeForce RTX 3090
92%
GeForce RTX 4070 Ti
94%
A40 PCIe
100%
GeForce RTX 4070 Ti SUPER
103%
Radeon RX 7900 XT
106%
GeForce RTX 3090 Ti
107%
GeForce RTX 4080
120%
GeForce RTX 4080 SUPER
122%
Radeon RX 7900 XTX
123%
GeForce RTX 4090
151%
Based on TPU review data: "Performance Summary" at 1920x1080, 4K for 2080 Ti and faster.
Performance estimated based on architecture, shader count and clocks.

Clock Speeds

Base Clock
1305 MHz
Boost Clock
1740 MHz
Memory Clock
1812 MHz
14.5 Gbps effective

Memory

Memory Size
48 GB
Memory Type
GDDR6
Memory Bus
384 bit
Bandwidth
695.8 GB/s

Render Config

Shading Units
10752
TMUs
336
ROPs
112
SM Count
84
Tensor Cores
336
RT Cores
84
L1 Cache
128 KB (per SM)
L2 Cache
6 MB

Theoretical Performance

Pixel Rate
194.9 GPixel/s
Texture Rate
584.6 GTexel/s
FP16 (half)
37.42 TFLOPS (1:1)
FP32 (float)
37.42 TFLOPS
FP64 (double)
584.6 GFLOPS (1:64)

Board Design

Slot Width
Dual-slot
Length
267 mm
10.5 inches
Width
111 mm
4.4 inches
TDP
300 W
Suggested PSU
700 W
Outputs
3x DisplayPort 1.4a
Power Connectors
8-pin EPS
Board Number
PG133 SKU 200

Graphics Features

DirectX
12 Ultimate (12_2)
OpenGL
4.6
OpenCL
3.0
Vulkan
1.3
CUDA
8.6
Shader Model
6.7

GA102 GPU Notes

Ray Tracing Cores: 2nd Gen
Tensor Cores: 3rd Gen
NVENC: 7th Gen
NVDEC: 5th Gen
PureVideo HD: VP11
VDPAU: Feature Set K

Retail boards based on this design (4)

Name GPU Clock Boost Clock Memory Clock Other Changes
1305 MHz 1740 MHz 1812 MHz 24 GB
1305 MHz 1740 MHz 1812 MHz 2 GB
1305 MHz 1740 MHz 1812 MHz 4 GB
1305 MHz 1740 MHz 1812 MHz 6 GB
Apr 24th, 2024 01:41 EDT change timezone

New Forum Posts

Popular Reviews

Controversial News Posts