
NVIDIA AD102 "Ada" Packs Over 75 Billion Transistors

btarunr

Editor & Senior Moderator
NVIDIA's next-generation AD102 "Ada" GPU is shaping up to be a monstrosity, with a rumored transistor count north of 75 billion. That would be over 2.6 times the 28.3 billion transistors of the current-generation GA102 silicon. NVIDIA is reportedly building the AD102 on the TSMC N5 (5 nm EUV) node, which offers a significant transistor-density uplift over the Samsung 8LPP (8 nm DUV) node on which the GA102 is built: 8LPP offers 44.56 million transistors per mm² of die area (MTr/mm²), while N5 offers a whopping 134 MTr/mm², which fits the transistor-count gain. At that density, the die area would land in the neighborhood of 560 mm². The AD102 is expected to power high-end RTX 40-series SKUs in the RTX 4090 and RTX 4080 series.
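As a sanity check on the article's figures, the implied die area follows from simple division. A napkin-math sketch in Python, using only the rumored numbers quoted above:

```python
# Napkin math on the rumored AD102 figures quoted in the article.
ga102_transistors = 28.3e9   # current-gen GA102, Samsung 8LPP
ad102_transistors = 75e9     # rumored AD102, TSMC N5
n5_density = 134e6           # quoted N5 density, transistors per mm²

scaling = ad102_transistors / ga102_transistors
die_area_mm2 = ad102_transistors / n5_density

print(f"transistor scaling: {scaling:.2f}x")        # ~2.65x
print(f"implied die area: {die_area_mm2:.0f} mm²")  # ~560 mm²
```

Treat this as a ballpark only: 134 MTr/mm² is a peak-density figure, and a real GPU mixes logic, SRAM, and analog at very different densities.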



View at TechPowerUp Main Site | Source
 
IF that's true, along with the other very recent rumors, perhaps the 4090 really will be ~2x 3090/Ti-ish
 
Does anyone have an idea of how many transistors are used/wasted on the tensor cores?
 
Does anyone have an idea of how many transistors are used/wasted on the tensor cores?
8-10% of die area for RT + Tensor cores in a Turing die, as per this sort of napkin-math assessment.

Not sure about Ampere and Ada, but I'd happily take ~10% die space for RT acceleration and Tensor cores over a 10% increase in raster performance.
 
IF that's true, along with the other very recent rumors, perhaps the 4090 really will be ~2x 3090/Ti-ish
It's the same AD102 die, hence the transistors are there; it doesn't mean all of them are active. The 4090 at most will be 2x the 3090. I checked Chiphell just moments ago; the rumors put the 4090 at about 21k in Time Spy Extreme, and the 4090 Ti at about 24.5k TSE at stock.
 
I suspect these will have a price to reflect that huge die size. Yields on 5nm won’t be great this early on, surely
 
Meanwhile, AMD's largest core will be in the region of 350 mm² or so, smaller than the last-gen 6950 XT core. Can't recall the transistor count.

Given a 7600 XT is said to beat a 6950 XT, I easily believe the 7900 XT will be 2x the 6900 XT, and that's what the inside information has been saying all along. So the 4090 Ti should similarly be 2x the 3090 Ti, again in the ballpark of the leaks.
 
I am sorry, but I can't stop laughing at the shape of those

Slipper GIF
 
I am sorry, but I can't stop laughing at the shape of those

Slipper GIF

Who said cutting edge was better than rounding edge????? :roll:
 
@btarunr
The figures for million transistors per mm² of die area (MTr/mm²) that you are using are not comparable, and the resulting comparison is just wrong.
If I understood correctly, you took something like a GA102 result for 8LPP (44.56 MTr/mm²) and tried to compare it with something like an N5 Apple A14 SoC (134 MTr/mm²).
You will get closer results if you calculate separately for logic/SRAM/analog etc., based on foundry tech sites like WikiChip for example. (Their numbers differ slightly from official TSMC claims: e.g., TSMC's N10-vs-N16 logic density scaling claim is 2X while WikiChip gives 1.82X, and TSMC's N5-vs-N7 claim is 1.84X while WikiChip gives 1.87X, but in that case TSMC compared a whole CPU block.)
According to WikiChip, Samsung's 8LPP is around 17% denser than TSMC's N10 regarding logic, and if you compare Apple's 10 nm and 5 nm SoCs, the actual scaling is just 2.73X!
Logic scales very differently from caches/analog (e.g., N5-vs-N7 logic scaling is 1.84X, but SRAM only 1.35X and analog only 1.2X!).
So if you take two completely different designs, the compared results will be completely wrong.

Anyway, if the 75b+ figure is true, this means at least 45b+ for AD103.
If the 96 MB cache implementation is similar to AMD's Infinity Cache and NVIDIA uses 6T SRAM, for example, the transistor count is inconsequential (4.6b transistors plus redundancy/overhead).
So comparing the two 7-GPC designs (AD103 vs GA102, both with 10,752 CUDA cores), the transistor increase per GPC is just insane. I wonder what extra features Ada will implement and at what DX feature level it will end up in the future.

TSMC logic density scaling by WikiChip:
tsmc-5nm-density-q1-2020.png


Apple SOC density scaling example:
SoC | Process | Transistors | Die size | Density
A11 | N10 | 4.3b | 87.66 mm² | 49 MTr/mm²
A14 | N5 | 11.8b | 88 mm² | 134 MTr/mm²
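The post's own figures can be checked with the same kind of napkin math. A Python sketch; the decimal-megabyte reading of "96 MB" is my assumption, since it reproduces the ~4.6b figure:

```python
# Apple SoC density scaling from the table above, N10 -> N5.
a11_density = 4.3e9 / 87.66    # A11 on N10, transistors per mm²
a14_density = 11.8e9 / 88.0    # A14 on N5
print(f"A11: {a11_density / 1e6:.0f} MTr/mm²")              # ~49
print(f"A14: {a14_density / 1e6:.0f} MTr/mm²")              # ~134
print(f"actual scaling: {a14_density / a11_density:.2f}x")  # ~2.73x

# Transistor budget of a 96 MB cache built from 6T SRAM cells
# (array cells only; tags, redundancy and peripherals come on top).
cache_bits = 96e6 * 8              # decimal MB, matching the ~4.6b claim
sram_transistors = cache_bits * 6  # six transistors per bit cell
print(f"6T SRAM array: {sram_transistors / 1e9:.2f}b transistors")  # ~4.61b
```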
 
It still has fewer transistors than Apple's inefficient aberration.
 
I'd happily take ~10% die space for RT acceleration and Tensor cores over a 10% increase in raster performance.
I would too, but not at this time, because none of the games I regularly play support RT or DLSS. So to me these transistors are useless at the moment.
At some point in the future, when RT performance is actually good and many more games support RT and DLSS, then sure.
I'm running a 2080 Ti right now at 1440p 165 Hz.
 
@btarunr
[…] If the 96MB cache implementation is similar to AMD's infinity cache and Nvidia uses 6T SRAM for example the transistor count is inconsequential (4.6b transistors+redundancy/ overhead) […]
They must be using the high density library cells for the cache and high performance cells for the cores
 
IF that's true, along with the other very recent rumors, perhaps the 4090 really will be ~2x 3090/Ti-ish
The 4090 is rumoured to be using AD103 though; perhaps the Ti gets AD102.

Oh noes I r wrong it is 102.
 
They must be using the high density library cells for the cache and high performance cells for the cores
Who knows; I wonder what die size it will have.
The previous rumor was around 600 mm², which seems difficult since regular N4 is only around 6% denser than N5; maybe we get a customized node like in Turing's case (TSMC 12 nm "FFN").
 
I would too, but not at this time, because none of the games I regularly play support RT or DLSS. So to me these transistors are useless at the moment.
At some point in the future, when RT performance is actually good and many more games support RT and DLSS, then sure.
I'm running a 2080 Ti right now at 1440p 165 Hz.

How does the RT acceleration in the 2080 compare now, and who would buy a 2080 over a new card today? New features on new cards need to work within a couple of years, or they become old features that newer cards have better versions of.

I say this after watching it happen numerous times from both brands.
 
How does the RT acceleration in the 2080 compare now, and who would buy a 2080 over a new card today? New features on new cards need to work within a couple of years, or they become old features that newer cards have better versions of.

I say this after watching it happen numerous times from both brands.
I bought this card for 700 before the mining craze, in January 2021, and I bought it for raster performance, not RT or DLSS.
 
10% is quite a lot of wasted transistors.
I suppose they are, if you personally consider them to be a waste; I do not. I've used that 10% of die area for the majority of my ownership of an RTX GPU.

Bear in mind too that, IIRC, of that 10% die area roughly 2/3 is RT and 1/3 is Tensor, so ~3% die space for Tensor alone. An easy yes for me, and indeed for many people like me.
But I can understand nVidia's goal of accelerating real work rather than gaming.
Well, I've used that die area for hundreds of hours of RT/DLSS-enabled gaming, so I'd say they are definitely gaming features too.
 
So the power-draw rumours could be real. Imagine that many gates opening and closing in sequence. Aren't the transistors 5 nm too, which means a ridiculous amount of heat to dissipate from a small area? No wonder those look like 4-slot coolers with 120 mm fans. Good luck using a single rad with one of these puppies water-cooled.
 
So the power-draw rumours could be real. Imagine that many gates opening and closing in sequence. Aren't the transistors 5 nm too, which means a ridiculous amount of heat to dissipate from a small area? No wonder those look like 4-slot coolers with 120 mm fans. Good luck using a single rad with one of these puppies water-cooled.
Yep. You don't just triple the number of transistors nowadays without nearly tripling the power.

People are thinking that this chip will be amazingly fast based solely upon the number of transistors. It won't come close to that because it will be severely power limited.
 