AMD CDNA3 Architecture Sees the Inevitable Fusion of Compute Units and x86 CPU at Massive Scale

btarunr · Jun 10, 2022

AMD in its 2022 Financial Analyst Day presentation unveiled its next-generation CDNA3 compute architecture, which will see something we've been expecting for a while—a compute accelerator that has a large number of compute units for scalar processing, and a large number of x86-64 CPU cores based on some future "Zen" microarchitecture, onto a single package. The presence of CPU cores on the package would eliminate the need for the system to have an EPYC or Xeon processor at its head, and clusters of Instinct CDNA3 processors could run themselves without the need for a CPU and its system memory.

The Instinct CDNA3 processor will feature an advanced packaging technology that brings various IP blocks together as chiplets, each based on a node most economical to it, without compromising on its function. The package features stacked HBM memory, and this memory is shared not just by the compute units and x86 cores, but also forms part of large shared memory pools accessible across packages. 4th Generation Infinity Fabric ties it all together.

AMD is claiming a 500% (or 5 times) AI compute performance/Watt uplift over CDNA2, thanks to the combination of 5 nm processor for the compute dies, an advanced 3D chiplet packaging technology, 4th Gen Infinity Fabric, new math computing formats, Infinity Cache on the compute dies, and a unified memory architecture. The company is working toward a 2023 debut of CDNA3.

View at TechPowerUp Main Site

Mathragh · Jun 10, 2022

"The Future Is Fusion" finally coming to complete fruition!

R0H1T · Jun 10, 2022

Only a decade late!

Mathragh · Jun 10, 2022

R0H1T said:
Only a decade late!

Well, better late than never

And certainly better than not having tried at all.

Anc13ntEvil · Jun 10, 2022

"AMD is claiming a 500% (or 5 times) AI compute performance/Watt uplift over CDNA2"

I don't understand why so many people don't understand how percentages compared to multiplication works...

500% is SIX TIMES, not five. If you have $10 and I increase it by 100%, you would then have $20. 100% is double, 200% is triple, and so on, yet I constantly see people get this simple math wrong. Sorry not sorry but this annoys the crap out of me.

It also bugs the crap out of me when companies list something like "1.3 times the speed" (or power).
30%...just say 30%.

DeathtoGnomes · Jun 10, 2022

Anc13ntEvil said:
"AMD is claiming a 500% (or 5 times) AI compute performance/Watt uplift over CDNA2"

I don't understand why so many people don't understand how percentages compared to multiplication works...

500% is SIX TIMES, not five. If you have $10 and I increase it by 100%, you would then have $20. 100% is double, 200% is triple, and so on, yet I constantly see people get this simple math wrong. Sorry not sorry but this annoys the crap out of me.

It also bugs the crap out of me when companies list something like "1.3 times the speed" (or power).
30%...just say 30%.

All compute starts at 0 not 1, its pretty binary.

InVasMani · Jun 10, 2022

Me from two days ago..."If AMD had a 5600X with larger 3D stacked cache or also paired APU chiplet that didn't castrate PCIE lanes in the process it would be pretty popular for a budget AM4 build. In fact the larger cache could be on the APU and if the chip could utilize the cache between either with maybe a slight latency penalty worst case scenario in doing so it would be really nice. Both CPU/GPU would have a set amount of cache that's equal and at the same time can tapped into by the other in a pinch with only a bit more added latency when and where needed."

Seems AMD thought similar, but I like that they took it steps further with HBM and Zen 4 is of course a step up for such a APU designed in tandem with 3D stacked cache. It seems they went with the pooled and shared unified memory as well! I'm not sure what the new "MATHS" is, but suspect it's referring to AVX related and/or FP precision stuff bit o this bit o that!

R-T-B · Jun 10, 2022

Anc13ntEvil said:
"AMD is claiming a 500% (or 5 times) AI compute performance/Watt uplift over CDNA2"

I don't understand why so many people don't understand how percentages compared to multiplication works...

500% is SIX TIMES, not five. If you have $10 and I increase it by 100%, you would then have $20. 100% is double, 200% is triple, and so on, yet I constantly see people get this simple math wrong. Sorry not sorry but this annoys the crap out of me.

It also bugs the crap out of me when companies list something like "1.3 times the speed" (or power).
30%...just say 30%.

That's not how "uplift" works.

A 100% uplift would be double the performance.

100% of the performance would indeed be the same.

eidairaman1 · Jun 11, 2022

Mathragh said:
"The Future Is Fusion" finally coming to complete fruition!

SoCA, blade servers/workstations

DeathtoGnomes said:
All compute starts at 0 not 1, its pretty binary.

MACHINE CODE DUNANANA

Minus Infinity · Jun 11, 2022

R-T-B said:
That's not how "uplift" works.

A 100% uplift would be double the performance.

100% of the performance would indeed be the same.

Yes the subtlety of one single word and you are indeed correct.

ModEl4 · Jun 11, 2022

I wonder if the work that it's done on the CDNA3 unified memory architecture will bring some benefits for future Zen5 based APUs on the memory controller/ cache/ V-cache system integration or everything is sorted out in the APU space with HUMA later iterations.

Imaamdfanboy · Jun 12, 2022

Nope...500% is indeed 5 times more.
If you have two apples for comparison they are both equall.Lets says they are both exactly same size then they are 1 to 1.This is why they do the 1.5x or maybe 2.3x explanation these days.The one's are equalls so the points after is the presentation.So 1.5 will then be 50 present faster or if you like half more times faster than one.If you use presentation it not like normal maths witch comes down to 1*1 = 1.It excaly 0.
So 2.3 times wil hê 130% môre which relates to 1.3 times more powerful.

System Name	RBMK-1000
Processor	AMD Ryzen 7 5700G
Motherboard	Gigabyte B550 AORUS Elite V2
Cooling	DeepCool Gammax L240 V2
Memory	2x 16GB DDR4-3200
Video Card(s)	Galax RTX 4070 Ti EX
Storage	Samsung 990 1TB
Display(s)	BenQ 1440p 60 Hz 27-inch
Case	Corsair Carbide 100R
Audio Device(s)	ASUS SupremeFX S1220A
Power Supply	Cooler Master MWE Gold 650W
Mouse	ASUS ROG Strix Impact
Keyboard	Gamdias Hermes E2
Software	Windows 11 Pro

System Name	PC \|\|Zephyrus G14 2023
Processor	Ryzen 9 5900x \|\| R9 7940HS
Motherboard	MAG B550M MORTAR WIFI \|\|
Cooling	1x Corsair XR5 360mm Rad\|\|
Memory	2x16GB HyperX 3600 @ 3800 \|\| 2x16GB DDR5 @ 4800MTs
Video Card(s)	MSI RTX 2080Ti Sea Hawk EK X \|\| RTX 4060
Storage	Samsung 9801TB x2 + Striped Tiered Storage Space (2x 128Gb SSD + 2x 1TB HDD) \|\| 1TB NVME
Display(s)	Iiyama PL2770QS + Samsung U28E590, \|\| 14' 2560x1600 165Hz IPS
Case	SilverStone Alta G1M \|\|
Power Supply	Cooler Master V850 SFX \|\| 240W
Mouse	ROG Pugio II
Software	Win 11 64bit \|\| Win 11 64bit

System Name	PC \|\|Zephyrus G14 2023
Processor	Ryzen 9 5900x \|\| R9 7940HS
Motherboard	MAG B550M MORTAR WIFI \|\|
Cooling	1x Corsair XR5 360mm Rad\|\|
Memory	2x16GB HyperX 3600 @ 3800 \|\| 2x16GB DDR5 @ 4800MTs
Video Card(s)	MSI RTX 2080Ti Sea Hawk EK X \|\| RTX 4060
Storage	Samsung 9801TB x2 + Striped Tiered Storage Space (2x 128Gb SSD + 2x 1TB HDD) \|\| 1TB NVME
Display(s)	Iiyama PL2770QS + Samsung U28E590, \|\| 14' 2560x1600 165Hz IPS
Case	SilverStone Alta G1M \|\|
Power Supply	Cooler Master V850 SFX \|\| 240W
Mouse	ROG Pugio II
Software	Win 11 64bit \|\| Win 11 64bit

System Name	Dumbass
Processor	AMD Ryzen 7800X3D
Motherboard	ASUS TUF gaming B650
Cooling	Artic Liquid Freezer 2 - 420mm
Memory	G.Skill Sniper 32gb DDR5 6000
Video Card(s)	GreenTeam 4070 ti super 16gb
Storage	Samsung EVO 500gb & 1Tb, 2tb HDD, 500gb WD Black
Display(s)	1x Nixeus NX_EDG27, 2x Dell S2440L (16:9)
Case	Phanteks Enthoo Primo w/8 140mm SP Fans
Audio Device(s)	onboard (realtek?) - SPKRS:Logitech Z623 200w 2.1
Power Supply	Corsair HX1000i
Mouse	Steeseries Esports Wireless
Keyboard	Corsair K100
Software	windows 10 H
Benchmark Scores	https://i.imgur.com/aoz3vWY.jpg?2

System Name	Pioneer
Processor	Ryzen 9 9950X
Motherboard	MSI MAG X670E Tomahawk Wifi
Cooling	Noctua NH-D15 + A whole lotta Sunon, Phanteks and Corsair Maglev blower fans...
Memory	128GB (4x 32GB) G.Skill Flare X5 @ DDR5-4200(Running 1:1:1 w/FCLK)
Video Card(s)	XFX RX 7900 XTX Speedster Merc 310
Storage	Intel 5800X Optane 800GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs, 1x 2TB Seagate Exos 3.5"
Display(s)	55" LG 55" B9 OLED 4K Display
Case	Thermaltake Core X31
Audio Device(s)	TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply	FSP Hydro Ti Pro 850W
Mouse	Logitech G305 Lightspeed Wireless
Keyboard	WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software	Gentoo Linux x64, other office machines run Windows 11 Enterprise

AMD CDNA3 Architecture Sees the Inevitable Fusion of Compute Units and x86 CPU at Massive Scale

btarunr

Editor & Senior Moderator

Mathragh

R0H1T

Mathragh

Anc13ntEvil

New Member

DeathtoGnomes

InVasMani

R-T-B

eidairaman1

The Exiled Airman

Minus Infinity

ModEl4

Imaamdfanboy

New Member

System Name	PCGOD
Processor	AMD FX 8350@ 5.0GHz
Motherboard	Asus TUF 990FX Sabertooth R2 2901 Bios
Cooling	Scythe Ashura, 2×BitFenix 230mm Spectre Pro LED (Blue,Green), 2x BitFenix 140mm Spectre Pro LED
Memory	16 GB Gskill Ripjaws X 2133 (2400 OC, 10-10-12-20-20, 1T, 1.65V)
Video Card(s)	AMD Radeon 290 Sapphire Vapor-X
Storage	Samsung 840 Pro 256GB, WD Velociraptor 1TB
Display(s)	NEC Multisync LCD 1700V (Display Port Adapter)
Case	AeroCool Xpredator Evil Blue Edition
Audio Device(s)	Creative Labs Sound Blaster ZxR
Power Supply	Seasonic 1250 XM2 Series (XP3)
Mouse	Roccat Kone XTD
Keyboard	Roccat Ryos MK Pro
Software	Windows 7 Pro 64