
GPU IPC Showdown: NVIDIA Blackwell vs Ada Lovelace; AMD RDNA 4 vs RDNA 3

Sure, yep, I get the sarcasm, but the attitudes of some of the Nvidia users have gotten so extreme lately that I don't want them taking anything too seriously. We regular PC enthusiasts know the capabilities of the different hardware over time, but even though most of us are getting info from the same sources, the Nvidia enthusiasm is, how shall we say, distorting reality just a bit. :)
Screw what other people do or feel, man. You/we aren't gonna change any of it. Humans be human ;)

Convincing is a simple matter of proving the point. Every time some shit occurs, like a melting 12V pin, planned obsolescence, price manipulation, etc. Or just a better product - and that's where we hit the core of the issue, don't we...
 
Quoted for emphasis. I do this every generation to see if anything has moved the price/performance/power metrics and this matches what I've figured using the data here from the TPU charts.

The 40% 7600XT --> 9060 XT IPC improvement with the same core count and similar memory shows that something was terribly broken in RDNA3 and AMD did say that RDNA4 was a bugfix.

I wanna know what that damn bug was!

Edit: My handwavy guess is that the doubled-throughput FP32 units added in RDNA 3 didn't work properly, which is why the 7600 was only marginally faster than the 6650 XT with the old cores. And that got fixed.
At least some of that 40% increase was the 15% increase in clock speed. I looked at the latest TPU Sapphire 9060XT review and the TPU Sapphire 7600XT review. The clock speed is 15% higher on the 9060XT.
 
At least some of that 40% increase was the 15% increase in clock speed. I looked at the latest TPU Sapphire 9060XT review and the TPU Sapphire 7600XT review. The clock speed is 15% higher on the 9060XT.

That leaves about 21% for the IPC increase (1.40 / 1.15 ≈ 1.22), which is more modest but still appreciable. I'm still quite interested in that bugfix!
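A minimal sketch of that arithmetic, using the rough figures quoted in this thread rather than exact TPU numbers: the clock contribution divides out of the total uplift, leaving the per-clock (IPC) gain.

```python
# Hedged sketch: separating the clock-speed gain from the per-clock (IPC) gain.
# The 40% and 15% figures are the rough numbers quoted above, not re-measured data.
total_gain = 1.40   # 9060 XT vs 7600 XT overall uplift (~40%)
clock_gain = 1.15   # ~15% higher clocks per the Sapphire reviews

# Performance scales roughly as clocks x work-per-clock, so divide out the clock factor.
ipc_gain = total_gain / clock_gain
print(f"Implied per-clock (IPC) uplift: {(ipc_gain - 1) * 100:.1f}%")  # ~21.7%
```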
 
I expect UDNA to have similar shifts over RDNA4 per CU:

20% general raster
30% RT
100% path tracing

Some of this has already been rumored.


The above combined with a doubling of CUs from 64 to 128 would result in a dominating GPU product.
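To put rough numbers on that compounding, here is a hedged sketch; real cards rarely scale linearly with CU count because of bandwidth, power, and front-end limits, so treat these as optimistic upper bounds, not predictions.

```python
# Hypothetical per-CU gains from the list above, combined with the rumored CU doubling.
# These are upper bounds under perfect scaling, not predictions.
per_cu_gain = {"raster": 1.20, "ray tracing": 1.30, "path tracing": 2.00}
cu_scaling = 128 / 64  # rumored doubling of compute units

for workload, gain in per_cu_gain.items():
    print(f"{workload}: up to {gain * cu_scaling:.1f}x over a 64 CU RDNA 4 part")
```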
Kepler said he was just making stuff up. It’s hilarious to see Twitter posts being taken as some kind of evidence.
 
That alone proves that Nvidia doesn't care about gaming any more, only AI. OTOH, AMD made a great jump in performance and efficiency, closing in on Nvidia more than anyone anticipated, since Nvidia stood still. Arrogance almost always bites you back.
 
Not only is there no performance progress from Ada to Blackwell, there is no energy efficiency progress either. Blackwell is just a bigger 40xx with messed-up drivers.
And no 32-bit PhysX …
 
It'd be interesting to see a similar test done for compute workloads, instead of games.
 
To make sure you are not making stuff up, can you provide the Twitter post showing that he said that?

Why would I make stuff up?
[screenshot attachment]
 
That alone proves that Nvidia doesn't care about gaming any more, only AI. OTOH, AMD made a great jump in performance and efficiency, closing in on Nvidia more than anyone anticipated, since Nvidia stood still. Arrogance almost always bites you back.
NVidia doesn't really have to care in this instance. All AMD did was reach near-parity performance-wise, and it still trails behind in terms of ecosystem and software support. This basically won't affect the market in a meaningful way, and yes, noting that NV focuses their efforts on AI and datacenter is… obvious to anyone sane? Same as AMD developing Zen 5 for the needs of enterprise. That's just business. RDNA 4 is probably the last "gaming oriented" GPU architecture ever; even AMD saw the folly in trying that.

I am actually somewhat curious now whether the measurements in this article are another "Zen 5%" sort of deal and Blackwell is actually significantly faster for the tasks it was designed for. It's obviously almost impossible to verify, since that would require getting one's hands on enterprise-level accelerators, but still, it would be interesting.
 
There is a surprising lack of decent data for this. Ironically, there are a ton of AI-generated articles using Nvidia marketing material, but not a single one with something of value.


Actually...

Not a large leap in a (limited, admittedly) suite of AI/ML/Pro workloads over RTX40 either.

Will this improve with drivers or support? Remains to be seen.
 
Not a large leap in a (limited, admittedly) suite of AI/ML/Pro workloads over RTX40 either.
Wai, wha? In AI and ML workloads the 5060 Ti is, by those very tests, 20-40% faster than the 4060 Ti with very nearly the same core count. That's a very significant leap. Blender and others are understandably similar; those aren't AI workloads. So it just points again to the fact that Blackwell was VERY AI/ML optimized.
 
+20% means being on par with the 5080 two years later
That's the 64 CU version. But who says AMD won't introduce 96 CU or even 128 CU versions with UDNA?
Yep, I created a topic about RDNA3 vs. RDNA4 IPC here on the TPU forum.
My calculations for Ada vs. Blackwell were accurate, after all.
All of Blackwell's performance improvements come from increasing the compute unit count (die scaling), which translates into increased power draw.
So much for those who believed that the RTX 5080, with two-thirds of the RTX 4090's compute units, would beat the RTX 4090.

[attachment 404965]

RX 9070 XT compared to RX 7900 XT shows a massive (+44%) improvement per compute unit: about 20% of it comes from higher clocks, and the rest from architectural changes. We can rule out memory bandwidth being in favor of the RX 9070 XT here, as it has significantly lower memory throughput than the RX 7900 XT (644 vs. 800 GB/s). An RX 9070 XT with RX 7900 XT memory bandwidth would be even faster.
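A quick, hedged sketch of how that per-CU figure breaks down and what it implies at the card level. The CU counts (64 vs. 84) are the official specs; the +44% and +20% are the figures quoted above, not re-measured data.

```python
# Hedged sketch: decompose the quoted +44% per-CU gain and back out the
# implied card-level ratio. Figures are from this thread, not new measurements.
per_cu_gain = 1.44            # 9070 XT vs 7900 XT, per compute unit (quoted above)
clock_gain = 1.20             # ~20% higher clocks
cu_9070xt, cu_7900xt = 64, 84

arch_gain = per_cu_gain / clock_gain              # share not explained by clocks
overall = per_cu_gain * (cu_9070xt / cu_7900xt)   # implied whole-card ratio

print(f"Architectural (per-clock) share: ~{(arch_gain - 1) * 100:.0f}%")        # ~20%
print(f"Implied overall 9070 XT vs 7900 XT lead: ~{(overall - 1) * 100:.0f}%")  # ~10%
```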
With OC my 9070 XT has 730 GB/s (2835 real, 2850 on the slider). Not quite 7900 XT level, but better than the default 644 GB/s.
Also, I said from the start that the 5080 would never reach 4090 performance. Some people here genuinely believed it would.
I wanna know what that damn bug was!

Edit: My handwavy guess is that the doubled-throughput FP32 units added in RDNA 3 didn't work properly, which is why the 7600 was only marginally faster than the 6650 XT with the old cores. And that got fixed.
I think it's pretty clear what the bug was - RDNA3 was the first (and thus far the only) chiplet-based gaming dGPU. Naturally such innovations have growing pains. It never quite reached its true potential; my guess is that was due to chiplet communication issues. Only the 7600 series in that lineup was fully monolithic.
Kepler said he was just making stuff up. It’s hilarious to see Twitter posts being taken as some kind of evidence.
"making stuff up" and "guessing" are not the same. The first one implies malice or lack of knowledge on the subject. The second could be considered an educated guess.
And no 32bit physx …
Also no more hot spot sensor.
 
Wai, wha? In AI and ML workloads the 5060 Ti is, by those very tests, 20-40% faster than the 4060 Ti with very nearly the same core count. That's a very significant leap. Blender and others are understandably similar; those aren't AI workloads. So it just points again to the fact that Blackwell was VERY AI/ML optimized.


At first glance, yes, BUT consider that the 5060 Ti is already faster in gaming than the 4060 Ti, and it draws more power too. You don't even have to normalize to IPC (like this article does) or to power consumption; you already see a far smaller boost, around 10-20%, just by normalizing to the average gaming performance gain over the 4060 Ti (rough math sketched below). But again, maybe there are other performance figures that paint a different picture.

Point is, given how much hype nVidia put into AI while clearly not caring as much about gaming, this is underwhelming.
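A minimal sketch of that normalization, assuming a ballpark gaming uplift; the ~15% value is a placeholder assumption, not a measured figure, so the exact outcome shifts with whichever gaming number you plug in.

```python
# Hedged sketch: how much of the 5060 Ti's AI/ML uplift remains after dividing
# out its general gaming uplift. The gaming figure is a placeholder assumption.
ai_gains = (1.20, 1.40)   # span of AI/ML uplifts vs the 4060 Ti cited in this thread
gaming_gain = 1.15        # assumed average gaming uplift (placeholder, not measured)

for gain in ai_gains:
    beyond_gaming = gain / gaming_gain - 1
    print(f"{gain:.2f}x raw AI uplift -> ~{beyond_gaming * 100:.0f}% beyond the gaming gain")
```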
 
It's pretty good to see AMD stepping up when most of us thought they were going to quit. Really curious to see RDNA5 and what they do to leverage the Xbox partnership.
 
But maybe the 5080 Ti Super will :)
Doubtful. The 5080 Super, as currently speculated, will equal the memory capacity and speed of the 4090 (24GB, ~1TB/s), but will still be a far cry from the 4090's core config.
In order to truly equal the 4090, the 5080 Super/Ti would have to be based on the GB202 used in the RTX Pro 5000 at the very minimum (with 24GB, naturally), and I suspect even that would fall short without a significant clock speed bump. https://www.techpowerup.com/gpu-specs/rtx-pro-5000-blackwell.c4276
 
Go ahead and get hyped for the next two years. I’ve watched AMD fans get aboard the hype train for decades, only to be disappointed when it arrives at the station.
Are Intel and Nvidia fans any different?

Arrow Lake was supposed to be amazing. Much better power efficiency and equaling or surpassing Zen 5. What they got was often worse than 14th gen in many areas.

Or Nvidia. Two years to release what we now know is just a Lovelace refresh. Lovelace itself was not well received despite lofty expectations from moving back to TSMC on a much better node.
 
Actually...

Not a large leap in a (limited, admittedly) suite of AI/ML/Pro workloads over RTX40 either.

Will this improve with drivers or support? Remains to be seen.

Yep, and I'm willing to bet most of that is the new memory.

Wai, wha? In AI and ML workloads the 5060 Ti is, by those very tests, 20-40% faster than the 4060 Ti with very nearly the same core count. That's a very significant leap. Blender and others are understandably similar; those aren't AI workloads. So it just points again to the fact that Blackwell was VERY AI/ML optimized.

Tom's states 15-20% on average. Most of that is going to be the higher TDP and faster memory. At the end of the day, the IPC improvement to the AI cores is very small, if any.
 
Tom's states 15-20% on average.
They do not state that. There are three AI/ML tests, and that figure applies only to Procyon. It's around 40% for MLPerf and around 30% for SPEC.
And these are the consumer cards, which weren't what I was wondering about initially. I would be much more interested to see how Blackwell accelerators perform compared to Ada, though that would be almost impossible to test apples to apples.
 
Sounds like copium to me

An educated guess is typically labeled as such. A nondescript guess is as good as making stuff up.
Kepler_L2 has a pretty good track record. I would not dismiss anything he says just because it's a guess.
Besides, his "guess" is nothing outrageous. It merely implies that AMD will repeat with UDNA the same IPC uplift they already achieved with RDNA4.
 
They do not state that. There are three AI/ML tests, and that figure applies only to Procyon. It's around 40% for MLPerf and around 30% for SPEC.


Yes, they do:

[screenshot attachment]



As stated in the article, MLPerf performance is heavily tied to VRAM size.

SPECworkstation was not 30%; it was 25%:

[screenshot attachment]



My earlier comment stands: most of that uplift is likely the memory, not IPC.

And these are the consumer cards, which weren't what I was wondering about initially. I would be much more interested to see how Blackwell accelerators perform compared to Ada, though that would be almost impossible to test apples to apples.

Whether it's consumer or enterprise is irrelevant when they use the same architecture. IPC will still be the same; the only difference is that enterprise cards will be larger, with better memory.
 