• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Shares New Details on Their 3D V-Cache Tech for Zen 3+

Joined
Jun 10, 2014
Messages
2,889 (0.81/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
Something's bothering me: why did AMD use the prototype @ a fixed 4 GHz VS a regular 5900X also @ a fixed 4GHz, instead of the prototype @ "it's normal speed" VS the 5900X @ it's normal speed?
Yes, and it's a good example of why such estimates are fairly useless until we get proper independent tests.

Additionally, it's important to understand that gaming performance isn't a benchmark of CPU performance, despite the fact the CPU performance being important. The CPU only needs to be fast enough to not bottle neck the GPU, which is why we typically see diminishing returns with higher CPU speeds in gaming, first the FPS gains dry out, beyond that we gain better frame time consistency, until there are no more gains.
With current games, Skylake and Zen 3 already hit this point in most games around ~4.3-4.5 GHz. So even if this new CPU is significantly faster, it might not materialize as increased gaming performance, unless you underclock the CPUs as you mentioned.

Cache also improves IPC in multi threaded applications.
That depends on the instruction scheduler design, don't you think? Single block schedulers can access single ipc better, but multi ipc worse. I'm computer illiterate, sorry if I presented it wrong.
That is not all, hyperthreading and SMT changes instruction scheduler depth, a.k.a. ipc. The computer can do more, but with less single ipc. It looks like a stupid argument, but that is the general case with all excess testing.
Guys, you should refrain from using big words like IPC when you clearly don't understand what it means ;)
IPC means instructions per clock, and is a measure of the average instructions per clock the architecture is capable of. IPC is not performance, but an underlying constant of the architecture which is independent of how many threads or cores which are involved, and is also workload independent. We usually use a single thread and a wide selection of workloads to approximate the relative IPC between different CPUs.
But whenever people mention "multithreaded IPC", "application specific IPC", SMT in relation to IPC, etc. that's a dead giveaway that they are clueless about what IPC really is. They are in fact talking about performance, not IPC.

From everything I've heard whatever was going to be Zen 3+ was cancelled, probably because of logistic issues caused by the pandemic, the lead they already have with Zen 3, and seeing just how well just the integration of this stacked cache turned out.
Do we have any actual evidence of the existence of "Zen 3+"?
Could it be that it was invented by the "leakers"?
 
Joined
Oct 15, 2019
Messages
549 (0.34/day)
Guys, you should refrain from using big words like IPC when you clearly don't understand what it means ;)
IPC means instructions per clock, and is a measure of the average instructions per clock the architecture is capable of. IPC is not performance, but an underlying constant of the architecture which is independent of how many threads or cores which are involved, and is also workload independent. We usually use a single thread and a wide selection of workloads to approximate the relative IPC between different CPUs.
But whenever people mention "multithreaded IPC", "application specific IPC", SMT in relation to IPC, etc. that's a dead giveaway that they are clueless about what IPC really is. They are in fact talking about performance, not IPC.
You are the clueless one. There are only application specific IPC, and theoretical IPC (which you can usually calculate from some datasheet by adding up all execution units in a core design, and completely useless for determining real world system level IPC). Everything else is just an opinion. Saying that IPC is a constant is a dead giveaway that the writer does not understand what the term means.

If IPC is some constant as you state, please give the IPC figure of any processsor currently in production, as a floating point number, and how that was measured.

edit: also, who the f is ”we” in your text?

you seem to be describing what relative single threaded performance at a fixed clock frequency is. That has very little to do with ”IPC”.
 
Last edited:
Joined
Mar 10, 2010
Messages
11,878 (2.31/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Corsair Vengeance Rgb pro 3600cas14 16Gb in four sticks./16Gb/16GB
Video Card(s) Powercolour RX7900XT Reference/Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores 8726 vega 3dmark timespy/ laptop Timespy 6506
I must admit, I'm also wondering why only one of the CCDs was "covered" with the new cache, and not both. But the thing that worries me more is that the die is now facing downwards, and most of the excess silicon is removed before the cache is then placed on top of it - my worry is increased thermals, and maybe that's why is was running at a slower speed?

I hope it's not some kind of thermal compromise where the CPU will clock down when under high use.
It was running at the same exact speed as that cpu compared to it, to accurately present the IP gain during a world wide customer presentation, on purpose.
Leaving final speeds and performance relatively wide open on purpose, imagine Intel's engineering spec team now, I bet they're in work this weekend.
There's a cache per CCD AFAIK ?!
As for thermal worries, we've had them year's, they're not going away. ..
 
Joined
Jul 9, 2020
Messages
93 (0.07/day)
Location
RU
It was running at the same exact speed as that cpu compared to it, to accurately present the IP gain during a world wide customer presentation, on purpose.
I must admit, I'm also wondering why only one of the CCDs was "covered" with the new cache, and not both.
When i first time saw picture i got same question. Only after few days i got it - It's not two CPUs (original 5900X vs 5900X with 3D V-Cache), it's SAME CPU. They made prototype with equal CCDs when one CCD of them with 3D V-Cache. That's why it have got same frequency 4GHz. It's just easy to compare on one CPU. You simply may run benchmark on CCD1 and then run same benchmark on CCD2 with 3D V-Cache and see performance gain.
 
Joined
Apr 19, 2018
Messages
957 (0.44/day)
Processor AMD Ryzen 9 5950X
Motherboard Asus ROG Crosshair VIII Hero WiFi
Cooling Arctic Liquid Freezer II 420
Memory 32Gb G-Skill Trident Z Neo @3806MHz C14
Video Card(s) MSI GeForce RTX2070
Storage Seagate FireCuda 530 1TB
Display(s) Samsung G9 49" Curved Ultrawide
Case Cooler Master Cosmos
Audio Device(s) O2 USB Headphone AMP
Power Supply Corsair HX850i
Mouse Logitech G502
Keyboard Cherry MX
Software Windows 11
When i first time saw picture i got same question. Only after few days i got it - It's not two CPUs (original 5900X vs 5900X with 3D V-Cache), it's SAME CPU. They made prototype with equal CCDs when one CCD of them with 3D V-Cache. That's why it have got same frequency 4GHz. It's just easy to compare on one CPU. You simply may run benchmark on CCD1 and then run same benchmark on CCD2 with 3D V-Cache and see performance gain.
Interesting! So it was essentially 2 5800x's?

I would love to get my hands on that and see the impact of this cache. I'm interested in the impact on heat, voltage/current and clocks.
 
Joined
Apr 8, 2008
Messages
328 (0.06/day)
Something's bothering me: why did AMD use the prototype @ a fixed 4 GHz VS a regular 5900X also @ a fixed 4GHz, instead of the prototype @ "it's normal speed" VS the 5900X @ it's normal speed?

Either they don't yet know what "it's normal speed" will end up being, which is actually quite likely, or this prototype will end up having a lower frequency than the 5900X due to it's new "3D nature", meaning that 15% improvement might not actually be as much as it seems to be.

Ofc, that 15% @ 4 GHz is only with the 3D V-cache without anything else, so it's still possible that the new Zen 3+ CPU could still end up being 15%+ faster than the 5900X WHILE having ... say ... 400 to 500 MHz LESS frequency, depending on WHAT ELSE is being changed VS Zen 3.

I guess because different chips behave differently. They wanted to omit any chip lottery even though they control every thing, but still not all silicons are equals. So they just went for a direct clock for clock comparison. And because they can turn on/off the extra V-Cache, they can also test the power different by retesting the same CPU again and again.

There's rumour about Zen3+ being a slightly new design compared to Zen3 with extra IPC improvements, but those rumours are conflicting as hell, some says they cancelled it, some says it hasn't been there to cancel it in the first place, and some says it's just the exact same Zen3 design with the V-Cache, that's why there are many conflicting rumours around which makes all these rumours plausible in the same time.
 

Mussels

Freshwater Moderator
Staff member
Joined
Oct 6, 2004
Messages
58,413 (8.21/day)
Location
Oystralia
System Name Rainbow Sparkles (Power efficient, <350W gaming load)
Processor Ryzen R7 5800x3D (Undervolted, 4.45GHz all core)
Motherboard Asus x570-F (BIOS Modded)
Cooling Alphacool Apex UV - Alphacool Eisblock XPX Aurora + EK Quantum ARGB 3090 w/ active backplate
Memory 2x32GB DDR4 3600 Corsair Vengeance RGB @3866 C18-22-22-22-42 TRFC704 (1.4V Hynix MJR - SoC 1.15V)
Video Card(s) Galax RTX 3090 SG 24GB: Underclocked to 1700Mhz 0.750v (375W down to 250W))
Storage 2TB WD SN850 NVME + 1TB Sasmsung 970 Pro NVME + 1TB Intel 6000P NVME USB 3.2
Display(s) Phillips 32 32M1N5800A (4k144), LG 32" (4K60) | Gigabyte G32QC (2k165) | Phillips 328m6fjrmb (2K144)
Case Fractal Design R6
Audio Device(s) Logitech G560 | Corsair Void pro RGB |Blue Yeti mic
Power Supply Fractal Ion+ 2 860W (Platinum) (This thing is God-tier. Silent and TINY)
Mouse Logitech G Pro wireless + Steelseries Prisma XL
Keyboard Razer Huntsman TE ( Sexy white keycaps)
VR HMD Oculus Rift S + Quest 2
Software Windows 11 pro x64 (Yes, it's genuinely a good OS) OpenRGB - ditch the branded bloatware!
Benchmark Scores Nyooom.
I guess because different chips behave differently. They wanted to omit any chip lottery even though they control every thing, but still not all silicons are equals. So they just went for a direct clock for clock comparison. And because they can turn on/off the extra V-Cache, they can also test the power different by retesting the same CPU again and again.

There's rumour about Zen3+ being a slightly new design compared to Zen3 with extra IPC improvements, but those rumours are conflicting as hell, some says they cancelled it, some says it hasn't been there to cancel it in the first place, and some says it's just the exact same Zen3 design with the V-Cache, that's why there are many conflicting rumours around which makes all these rumours plausible in the same time.
well of course AMD would have tested *all* those options

If they found out zen3 works well for minimal cost but good gains just by adding the V-cache, they'd shuffle the new designs and ideas off for AM5 chips instead
 
Joined
Apr 19, 2018
Messages
957 (0.44/day)
Processor AMD Ryzen 9 5950X
Motherboard Asus ROG Crosshair VIII Hero WiFi
Cooling Arctic Liquid Freezer II 420
Memory 32Gb G-Skill Trident Z Neo @3806MHz C14
Video Card(s) MSI GeForce RTX2070
Storage Seagate FireCuda 530 1TB
Display(s) Samsung G9 49" Curved Ultrawide
Case Cooler Master Cosmos
Audio Device(s) O2 USB Headphone AMP
Power Supply Corsair HX850i
Mouse Logitech G502
Keyboard Cherry MX
Software Windows 11
...If they found out zen3 works well for minimal cost but good gains just by adding the V-cache, they'd shuffle the new designs and ideas off for AM5 chips instead
This also bothers me. This whole Zen3+ thing is confusing. I can finally buy a 5900x/5950x now, but am holding off because I simply cannot believe that AM4 is done. DDR5 is not ready, expensive, and slower than DDR4.

I really think AMD should release one last hurrah for AM4, to give DDR5 a chance to be available, and performant compared to DDR4. A double release towards the end of this year of the AM5 platform and one last CPU for AM4 would be perfect for the market. But who knows...
 

Mussels

Freshwater Moderator
Staff member
Joined
Oct 6, 2004
Messages
58,413 (8.21/day)
Location
Oystralia
System Name Rainbow Sparkles (Power efficient, <350W gaming load)
Processor Ryzen R7 5800x3D (Undervolted, 4.45GHz all core)
Motherboard Asus x570-F (BIOS Modded)
Cooling Alphacool Apex UV - Alphacool Eisblock XPX Aurora + EK Quantum ARGB 3090 w/ active backplate
Memory 2x32GB DDR4 3600 Corsair Vengeance RGB @3866 C18-22-22-22-42 TRFC704 (1.4V Hynix MJR - SoC 1.15V)
Video Card(s) Galax RTX 3090 SG 24GB: Underclocked to 1700Mhz 0.750v (375W down to 250W))
Storage 2TB WD SN850 NVME + 1TB Sasmsung 970 Pro NVME + 1TB Intel 6000P NVME USB 3.2
Display(s) Phillips 32 32M1N5800A (4k144), LG 32" (4K60) | Gigabyte G32QC (2k165) | Phillips 328m6fjrmb (2K144)
Case Fractal Design R6
Audio Device(s) Logitech G560 | Corsair Void pro RGB |Blue Yeti mic
Power Supply Fractal Ion+ 2 860W (Platinum) (This thing is God-tier. Silent and TINY)
Mouse Logitech G Pro wireless + Steelseries Prisma XL
Keyboard Razer Huntsman TE ( Sexy white keycaps)
VR HMD Oculus Rift S + Quest 2
Software Windows 11 pro x64 (Yes, it's genuinely a good OS) OpenRGB - ditch the branded bloatware!
Benchmark Scores Nyooom.
This also bothers me. This whole Zen3+ thing is confusing. I can finally buy a 5900x/5950x now, but am holding off because I simply cannot believe that AM4 is done. DDR5 is not ready, expensive, and slower than DDR4.

I really think AMD should release one last hurrah for AM4, to give DDR5 a chance to be available, and performant compared to DDR4. A double release towards the end of this year of the AM5 platform and one last CPU for AM4 would be perfect for the market. But who knows...
with all the shortages, they have to continue AM4 for a bit

they simply wouldnt be able to get enough stock for sales of AM5 when it needs new CPUs, mobos and RAM, if they abandon AM4

So AM4 should get one last hurrah, a zen3+ to keep using the existing stock and keep sales up while preparing stock for AM5 as a lower quantity, high priced premium option for early adopters... if you know you're gunna sell less products, market them as premium and up the price


We also have those x570S chipset boards coming out, so we know the sockets not abandoned... why release new mobo chipsets and whole new series of boards from multiple vendors, if no new CPUs are coming?
 
Joined
Jun 3, 2010
Messages
2,540 (0.50/day)
This also bothers me. This whole Zen3+ thing is confusing. I can finally buy a 5900x/5950x now, but am holding off because I simply cannot believe that AM4 is done. DDR5 is not ready, expensive, and slower than DDR4.
Personally, I find it like the muscle car enthusiasm. Sure you can wait to see what comes out of the pipeline, but outside of minute distinctions, the tdp tiers are the same.
 
Joined
Jan 3, 2021
Messages
2,593 (2.20/day)
Location
Slovenia
Processor i5-6600K
Motherboard Asus Z170A
Cooling some cheap Cooler Master Hyper 103 or similar
Memory 16GB DDR4-2400
Video Card(s) IGP
Storage Samsung 850 EVO 250GB
Display(s) 2x Oldell 24" 1920x1200
Case Bitfenix Nova white windowless non-mesh
Audio Device(s) E-mu 1212m PCI
Power Supply Seasonic G-360
Mouse Logitech Marble trackball, never had a mouse
Keyboard Key Tronic KT2000, no Win key because 1994
Software Oldwin
with all the shortages, they have to continue AM4 for a bit

they simply wouldnt be able to get enough stock for sales of AM5 when it needs new CPUs, mobos and RAM, if they abandon AM4

So AM4 should get one last hurrah, a zen3+ to keep using the existing stock and keep sales up while preparing stock for AM5 as a lower quantity, high priced premium option for early adopters... if you know you're gunna sell less products, market them as premium and up the price


We also have those x570S chipset boards coming out, so we know the sockets not abandoned... why release new mobo chipsets and whole new series of boards from multiple vendors, if no new CPUs are coming?
I'm thinking along the same lines as you, mostly ... but what's the most probable timeline, from today's point of view?

So many details are floating around regarding AM5, I'd expect it to launch this year, but only for APUs (Rembrandt/Zen3+). Not as a premium option. APU users aren't the most demanding crowd, they can tolerate the initial quirks and non-optimum performance of the whole AM5 + DDR5 platform, including high RAM latencies, but graphics would benefit a lot from the DDR5 bandwidth. 3D V-cache? No, that costs a lot. Gears? Sure, like in everything else on DDR5.

Come to think of that, AM5 may even begin its life in an OEM-only launch.

The last and best round of AM4 chips could arrive at the same time, maybe even after first ones on AM5. 3D V-cache? Yes, and that costs a lot, so the CPUs will cost a lot too. About 20% or 25% more for 15% more performance, says the law of diminishing returns.

When i first time saw picture i got same question. Only after few days i got it - It's not two CPUs (original 5900X vs 5900X with 3D V-Cache), it's SAME CPU. They made prototype with equal CCDs when one CCD of them with 3D V-Cache. That's why it have got same frequency 4GHz. It's just easy to compare on one CPU. You simply may run benchmark on CCD1 and then run same benchmark on CCD2 with 3D V-Cache and see performance gain.
Well possibly - AMD can do all kinds of weird experiments, it's their chips. But don't take it for granted that it's possible to make an operative Ryzen CPU with asymmetrical configuration of cache. The one that Lisa showed could have been made just for her, that is, for showing off.
For testing without much hassle, AMD can build pairs of identical systems, with the only difference being the presence or absence of V-cache on both dies.

One of the most impressive bits of info was that the Zen 3 CCD was designed from the very beginning to be coupled with a stacked piece of cache. Anandtech reported that I think, and it really couldn't be another way, because you can't just put some extra static RAM on top of any random CPU and hope to connect it properly.
 
Last edited:
Joined
Nov 11, 2005
Messages
35 (0.01/day)
well, now try imagine they add 8GB HBM2E or HBM3 ontop of the I/O die for APU

it would sell like pancakes as no GPUs in sight
 
Top