AMD "Zen 3" 3D Vertical Cache Detailed Some More

btarunr · Aug 6, 2021

Senior Technology Fellow Yuzo Fukuzaki shed light on the elusive new CPU technology AMD unveiled at its Computex 2021 keynote, 3D Vertical Cache (3DV Cache). The company had then detailed it as an additional 64 MB last-level cache stacked on top of a CCD (CPU core complex die), which significantly improves performance, including a claimed 15% average gain in gaming performance, which accounts for a generational performance gain over "Zen 3." The prototype AMD unveiled in its keynote was based on a Socket AM4 processor with "Zen 3" CCDs that have the 3DV Cache components in place. With two such CCDs, a 16-core processor would end up with 192 MB of L3 cache.

Yuzo Fukuzaki's theory sheds light on the most plausible position of 3DV Cache in the processor's cache hierarchy. Apparently, it expands the CCD's L3 cache, and doesn't serve as an "L4" victim cache to the L3. This way, the cache setup remains transparent to the OS, which sees it as a contiguous 96 MB block of L3 cache (per CCD). The 3DV Cache die is an SRAM chip fabricated on the same 7 nm process as the "Zen 3" CCD. It measures 6 mm x 6 mm (36 mm²), and is located above the region of the CCD that typically has the 32 MB L3 SRAM. Fukuzaki estimates that roughly 23,000 TSVs (through-silicon vias), each about 17 µm in size, connect the 3DV Cache die to the main CCD.

View at TechPowerUp Main Site

yeeeeman · Aug 6, 2021

Nice stuff

Vya Domus · Aug 6, 2021

I am still skeptical of how well this can be cooled, in areas where there is an overlap with the L1 caches for example there is going to be a massive concertation of heat.

john_ · Aug 6, 2021

I wonder if AMD will bring those CPUs to AM4 and especially to older motherboards, like X470 and B450.

On the other hand, next gen APUs, with Zen 4 cores, RDNA1/2 iGPUs, infinity cache and 3D cache, could make today's APUs look like old and very slow trash. Like comparing an AM4 APU with an FM2+ APU.

efikkan · Aug 6, 2021

article said:
Apparently, it expands the CCD's L3 cache, and doesn't serve as an "L4" victim cache to the L3. This way, the cache setup remains transparent to the OS, which sees it as a contiguous 96 MB block of L3 cache (per CCD).

Actually, the software doesn't "see" the cache at all, it's completely transparent. What software sees is memory, and thanks to caching this memory is sometimes faster.

Vya Domus said:
I am still skeptical of how well this can be cooled, in areas where there is an overlap with the L1 caches for example there is going to be a massive concertation of heat.

L3 cache doesn't add much heat, and this extra layer will probably act as a heat spreader and will be good enough, unless you're thinking about extreme overclocking.

I'm much more concerned about cost and availability, the latter of which has been a huge limitation for AMD lately.

persondb · Aug 6, 2021

I wonder how AMD will position Zen 4 against this as perfomance seems like will be pretty close, with quite a few workloads possibly being better due to more cache. Maybe this means that the top Zen 4 SKUs will have 3D cache as well?

With that said, I think that pricing is going to be really bad for those Zen3D parts and only for 5900X and 5950X, any lower and it kinda stops making sense.

One random thought that I had was what if AMD plan is to remove L3 cache altogether from the CCD and only have it from the cache die. That could be good for yields as reduces the CCD die considerably(a cache die would also likely be defect resistant since SRAM is easily redundant and thus possibly cheap), though It depends on how costly the 3D packaging is and others factors that I am probably unaware of that could make this unviable. For temperatures, maybe use some of those new 3D packaging tech that allows you to put logic on top. Could also use this as a bridge between CCDs like in the RDNA3 rumours?

That's just a random thought of mine though and probably completely wrong, so just ignore it.

TumbleGeorge · Aug 6, 2021

Hmm, this cache isn't 3D and isn't vertical. This is just another but fat layer.

Tomorrow · Aug 6, 2021

Vya Domus said:
I am still skeptical of how well this can be cooled, in areas where there is an overlap with the L1 caches for example there is going to be a massive concertation of heat.

Vcache sits on top of existing one. Cache is not's what causing the heat. The chiplet part that generates heat will be covered by epoxy resin most likely to have it level with vcache.

Richards · Aug 6, 2021

I doubt this will help in normal work loads vs alder lake

Steevo · Aug 6, 2021

TumbleGeorge said:
Hmm, this cache isn't 3D and isn't vertical. This is just another but fat layer.

STVs and it is vertical, directly above the existing chip.

Did you expect a skyscraper?

I mean at 7nm it probably scales like a skyscraper.

Cache is the new performance booster, Infonity cache, 3DV cache, the M1.

Chrispy_ · Aug 6, 2021

TumbleGeorge said:
Hmm, this cache isn't 3D and isn't vertical. This is just another but fat layer.

Looks pretty vertical to me:

Mysteoa · Aug 6, 2021

TumbleGeorge said:
Hmm, this cache isn't 3D and isn't vertical. This is just another but fat layer.

A Cube isn't 3D, it's just a fat square layer.

Makaveli · Aug 6, 2021

Richards said:
I doubt this will help in normal work loads vs alder lake

lol why am I not surprised you are the only person in this thread to come in here talking about intel?

What are you worried about?

Asni · Aug 6, 2021

2TB/s. Bandwidth wise, latency is ten time higher, this is L1 ballpark.

TumbleGeorge · Aug 6, 2021

Steevo said:
STVs and it is vertical, directly above the existing chip.

Did you expect a skyscraper?

I mean at 7nm it probably scales like a skyscraper.

Cache is the new performance booster, Infonity cache, 3DV cache, the M1.

Vertical=90° angle in relation to horizontal, yes!?

Punkenjoy · Aug 6, 2021

Steevo said:
STVs and it is vertical, directly above the existing chip.

Did you expect a skyscraper?

I mean at 7nm it probably scales like a skyscraper.

Cache is the new performance booster, Infonity cache, 3DV cache, the M1.

That guy comment was pretty rude, but it's not full 3D but more 2.5D

3D chip is not there yet and not around the corner (if it ever come). Right now it's each die are in 2D. they can stack those and have TSV, but it remain that each layer is 2D. 3D will allow very complex transistor layout. It would help with propagation as you would be able to make denser logic with less space. But I am still not even sure we can make those in labs...

Anyway, that do not mean that this 3D cache isn't impressive. It's actually very impressive and only the beginning of die stacking. The next decade will be very interesting, trust me on that. This is only the beginning.

efikkan · Aug 6, 2021

Richards said:
I doubt this will help in normal work loads vs alder lake

There should be no doubt that this will help performance, the question is by how much, and in which workloads.

L3 is a spillover cache for L2. Boosting L3 cache will help reduce cache misses, but probably mostly cache lines with instructions, as discarded data cache lines are less likely to be reused quickly, which means the gains will be very workload dependent. But regardless this is a very expensive way to add a little performance. This makes me wonder if this will be reserved for a small selection of premium models.

mechtech · Aug 6, 2021

I wonder if they are trying to do with the CPU with what they did to the GPU with the large L3 "infinity" cache??

TumbleGeorge · Aug 6, 2021

mechtech said:
I wonder if they are trying to do with the CPU with what they did to the GPU with the large L3 "infinity" cache??

Ok +$100 premium price above previous +$50! Do you ready to pay?

R0H1T · Aug 6, 2021

Richards said:
I doubt this will help in normal work loads vs alder lake

What workloads? Do we have alder lake benchmarks already?

AnarchoPrimitiv · Aug 6, 2021

With reports already claiming that Zen4 IPC increase is in the 25%+ area (including an article by this very website that stated 29%) it makes me wonder in the top models of zen4 perhaps 6900x and 6950x, but maybe even the 6800x might also come with this V-cache as well and be something really impressive, like 35%+ core for core performance increase over Zen3?

R-T-B · Aug 7, 2021

Vya Domus said:
I am still skeptical of how well this can be cooled, in areas where there is an overlap with the L1 caches for example there is going to be a massive concertation of heat.

It will certainly run hotter. The question is whether the IPC gains will make up for the increased heat generated.

Haile Selassie · Aug 7, 2021

R-T-B said:
It will certainly run hotter. The question is whether the IPC gains will make up for the increased heat generated.

95W part CCX L2 cache is about 8.1W in total, according to AMD power map. No, not insignificant, but the surface is also very big, it's about half the size of each CCX. Each core has about the same power draw, but is 30x smaller in size.

Cooling is not an issue.

Source: AMD NDA docs.

TumbleGeorge · Aug 7, 2021

Haile Selassie said:
according to AMD power map

Where is this "map". Link, please!

Asni · Aug 7, 2021

Haile Selassie said:
95W part CCX L2 cache is about 8.1W in total, according to AMD power map. No, not insignificant, but the surface is also very big, it's about half the size of each CCX. Each core has about the same power draw, but is 30x smaller in size.

Cooling is not an issue.

Source: AMD NDA docs.

They're not talking about increased power, the problem could be the greater thickness. You're adding a thick layer of silicon over the main core die: this layer could reduce heat transfer to the IHS, then to the cooler.

But Amd said that:

https://twitter.com/x/status/1399770424658087941

System Name	RBMK-1000
Processor	AMD Ryzen 7 5700G
Motherboard	Gigabyte B550 AORUS Elite V2
Cooling	DeepCool Gammax L240 V2
Memory	2x 16GB DDR4-3200
Video Card(s)	Galax RTX 4070 Ti EX
Storage	Samsung 990 1TB
Display(s)	BenQ 1440p 60 Hz 27-inch
Case	Corsair Carbide 100R
Audio Device(s)	ASUS SupremeFX S1220A
Power Supply	Cooler Master MWE Gold 650W
Mouse	ASUS ROG Strix Impact
Keyboard	Gamdias Hermes E2
Software	Windows 11 Pro

System Name	Good enough
Processor	AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard	ASRock B650 Pro RS
Cooling	2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory	32GB - FURY Beast RGB 5600 Mhz
Video Card(s)	Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage	1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s)	LG UltraGear 32GN650-B + 4K Samsung TV
Case	Phanteks NV7
Power Supply	GPS-750C

System Name	3 desktop systems: Gaming / Internet / HTPC
Processor	Ryzen 5 7600 / Ryzen 5 4600G / Ryzen 5 5500
Motherboard	X670E Gaming Plus WiFi / MSI X470 Gaming Plus Max (1) / MSI X470 Gaming Plus Max (2)
Cooling	Snowman / Segotep T4 / Νoctua U12S
Memory	Kingston FURY Beast 32GB DDR5 6000 / 16GB JUHOR / 32GB G.Skill RIPJAWS 3600 + Aegis 3200
Video Card(s)	ASRock RX 6600+GTX 1660 / Vega 7 integrated / Radeon RX 580+GTX 1050
Storage	NVMes, ONLY NVMes / NVMes, SATA Storage / NVMe, SATA, external storage
Display(s)	Philips 43PUS8857/12 UHD TV (120Hz, HDR, FreeSync Premium) / 19'' HP monitor + BlitzWolf BW-V5
Case	Sharkoon Rebel 12 / CoolerMaster Elite 361 / Xigmatek Midguard
Audio Device(s)	onboard
Power Supply	Chieftec 850W / Silver Power 400W / Sharkoon 650W
Mouse	CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Keyboard	CoolerMaster Devastator III Plus / CoolerMaster Devastator / Logitech
Software	Windows 10 / Windows 10&Windows 11 / Windows 10

Processor	AMD Ryzen 9 5900X \|\|\| Intel Core i7-3930K
Motherboard	ASUS ProArt B550-CREATOR \|\|\| Asus P9X79 WS
Cooling	Noctua NH-U14S \|\|\| Be Quiet Pure Rock
Memory	Crucial 2 x 16 GB 3200 MHz \|\|\| Corsair 8 x 8 GB 1333 MHz
Video Card(s)	MSI GTX 1060 3GB \|\|\| MSI GTX 680 4GB
Storage	Samsung 970 PRO 512 GB + 1 TB \|\|\| Intel 545s 512 GB + 256 GB
Display(s)	Asus ROG Swift PG278QR 27" \|\|\| Eizo EV2416W 24"
Case	Fractal Design Define 7 XL x 2
Audio Device(s)	Cambridge Audio DacMagic Plus
Power Supply	Seasonic Focus PX-850 x 2
Mouse	Razer Abyssus
Keyboard	CM Storm QuickFire XT
Software	Ubuntu

System Name	DarkStar
Processor	AMD Ryzen 7 5800X3D
Motherboard	Gigabyte X570 Aorus Master 1.0 (BIOS F39g)
Cooling	Arctic Liquid Freezer II 420mm AIO (rev4)
Memory	4x8GB Patriot Viper DDR4 4400C19 @ 3733Mhz 14-14-13-27 1T
Video Card(s)	Gigabyte Radeon RX 9070 XT Gaming OC 16GB GDDR6 @ 3400Mhz Core/22Gbps Mem
Storage	1TB Samsung 990 Pro (OS);2TB Samsung PM9A1;4TB XPG S70 Blade (Games);14TB WD UltraStar HC530 (Video)
Display(s)	27" LG UltraGear 27GS85Q-B @ 2560x1440 @ 200Hz, Nano-IPS
Case	be quiet! Dark Base Pro 900 Rev.2
Audio Device(s)	SteelSeries Arctis Nova Pro Wireless
Power Supply	1000W Seasonic PRIME Ultra Titanium;600W APC SMT750i UPS
Mouse	Logitech G604
Keyboard	Logitech G910 Orion Spark
Software	Windows 11 Pro x64 24H2 (Build 26100.4351)

System Name	Compy 386
Processor	7800X3D
Motherboard	Asus
Cooling	Air for now.....
Memory	64 GB DDR5 6400Mhz
Video Card(s)	7900XTX 310 Merc
Storage	Samsung 990 2TB, 2 SP 2TB SSDs, 24TB Enterprise drives
Display(s)	55" Samsung 4K HDR
Audio Device(s)	ATI HDMI
Mouse	Logitech MX518
Keyboard	Razer
Software	A lot.
Benchmark Scores	Its fast. Enough.

System Name	Bragging Rights
Processor	Atom Z3735F 1.33GHz
Motherboard	It has no markings but it's green
Cooling	No, it's a 2.2W processor
Memory	2GB DDR3L-1333
Video Card(s)	Gen7 Intel HD (4EU @ 311MHz)
Storage	32GB eMMC and 128GB Sandisk Extreme U3
Display(s)	10" IPS 1280x800 60Hz
Case	Veddha T2
Audio Device(s)	Apparently, yes
Power Supply	Samsung 18W 5V fast-charger
Mouse	MX Anywhere 2
Keyboard	Logitech MX Keys (not Cherry MX at all)
VR HMD	Samsung Oddyssey, not that I'd plug it into this though....
Software	W10 21H1, barely
Benchmark Scores	I once clocked a Celeron-300A to 564MHz on an Abit BE6 and it scored over 9000.

System Name	The Expanse
Processor	AMD Ryzen 7 9800X3D
Motherboard	Asus Prime X670E-Pro Wifi BIOS 3222 AGESA PI 1.2.0.3a
Cooling	Corsair H150i Elite LCD XT
Memory	64GB G.SKILL Trident Z5 Neo RGB DDR5 6000 CL 30-40-40-96 1T
Video Card(s)	XFX Radeon RX 7900 XTX Magnetic Air (25.6.1)
Storage	WD SN850X 2TB / Corsair MP600 1TB / Samsung 860Evo 1TB x2 Raid 0 / Asus NAS AS1004T V2 20TB
Display(s)	LG 34GP83A-B 34 Inch 21: 9 UltraGear Curved QHD (3440 x 1440) 1ms Nano IPS 160Hz
Case	Fractal Design Meshify S2
Audio Device(s)	Creative X-Fi + Logitech Z-5500 + HS80 Wireless
Power Supply	Corsair AX850 Titanium
Mouse	Corsair Dark Core RGB SE
Keyboard	Corsair K100
Software	Windows 10 Pro x64 22H2
Benchmark Scores	https://valid.x86.fr/asijsu https://browser.geekbench.com/v6/cpu/11073923

Processor	Ryzen 5700x
Motherboard	Gigabyte X570S Aero G R1.1 Bios F7g
Cooling	Noctua NH-C12P SE14 w/ NF-A15 HS-PWM Fan 1500rpm
Memory	Micron DDR4-3200 2x32GB D.S. D.R. (CT2K32G4DFD832A)
Video Card(s)	AMD RX 6800 - Asus Tuf
Storage	Kingston KC3000 1TB & 2TB & 4TB Corsair MP600 Pro LPX
Display(s)	LG 27UL550-W (27" 4k)
Case	Be Quiet Pure Base 600 (no window)
Audio Device(s)	Realtek ALC1220-VB
Power Supply	SuperFlower Leadex V Gold Pro 850W ATX Ver2.52
Mouse	Mionix Naos Pro
Keyboard	Corsair Strafe with browns
Software	W10 22H2 Pro x64

System Name	Lightbringer
Processor	Ryzen 7 2700X
Motherboard	Asus ROG Strix X470-F Gaming
Cooling	Enermax Liqmax Iii 360mm AIO
Memory	G.Skill Trident Z RGB 32GB (8GBx4) 3200Mhz CL 14
Video Card(s)	Sapphire RX 5700XT Nitro+
Storage	Hp EX950 2TB NVMe M.2, HP EX950 1TB NVMe M.2, Samsung 860 EVO 2TB
Display(s)	LG 34BK95U-W 34" 5120 x 2160
Case	Lian Li PC-O11 Dynamic (White)
Power Supply	BeQuiet Straight Power 11 850w Gold Rated PSU
Mouse	Glorious Model O (Matte White)
Keyboard	Royal Kludge RK71
Software	Windows 10

System Name	Pioneer
Processor	Ryzen 9 9950X
Motherboard	MSI MAG X670E Tomahawk Wifi
Cooling	Noctua NH-D15 + A whole lotta Sunon, Phanteks and Corsair Maglev blower fans...
Memory	64GB (2x 32GB) G.Skill Flare X5 @ DDR5-6200(Running 1T no GDM)
Video Card(s)	PNY RTX 5080 OC
Storage	Intel 5800X Optane 800GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs, 1x 2TB Seagate Exos 3.5"
Display(s)	55" Hisense 55U8N 4K FALD Display
Case	Thermaltake Core X31
Audio Device(s)	TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply	FSP Hydro Ti Pro 850W 80Plus Titanium PSU
Mouse	Logitech G305 Lightspeed Wireless
Keyboard	WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software	Gentoo Linux x64 / Windows 11 Enterprise (yes it's legit)

AMD "Zen 3" 3D Vertical Cache Detailed Some More

Editor & Senior Moderator