AMD "Strix Point" Zen 5 Monolithic Silicon has a 12-core CPU?

TumbleGeorge · Jul 20, 2023

Count von Schwalbe said:
reason to disable two more working cores and reduce profit margins.

Now a 6-core Zen 4c CCD would make sen

APU's are... monolithic design...

wNotyarD · Jul 20, 2023

Count von Schwalbe said:
Reduced prices as the die area is much much lower. 4-core CPUs don't make any sense as TSMC N5 is pretty mature, so there is not much reason to disable two more working cores and reduce profit margins.

Now a 6-core Zen 4c CCD would make sense, and disabled units for Athlons.

I mean, as (FINALLY) Intel is raising the core count on their i3's it would make sense for AMD to follow suit. Be it with 4C Athlons and 6C R3's (also, while we're at it, why not making R5's go 8C and R7's go 12C, reserve R9 for 16C).

TumbleGeorge said:
APU's are... monolithic design...

We are considering hypothetical Zen 4c designs, so that's where the CCX/CCD lingo came from.

AnotherReader · Jul 20, 2023

Count von Schwalbe said:
Reduced prices as the die area is much much lower. 4-core CPUs don't make any sense as TSMC N5 is pretty mature, so there is not much reason to disable two more working cores and reduce profit margins.

Now a 6-core Zen 4c CCD would make sense, and disabled units for Athlons.

That would be a good replacement for the Zen 2 based SKUs. Despite the anticipated clock speed handicap, they should be faster than those.

Count von Schwalbe · Jul 20, 2023

TumbleGeorge said:
APU's are... monolithic design...

Yep. EPYC is not.

AnotherReader said:
That would be a good replacement for the Zen 2 based SKUs. Despite the anticipated clock speed handicap, they should be faster than those.

Much much faster.

qcmadness · Jul 20, 2023

wNotyarD said:
That's the point. Zen 4c has half the L3 cache of Zen 4, that's what makes it smaller. Clock speeds are then to be defined by binning, core count and power target.

The APUs have half the L3 cache of desktop / server variants.

HD64G · Jul 20, 2023

I guess that a monolithic APU with 12 Zen5c cores will have a bit lower clocks than the Zen5 cores in the normal dual chiplets. But since it will have a relatively powerful iGPU in it, the graphic performance won't be hindered by the cache being halved. Price will be the point to discuss because technologically that APU will be a marvelous product.

Darmok N Jalad · Jul 21, 2023

wNotyarD said:
Actually, for an Athlon it makes a lot of sense. If memory doesn't fail me K10 Phenoms and Athlons were separated similarly.

My first Socket A cpu was a Dothan, which was the Athlon with less L2.

All this got me thinking though. AMD is the only vendor NOT doing asymmetric cores. Mobile devices have it, Intel has it, Apple has it. I think AMD is going to have to do something here, or they might just lose the core race they started. Maybe them saying they have no intention of an E core is really that they will do something like the C core. Curious which one has more cache anyway, an AMD C core or an Intel E core?

AnotherReader · Jul 21, 2023

Darmok N Jalad said:
My first Socket A cpu was a Dothan, which was the Athlon with less L2.

All this got me thinking though. AMD is the only vendor NOT doing asymmetric cores. Mobile devices have it, Intel has it, Apple has it. I think AMD is going to have to do something here, or they might just lose the core race they started. Maybe them saying they have no intention of an E core is really that they will do something like the C core. Curious which one has more cache anyway, an AMD C core or an Intel E core?

An AMD C core and the Raptor Cove E core have the same amount of L2 cache: 1 MB per core. In Raptor Cove's case, this L2 is shared across 4 cores for a total of 4 MB per cluster. Of course, the C core has L3 as well.

Minus Infinity · Jul 21, 2023

Chrispy_ said:
IIRC these aren't likely to be 12 full "P-cores" to use Intel's existing nomenclature.

No one can give reliable information on the config. AMD is testing both 8P + 4E and 12P versions. AMD probably don't know what they'll ship yet this far out from release.

persondb · Jul 21, 2023

If Phoenix is any indication, then AMD really needs to focus on driver support for their APUs as Phoenix has been a shitshow. Been months since products with it have been rolling off and there is no official support, only beta drivers from OEMs that basically barely work and no indication that AMD will ever actually integrate full support of it within the main drivers.

So a mega APU sounds great, but what is the point if AMD can't deliver the support for it?

Minus Infinity · Jul 21, 2023

persondb said:
If Phoenix is any indication, then AMD really needs to focus on driver support for their APUs as Phoenix has been a shitshow. Been months since products with it have been rolling off and there is no official support, only beta drivers from OEMs that basically barely work and no indication that AMD will ever actually integrate full support of it within the main drivers.

So a mega APU sounds great, but what is the point if AMD can't deliver the support for it?

Never buy AMD products on first release is the simple rule. Wait 6 months at least and then see what the driver feedback is.

TumbleGeorge · Jul 21, 2023

persondb said:
If Phoenix is any indication, then AMD really needs to focus on driver support for their APUs as Phoenix has been a shitshow. Been months since products with it have been rolling off and there is no official support, only beta drivers from OEMs that basically barely work and no indication that AMD will ever actually integrate full support of it within the main drivers.

So a mega APU sounds great, but what is the point if AMD can't deliver the support for it?

I search in the titles in forum "Phoenix problem" and found nothing.

persondb · Jul 21, 2023

TumbleGeorge said:
I search in the titles in forum "Phoenix problem" and found nothing.

You just didn't look hard enough, examples:

Re: AMD Ryzen™ 7 - 7840u Adrenaline Drivers

Hi , Thank you for looking into this. As of now, AMD does not offer final drivers nor proper Adrenaline support for the AMD Ryzen 7 - 7840u. If you go to the drivers download page, the Ryzen 7 - 7840u is nowhere to be found, yet you can find the previous generation Ryzen 7 - 6800u drivers...

community.amd.com

https://www.reddit.com/r/ROGAlly/comments/154ct2p
https://www.reddit.com/r/Amd/comments/14gngoj
People have a lot of instability with those systems and as an example, report crashing and being completely unable to use the AI tools from Photoshop and/or Lighroom, which is a big ooof, in my view as part of the point of Phoenix was the hardware engine for AI stuff.

Minus Infinity said:
Never buy AMD products on first release is the simple rule. Wait 6 months at least and then see what the driver feedback is.

It has already been months since the first Phoenix products hit the market. I think they started to come in by May? It was supposed to be before, I think but AMD had delays.

TumbleGeorge · Jul 21, 2023

persondb said:
You just didn't look hard enough, examples:

Re: AMD Ryzen™ 7 - 7840u Adrenaline Drivers

Hi , Thank you for looking into this. As of now, AMD does not offer final drivers nor proper Adrenaline support for the AMD Ryzen 7 - 7840u. If you go to the drivers download page, the Ryzen 7 - 7840u is nowhere to be found, yet you can find the previous generation Ryzen 7 - 6800u drivers...

community.amd.com

https://www.reddit.com/r/ROGAlly/comments/154ct2p
https://www.reddit.com/r/Amd/comments/14gngoj
People have a lot of instability with those systems and as an example, report crashing and being completely unable to use the AI tools from Photoshop and/or Lighroom, which is a big ooof, in my view as part of the point of Phoenix was the hardware engine for AI stuff.

It has already been months since the first Phoenix products hit the market. I think they started to come in by May? It was supposed to be before, I think but AMD had delays.

Thanks.
I meant in the forum here at TPU, with the key phrase that was written in my previous comment and search in the titles.

qcmadness · Jul 21, 2023

AnotherReader said:
An AMD C core and the Raptor Cove E core have the same amount of L2 cache: 1 MB per core. In Raptor Cove's case, this L2 is shared across 4 cores for a total of 4 MB per cluster. Of course, the C core has L3 as well.

Zen 4 and Zen 4c literally have the same IPC, while the P and E cores in 12th / 13th Gen has 50%+ IPC difference (P-core is 50% faster than E core at same frequency).

Chrispy_ · Jul 21, 2023

Minus Infinity said:
No one can give reliable information on the config. AMD is testing both 8P + 4E and 12P versions. AMD probably don't know what they'll ship yet this far out from release.

True, this is all speculation, but monolithic products are mobile parts and I'm not sure 12P fits the 15-45W envelope in a viable way. At least not with today's heat density and manufacturing capabilities.

Historically, monolithic mobile-first parts have been clocked up to 65W TDPs as desktop APUs later, but their primary focus is on performance/Watt because cooling and battery life are always more important than outright performance in a laptop part. That's why I'm agreeing with the rumour that these are 4x Zen5 and 8x Zen5c. Maybe I'm wrong, but at least you have my reasoning now.

AnotherReader · Jul 21, 2023

qcmadness said:
Zen 4 and Zen 4c literally have the same IPC, while the P and E cores in 12th / 13th Gen has 50%+ IPC difference (P-core is 50% faster than E core at same frequency).

That's true, but the question was about the amount of cache per core for Zen 4c and Gracemont in Raptor Lake.

Count von Schwalbe · Jul 21, 2023

AnotherReader said:
That's true, but the question was about the amount of cache per core for Zen 4c and Gracemont in Raptor Lake.

Zen 4c and Zen 4 mobile have the same cache. Gracemont is weird as it has shared L2 and L3. Raptor Lake Gracemont has 4MB per cluster of 4 cores, so 1MB each. As these are strictly for MT tasks, I suppose it is better as shared?

Zen 4c is just a space-optimized Zen 4 core, rather than a clockspeed-optimized one. L3 is halved like the mobile Zen 4 parts.

AnotherReader · Jul 21, 2023

Count von Schwalbe said:
Zen 4c and Zen 4 mobile have the same cache. Gracemont is weird as it has shared L2 and L3. Raptor Lake Gracemont has 4MB per cluster of 4 cores, so 1MB each. As these are strictly for MT tasks, I suppose it is better as shared?

Zen 4c is just a space-optimized Zen 4 core, rather than a clockspeed-optimized one. L3 is halved like the mobile Zen 4 parts.

They are lower performance cores so sharing the L2 is good from two points of view:

higher capacity means more hits in L2
fewer misses to the much slower L3

Drawbacks compared to a private L2 are:

higher latency
more wiring (probably a 4 port crossbar) requires more area
lower bandwidth when more than one requester is active, i.e. when more than one core is accessing L2

Chips and Cheese did a deep dive on Gracemont in the 12900k. This was when it had 2 MB L2 per cluster. Let's compare to an AMD SOC that had 4 core clusters too. The latencies are:

Cache Level	Gracemont size (kiB)	Zen 2 size (kiB)	Gracemont latency (cycles)	Zen 2 latency (cycles)
L1D	32	32	3	4
L2	2048	512	17	12
L3	30	16384	74	44

Let's look at bandwidth from the cache in a mulithreaded test.

Cache Level	Gracemont size (kiB)	Zen 2 size (kiB)	Gracemont bandwidth (GB/s)	Zen 2 bandwidth (GB/s)
L1D	32	32	499	1057
L2	2048	512	208	535
L3	30	16384	61	321

Now this makes it clear why Intel chose a large, shared L2. Intel's L3 has poor bandwidth for a Gracemont cluster, but it doesn't matter because the L2 makes up for it.

As an aside, Gracemont has a large L1 I-cache which is something that the bigger cores should copy. This figure from Hirki et al's paper on Haswell's power consumption also shows the benefit of a larger L1.

Count von Schwalbe · Jul 21, 2023

AnotherReader said:
As an aside, Gracemont has a large L1 I-cache which is something that the bigger cores should copy. This figure from Hirki et al's paper on Haswell's power consumption also shows the benefit of a larger L1.

I reckon the production challenges of that are the main reason this is less common. Looking at the strict voltage and thermal requirements of the X3D chips. Burying more cache inside the core itself would be an engineering challenge.

It seems that a Gracemont cluster is more like a 4-thread core than 4 individual cores. I am sure that it technically isn't, but I get FX-8350 vibes...

AnotherReader · Jul 21, 2023

Count von Schwalbe said:
I reckon the production challenges of that are the main reason this is less common. Looking at the strict voltage and thermal requirements of the X3D chips. Burying more cache inside the core itself would be an engineering challenge.

It seems that a Gracemont cluster is more like a 4-thread core than 4 individual cores. I am sure that it technically isn't, but I get FX-8350 vibes...

Sharing the L2 cache doesn't make it like Bulldozer. Bulldozer shared much more, including the front-end and the floating point execution units.

Apple has enormous L1 caches and Power 6, the first 5 GHz CPU, had 64 KiB L1 caches. The Power 6, unlike Apple, didn't have the advantage of current processes, but it also had the advantage of a high power budget. The first Zen had a 64 KiB L1 I-cache as well.

System Name	G-Station 1.17 FINAL
Processor	AMD Ryzen 7 5700X3D
Motherboard	Gigabyte X470 Aorus Gaming 7 Wi-Fi
Cooling	DeepCool AK620 Digital
Memory	Asgard Bragi DDR4-3600CL14 2x16GB
Video Card(s)	Sapphire Pulse RX 7900 XTX
Storage	240GB Samsung 840 Evo, 1TB Asgard AN2, 2TB Hiksemi FUTURE-LITE, 320GB+1TB 7200RPM HDD
Display(s)	Samsung 34" Odyssey OLED G8
Case	Thermaltake Level 20 MT
Audio Device(s)	Astro A40 TR + MixAmp
Power Supply	Cougar GEX X2 1000W
Mouse	Razer Viper Ultimate
Keyboard	Razer Huntsman Elite (Red)
Software	Windows 11 Pro

Processor	Ryzen 7 5700X
Motherboard	ASUS TUF Gaming X570-PRO (WiFi 6)
Cooling	Noctua NH-C14S (two fans)
Memory	2x16GB DDR4 3200
Video Card(s)	Reference Vega 64
Storage	Intel 665p 1TB, WD Black SN850X 2TB, Crucial MX300 1TB SATA, Samsung 830 256 GB SATA
Display(s)	Nixeus NX-EDG27, and Samsung S23A700
Case	Fractal Design R5
Power Supply	Seasonic PRIME TITANIUM 850W
Mouse	Logitech
VR HMD	Oculus Rift
Software	Windows 11 Pro, and Ubuntu 20.04

System Name	Work Computer \| Unfinished Computer
Processor	Core i7-6700 \| Ryzen 5 5600X
Motherboard	Dell Q170 \| Gigabyte Aorus Elite Wi-Fi
Cooling	A fan? \| Truly Custom Loop
Memory	4x4GB Crucial 2133 C17 \| 4x8GB Corsair Vengeance RGB 3600 C26
Video Card(s)	Dell Radeon R7 450 \| RTX 2080 Ti FE
Storage	Crucial BX500 2TB \| TBD
Display(s)	3x LG QHD 32" GSM5B96 \| TBD
Case	Dell \| Heavily Modified Phanteks P400
Power Supply	Dell TFX Non-standard \| EVGA BQ 650W
Mouse	Monster No-Name $7 Gaming Mouse\| TBD

Processor	AMD Ryzen 5 5600@80W
Motherboard	MSI B550 Tomahawk
Cooling	ZALMAN CNPS9X OPTIMA
Memory	2*8GB PATRIOT PVS416G400C9K@3733MT_C16
Video Card(s)	Sapphire Radeon RX 6750 XT Pulse 12GB
Storage	Sandisk SSD 128GB, Kingston A2000 NVMe 1TB, Samsung F1 1TB, WD Black 10TB
Display(s)	AOC 27G2U/BK IPS 144Hz
Case	SHARKOON M25-W 7.1 BLACK
Audio Device(s)	Realtek 7.1 onboard
Power Supply	Seasonic Core GC 500W
Mouse	Sharkoon SHARK Force Black
Keyboard	Trust GXT280
Software	Win 7 Ultimate 64bit/Win 10 pro 64bit/Manjaro Linux

System Name	Budget Box
Processor	Xeon E5-2667v2
Motherboard	ASUS P9X79 Pro
Cooling	Some cheap tower cooler, I dunno
Memory	32GB 1866-DDR3 ECC
Video Card(s)	XFX RX 5600XT
Storage	WD NVME 1GB
Display(s)	ASUS Pro Art 27"
Case	Antec P7 Neo

System Name	Bragging Rights
Processor	Atom Z3735F 1.33GHz
Motherboard	It has no markings but it's green
Cooling	No, it's a 2.2W processor
Memory	2GB DDR3L-1333
Video Card(s)	Gen7 Intel HD (4EU @ 311MHz)
Storage	32GB eMMC and 128GB Sandisk Extreme U3
Display(s)	10" IPS 1280x800 60Hz
Case	Veddha T2
Audio Device(s)	Apparently, yes
Power Supply	Samsung 18W 5V fast-charger
Mouse	MX Anywhere 2
Keyboard	Logitech MX Keys (not Cherry MX at all)
VR HMD	Samsung Oddyssey, not that I'd plug it into this though....
Software	W10 21H1, barely
Benchmark Scores	I once clocked a Celeron-300A to 564MHz on an Abit BE6 and it scored over 9000.