Ryzen 3000 memory controller has "half the performance" on single CCD CPUs

TheLostSwede · Jul 8, 2019

This is an interesting observation that I had missed until now.
It would appear AMD has "cheaped out" on their memory controller a bit and it only has "half the performance" on CPUs with only one CCD in them, during write operations.
Seemingly it has little effect in most applications, but if you're doing something that does a lot of intensive memory writes, you might want to consider getting a dual CCD CPU.
It does seem to have a small effect on the memory latency though.
Just a heads up, as it was not something that was particularly clear from AMD's side.

Source: https://www.guru3d.com/articles_pages/amd_ryzen_7_3700x_ryzen_9_3900x_review,21.html

biffzinker · Jul 8, 2019

TheLostSwede said:
It would appear AMD has "cheaped out" on their memory controller a bit and it only runs at "half speed" (16 vs 32-bit) on CPUs with only one CCD in them

It's 64Bytes/Cycle when reading from memory but drops down to 32Bytes/Cycle for writes. A review somewhere mentioned it.

TheLostSwede · Jul 8, 2019

I guess the source link got it slightly wrong, but yeah, still half speed for writes on single CCD CPUs.

Flaky · Jul 8, 2019

32 bytes * 1800 megahertz = 57.6 GBps

As long as each arrow is not a sum of 16B/cycle in each direction, a single CCD should have enough bandwidth to handle dual channel throughput.

My educated guess is that for some reason in this test the IF divider got set to 1:2.
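A rough sketch of the arithmetic behind that estimate, for anyone who wants to plug in their own numbers. It assumes an 1800 MHz fabric clock and dual channel DDR4-3600, which are the values that give 57.6 GB/s; the 16 B/cycle case is the hypothetical halved link mentioned above. The function names are made up for illustration.

```python
def link_bandwidth_gbs(bytes_per_cycle: int, fclk_mhz: float) -> float:
    """Peak one-way link bandwidth in GB/s: bytes per cycle times cycles per second."""
    return bytes_per_cycle * fclk_mhz * 1e6 / 1e9


def dram_bandwidth_gbs(transfers_mts: float, channels: int = 2, bus_bytes: int = 8) -> float:
    """Peak DRAM bandwidth in GB/s for 64-bit (8-byte) channels."""
    return transfers_mts * 1e6 * bus_bytes * channels / 1e9


FCLK_MHZ = 1800  # assumed fabric clock, not confirmed by the review

full_link = link_bandwidth_gbs(32, FCLK_MHZ)  # 32 B/cycle in one direction
half_link = link_bandwidth_gbs(16, FCLK_MHZ)  # hypothetical 16 B/cycle case
dual_channel = dram_bandwidth_gbs(3600)       # assumed DDR4-3600, two channels

print(f"32 B/cycle link: {full_link:.1f} GB/s")
print(f"16 B/cycle link: {half_link:.1f} GB/s")
print(f"dual channel:    {dual_channel:.1f} GB/s")
```

Under these assumptions a 32 B/cycle link matches dual channel peak throughput, while a 16 B/cycle link covers only half of it.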

londiste · Jul 8, 2019

DRAM:FSB ratio on AIDA screenshots is 54:3 for both, should that reflect the divider?
If they swapped different CPUs in and out, a leftover divider setting could have happened by mistake, I suppose.
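For reference, a small sketch of what a 54:3 ratio works out to. It assumes the ratio is memory clock relative to a 100 MHz base clock, which is my reading of the AIDA64 field rather than something the screenshots confirm; `memory_clock_mhz` is a hypothetical helper.

```python
from fractions import Fraction


def memory_clock_mhz(dram_part: int, fsb_part: int, base_clock_mhz: int = 100) -> float:
    """Memory clock implied by a DRAM:FSB ratio, assuming a given base clock."""
    ratio = Fraction(dram_part, fsb_part)  # 54:3 reduces to 18:1
    return float(ratio * base_clock_mhz)


mem_clock = memory_clock_mhz(54, 3)  # assumed 100 MHz base clock
ddr_rate = mem_clock * 2             # DDR transfers twice per clock

print(f"memory clock: {mem_clock:.0f} MHz, data rate: DDR4-{ddr_rate:.0f}")
```

If that reading is right, the ratio only describes the memory clock, so it probably would not show a 1:2 fabric divider by itself.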

biffzinker · Jul 8, 2019

TheLostSwede said:
I guess the source link got it slightly wrong, but yeah, still half speed for writes on single CCD CPUs.

The reason for the 32B/cycle writes was explained as they happen less often than reading from memory.

Anyone up for a deep dive into the Zen 2 core?

A Look At The AMD Zen 2 Core

Ahead of the highly anticipated Ryzen 3000 desktop series launch, here is a look at the AMD Zen 2 core microarchitecture.

fuse.wikichip.org

TheLostSwede · Jul 8, 2019

biffzinker said:
The reason for the 32B/cycle writes was explained as they happen less often than reading from memory.

Anyone up for a deep dive into the Zen 2 core?

A Look At The AMD Zen 2 Core

Ahead of the highly anticipated Ryzen 3000 desktop series launch, here is a look at the AMD Zen 2 core microarchitecture.

fuse.wikichip.org

Which sort of makes sense, but it also makes the single CCD CPUs look "bad" in some synthetic benchmarks. It doesn't seem to make much of a real world difference though and even the latency difference seems to be a mostly moot issue, if there even is a difference, since I've seen other tests that show none.

biffzinker · Jul 8, 2019

TheLostSwede said:
latency difference seems to be a mostly moot issue, if there even is a difference, since I've seen other tests that show none.

The Zen2 L3 Cache Latency is up compared to Zen+, and memory latency is a touch higher.

Anandtech said:
In terms of the DRAM latency, it seems that the new Ryzen 3900X has regressed by around 10ns when compared to the 2700X (Note: Just take into the leading edge of the “Structural Estimate” figures as the better estimate) with ~74-75.5ns versus ~65.7ns.

It also looks like Zen2’s L3 cache has also gained a few cycles: A change from ~7.5ns at 4.3GHz to ~8.1ns at 4.6GHz would mean a regression from ~32 cycles to ~37 cycles.
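The cycle counts in that quote follow from multiplying latency in nanoseconds by clock speed in GHz. A quick sketch to check them, using only the figures quoted above (`latency_cycles` is just an illustrative name):

```python
def latency_cycles(latency_ns: float, clock_ghz: float) -> float:
    """Convert a latency in nanoseconds to core clock cycles (ns * GHz = cycles)."""
    return latency_ns * clock_ghz


zen_plus_l3 = latency_cycles(7.5, 4.3)  # Zen+ figure from the quote
zen2_l3 = latency_cycles(8.1, 4.6)      # Zen 2 figure from the quote

print(f"Zen+ L3:  ~{zen_plus_l3:.0f} cycles")
print(f"Zen 2 L3: ~{zen2_l3:.0f} cycles")
print(f"difference: ~{zen2_l3 - zen_plus_l3:.0f} cycles")
```

So the L3 is only about 0.6 ns slower in absolute terms, but because the clock is higher that works out to roughly five more cycles.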

londiste · Jul 8, 2019

L1 and L2 are pretty much even. L3 Cache latency is slightly up but there is also twice as much L3 Cache. Memory latency is simply an inevitable tradeoff of chiplet design.
Overall it is still a noticeable improvement.

Octopuss · Jul 8, 2019

What is a CCD CPU?

Ferrum Master · Jul 8, 2019

Octopuss said:
What is a CCD CPU?

biffzinker · Jul 8, 2019

Octopuss said:
What is a CCD CPU?

TheoneandonlyMrK · Jul 8, 2019

TheLostSwede said:
This is an interesting observation that I had missed until now.
It would appear AMD has "cheaped out" on their memory controller a bit and it only runs at "half speed" on CPUs with only one CCD in them, during write operations.
Seemingly it has little effect in most applications, but if you're doing something that does a lot of intensive memory writes, you might want to consider getting a dual CCD CPU.
It does seem to have a small effect on the memory latency though.
Just a heads up, as it was not something that was particularly clear from AMD's side.

Source: https://www.guru3d.com/articles_pages/amd_ryzen_7_3700x_ryzen_9_3900x_review,21.html

Halves performance, not speed. They run at the same speed, but with one less CCD doing 16B writes, that's half the writes, and coincidentally half the reads too, since one CCD can't read as much as two.

TheLostSwede · Jul 8, 2019

theoneandonlymrk said:
Halves performance, not speed. They run at the same speed, but with one less CCD doing 16B writes, that's half the writes, and coincidentally half the reads too, since one CCD can't read as much as two.

Right, yes. Edited the titles to make that more obvious.

EarthDog · Jul 8, 2019

We caught that in testing and it was in the review. I don't think it matters much, however.

Hearing "CCD" brings me back to Catholic school/baptism/confirmation days... lol

TheLostSwede · Jul 8, 2019

EarthDog said:
We caught that in testing and it was in the review. I don't think it matters much, however.

Hearing "CCD" brings me back to Catholic school/baptism/confirmation days... lol

It doesn't seem to matter much in 99% of applications, that's for sure, at least judging by the benchmarks. It was just one of those things I really hadn't realised they'd done.
Admittedly it's right there in their presentations (if you compare 1x CCD vs 2x CCD CPUs), but they obviously didn't highlight it, for reasons.
It's just something worth getting out there for those 1% scenarios that people might run, where they might be surprised that the performance suffers.

birdie · Jul 8, 2019

Yeah, I've also noticed that: https://forums.anandtech.com/threads/ryzen-3700x-low-ram-write-speed-conundrum.2567215/

Vario · Jul 8, 2019

Can the IO Die be overclocked?

Wavetrex · Jul 8, 2019

This is basically like an old "dual-cpu + northbridge" design inside a small box, with bus speeds updated to modern day.

I remember an old Dual Pentium III system that I used many years ago, it suffered from exactly the same thing: Removing one CPU would halve memory bandwidth, even if the memory was still connected to the same northbridge. The "FSB" from one single CPU simply couldn't keep up.
Old ....... New
Pentium "Core" = CCD
Intel FSB = AMD IF
Northbridge (memory controller) + half of Southbridge = I/O die

Tbh, as long as application performance is fine, this is a non-issue.

Vya Domus · Jul 8, 2019

Vario said:
Can the IO Die be overclocked?

Doubt it, everything on a chip that is outside the core/cache/MC usually runs at vastly different speeds and requires tight timing so that it can communicate with other chips and buses.

biffzinker · Jul 8, 2019

EarthDog said:
We caught that in testing and was in the review

You do reviews for who? Guru3D?

EarthDog · Jul 8, 2019

biffzinker said:
You do reviews for who? Guru3D?

It's a bit complicated... lol

I own a site and work for another, much larger site (YHPM).

TheLostSwede · Jul 8, 2019

EarthDog said:
It's a bit complicated... lol

I own a site and work for another, much larger site (YHPM).

Now now, no need to be so shy, I'm sure there won't be any hard feelings if you share with everyone...

mouacyk · Jul 10, 2019

Hexus.net has a similar explanation to Anandtech and Overclockers.uk:

AMD says that this is a calculated design choice for Zen 2, due to most client workloads not writing as much. Halving the data link write speed between CCD and cIOD saves area, improves power, and has ancillary knock-on benefits, too. The downside is half-write speed because of the slowness of the data fabric in that direction.

I'm more curious about the max bandwidth of IF at 1800MHz or 1900MHz (for capable silicon) and what this means for all data (especially PCIE) that needs to go through the IF. I'm sure AMD has calculated it to be sufficient for all or most needs, but does anyone know any numbers? Thanks.
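As a starting point for the numbers, here is a sketch of peak CCD-to-I/O-die link bandwidth at both clocks. It assumes 32 B/cycle reads and 16 B/cycle writes per CCD, figures that posts in this thread disagree on, so treat the widths as placeholders. It says nothing about PCIe traffic, which I have no numbers for.

```python
READ_BYTES_PER_CYCLE = 32   # assumed per-CCD read width
WRITE_BYTES_PER_CYCLE = 16  # assumed per-CCD write width


def peak_gbs(bytes_per_cycle: int, fclk_mhz: int, ccds: int = 1) -> float:
    """Peak aggregate bandwidth in GB/s across the given number of CCD links."""
    return bytes_per_cycle * fclk_mhz * ccds / 1000


results = {}
for fclk in (1800, 1900):
    for ccds in (1, 2):
        results[(fclk, ccds)] = (
            peak_gbs(READ_BYTES_PER_CYCLE, fclk, ccds),
            peak_gbs(WRITE_BYTES_PER_CYCLE, fclk, ccds),
        )

for (fclk, ccds), (read_bw, write_bw) in results.items():
    print(f"FCLK {fclk} MHz, {ccds} CCD: read {read_bw:.1f} GB/s, write {write_bw:.1f} GB/s")
```

With these widths, two CCDs writing together would reach the same peak as one CCD reading, which would fit the single vs dual CCD write results discussed earlier.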

clbc12 · Jul 10, 2019

This is the exact discussion I've been desperately searching for over the past few days... I'm a VFX artist by day, indie filmmaker by night who needs to build a new PC for a personal project-- I've already bought all parts except CPU and need to complete the build ASAP, but I'm really worried about this whole "half the memory for writes" thing on the new Ryzens, because I suspect CG animation filmmaking workflows will be a 1% sort of thing vs. who these processors seem to be built for (gamers, etc.).

I animate in 3D (Maya) and 2D (After Effects), and edit in Premiere (though I may switch to Resolve soon). A huge part of my workflow is being able to play back previews of animation in real time (the fewer dropped frames, the better). I'm not terribly tech savvy and only build a workstation every 5-7 years, so it's hard for me to discern what involves writing to RAM vs. reading from it, but it seems to me that something we call "RAM previews" in AE would be writing to RAM, and I do that a lot, and really need it to be fast. Render/export speed is less important to me, as I do that far less often and can let my computer render while I'm doing other things. You guys seem to really understand a lot about this issue (I've had trouble finding people that do), so I'd appreciate it very much if you could tell me whether you think my workflow would improve or suffer if I went with, say, a 3600x vs. a 2600x, etc.? Thanks!

