AMD Ryzen 3000 "Zen 2" a Memory OC Beast, DDR4-5000 Possible

londiste · May 13, 2019

mat9v said:
I mean, you get very fast memory and have to push it through a thin straw of halved IF to CPU cores. Sure if IF at 1250Mhz is significantly faster than memory at 5000 then it is not a big issue but if not - you loose that gained speed from increasing memory frequency. The more important thing when running IF at half speed would be timings. Whatever you gain from better timings in memory modules you loose much more in IF lagging behind - memory chip optimisation would gain us 10-20ns while IF running at half speed would loose us a 50ns easily?

What Ryzen so far benefits from with fast memory is the increased IF link speed between CCXs which should be slower no matter what memory timings or speed gets to be.
Edit: reading up on IF speeds in current Zen, I assume AMD doubled the width of IF links from CCX to RAM, otherwise the divider would also limit RAM bandwidth.

mat9v said:
Now, if AMD would be to introduce 2x IF multiplier, that would enable use to lower delays introduced by slow IF link...
Consider somewhat slow memory at 3000 that results in IF running at 1500 and apply 2x to that to get IF running at 3000?

This is the entire problem and reason for introducing the divider - IF cannot run at that high a clock. There is hope that AMD has improved IF in Zen2 but we will ahve to wait and see what approach they have taken with it.

Redwoodz · May 13, 2019

mat9v said:
1/2 divider is kinda useless?
I mean, you get very fast memory and have to push it through a thin straw of halved IF to CPU cores. Sure if IF at 1250Mhz is significantly faster than memory at 5000 then it is not a big issue but if not - you loose that gained speed from increasing memory frequency. The more important thing when running IF at half speed would be timings. Whatever you gain from better timings in memory modules you loose much more in IF lagging behind - memory chip optimisation would gain us 10-20ns while IF running at half speed would loose us a 50ns easily? Consider that without any optimisations we get over 100ns for 2133 memory and corresponding 1066 IF speed and around 80 for 3200 memory and 1600 IF link - that is 20ns lag for 500Mhz of IF link speed. Sure increasing IF link speed would give us decreasingly smaller gains but still... going from 4000 memory with IF at 2000 to IF at 1000 would "kill the performance dead"
It's good only for breaking speed records.

Now, if AMD would be to introduce 2x IF multiplier, that would enable use to lower delays introduced by slow IF link...
Consider somewhat slow memory at 3000 that results in IF running at 1500 and apply 2x to that to get IF running at 3000? IF link with delays on the order of 25ns... and even with low quality memory running at 2133 or 2400 we would get IF running at 2133 and 2400. It would eliminate the problem with IF bus concurrency when communicating with memory controller and PCIEx (GPUs) that can happen in current scenarios. All that, if IF would be able to run at that speeds.... but considering that some memory kits are able to run 3733 on ZEN+ and that means 1866 for IF it is not really far from 2133...

One can dream of course.

More options are always better. Besides, if they can get BLK adjustments working as well it takes the limitations off because then you can run whatever IF speed you want.

mat9v · May 13, 2019

londiste said:
RAM speed through IF should not be a problem. What Ryzen so far benefits from with fast memory is the increased IF link speed between CCXs which should be slower no matter what memory timings or speed gets to be. Faster memory does provide its own benefits but this is separate and even comparatively slow IF link should be enough for RAM's purposes.

IF speed impacts all reads from and to memory just because it is between cores and memory. Games running on two CCXs only compound the problem but forcing them to run on only 4 cores from single CCX does not fix much if anything. It is those delayed reads and writes from/to memory that are the problem, not the link speed but delays introduced by IF link.
It is not like IF is so fast anyway, according to AMD it runs at 42GB/s at 1333Mhz (or 2666 memory clock) according to

ISSCC 2018: AMD's Zeppelin; Multi-chip routing and packaging

A look at AMD's Zeppelin SoC and the Infinity Fabric, a multi-chip architecture used by AMD to scale their SoC design from the mainstream PC market all the way to the server market.

fuse.wikichip.org

and that is very close to what memory benchmarks show for memory alone clocked at 3000Mhz. And yet, IF must also handle inter-CCX communication and PCIEx access - all that is overloading it's capacity.
Running it at twice the speed would be very good for performance if not power budget and yes I suppose AMD was not able to do so with ZEN/ZEN+, but I had hoped for ZEN2 to make it so.

Super XP · May 13, 2019

I read somewhere that Infinity Fabric will no longer be tied to the IMC. Because in order for the fabric to achieve a set bandwidth, it cannot be tied to the IMC. It's because of how ZEN2 is designed. By having a 14nm and 7nm chiplets. I'll see if I can find that preview then post it.

The Stilt · May 13, 2019

This has to be the new low from TPU, or frankly one of the many in recent times.

Stating that "DDR-5000MHz is possible" simply based on the available bios options is silly and makes no favors to anyone, the least to AMD. The same way you could state that with the current generation Ryzen CPUs DDR-4133MHz is possible, or that on Intel Coffee Lake Refresh parts DDR-5500MHz is possible. In reality of course, most current generation Ryzen users still struggle reaching higher than 3466MHz and the same way the typical best case scenario for daily use on Intel platforms is roughly 4133MHz or less (mostly due to the DRAM PCB or MB PCB signaling limits).

Actually both current gen. Ryzens already support up to DDR-8466MHz by their Phy design, but lets no let the facts to get in the way of fabricating the "news".

As I said, reporting BS like this (and the majority of other Zen 2 related rumors) is not in anyones interests.

Matisse will no doubt bring good improvements in most areas (incl. memory speeds), but everyone should keep their expectations at sane levels regardless.

If w1zzard was dead, he'd be spinning in his grave...

Super XP · May 13, 2019

The Stilt said:
This has to be the new low from TPU, or frankly one of the many in recent times.

Stating that "DDR-5000MHz is possible" simply based on the available bios options is silly and makes no favors to anyone, the least to AMD. The same way you could state that with the current generation Ryzen CPUs DDR-4133MHz is possible, or that on Intel Coffee Lake Refresh parts DDR-5500MHz is possible. In reality of course, most current generation Ryzen users still struggle reaching higher than 3466MHz and the same way the typical best case scenario for daily use on Intel platforms is roughly 4133MHz or less (mostly due to the DRAM PCB or MB PCB signaling limits).

Actually both current gen. Ryzens already support up to DDR-8466MHz by their Phy design, but lets no let the facts to get in the way of fabricating the "news".

As I said, reporting BS like this (and the majority of other Zen 2 related rumors) is not in anyones interests.

Matisse will no doubt bring good improvements in most areas (incl. memory speeds), but everyone should keep their expectations at sane levels regardless.

If w1zzard was dead, he'd be spinning in his grave...

Well I agree to a certain extent. Seems TPU isn't the only site reporting on this particular news.

As I said before, in my previous post. It seems Infinity Fabric will require at least 100 GB/s Bandwidth bidirectional in order to have enough to feed the 7nm Chiplets and the larger 14nm IO die.

I was under the impression the limiting factor for Infinity Fabric was the fact it was tied to the IMC. I still believe that's the case and the issue overall.

bug · May 13, 2019

Super XP said:
Well I agree to a certain extent. Seems TPU isn't the only site reporting on this particular news.

Well, the news is DDR4-5000 was found somewhere in Zen2's UEFI. TPU ups the ante and reports DDR4-5000 is possible with Zen2. There's a "slight" disconnect there. I don't have much of an issue with that, because I can understand what's being said, but you know many people don't read past the headlines.

The Stilt · May 13, 2019

Super XP said:
Well I agree to a certain extent. Seems TPU isn't the only site reporting on this particular news.

As I said before, in my previous post. It seems Infinity Fabric will require at least 100 GB/s Bandwidth bidirectional in order to have enough to feed the 7nm Chiplets and the larger 14nm IO die.

I was under the impression the limiting factor for Infinity Fabric was the fact it was tied to the IMC. I still believe that's the case and the issue overall.

SDF ("IF") is still tied to the MEMCLK, but this time around there is a Pll in between, which allows other than 1:1 frequency relation.

The intention is not to provide higher SDF bandwidth through higher frequency, but to allow higher MEMCLK frequencies (at a cost) to provide sufficient memory bandwidth.
Zen 2 is a wide core, and even Intel Xeons with 256-bit memory interface (QCH) become bandwidth starved in certain 256-bit workloads (not to mention 512-bit ones, hence SKL-SP uses 384-bit HCH memory config).

R0H1T · May 13, 2019

mat9v said:
And yet, IF must also handle inter-CCX communication and PCIEx access - all that is overloading it's capacity.

We don't know the layout of zen2 die, it could well be 8 cores per CCX or they might have changed the entire layout radically.

Redwoodz · May 13, 2019

The Stilt said:
This has to be the new low from TPU, or frankly one of the many in recent times.

Stating that "DDR-5000MHz is possible" simply based on the available bios options is silly and makes no favors to anyone, the least to AMD. The same way you could state that with the current generation Ryzen CPUs DDR-4133MHz is possible, or that on Intel Coffee Lake Refresh parts DDR-5500MHz is possible. In reality of course, most current generation Ryzen users still struggle reaching higher than 3466MHz and the same way the typical best case scenario for daily use on Intel platforms is roughly 4133MHz or less (mostly due to the DRAM PCB or MB PCB signaling limits).

Actually both current gen. Ryzens already support up to DDR-8466MHz by their Phy design, but lets no let the facts to get in the way of fabricating the "news".

As I said, reporting BS like this (and the majority of other Zen 2 related rumors) is not in anyones interests.

Matisse will no doubt bring good improvements in most areas (incl. memory speeds), but everyone should keep their expectations at sane levels regardless.

If w1zzard was dead, he'd be spinning in his grave...

bug said:
Well, the news is DDR4-5000 was found somewhere in Zen2's UEFI. TPU ups the ante and reports DDR4-5000 is possible with Zen2. There's a "slight" disconnect there. I don't have much of an issue with that, because I can understand what's being said, but you know many people don't read past the headlines.

Yes it is an overstatement, thread author would have been better off showing that the new Biostar X570 has supports DDR4 4000+OC right on it's box.......meaning it HAS to be able to reach those speeds in at least some cases or they will be sued.

Super XP · May 13, 2019

Well one goal for AMD should be to rectify the Infinity Fabric Latency hit. Hopefully they fixed this with the upcoming ZEN2 design.

MikeMurphy · May 13, 2019

Imsochobo said:
Yes and no, a few times memory bandwidth is important.
But take a i9 14 core, remove two sticks and do dual channel, surprise!
Doesn't affect it that much at all!

Zens major limitation is latency more than bandwidth.
But surprise! Frequency decreases latency.

So yeah, to some extent bandwidth but I feel the latency is really what they are after.

Low latency memory doesn't solve the problem of 32 threads competing for access to that memory. Bandwidth does.

bug · May 14, 2019

Super XP said:
Well one goal for AMD should be to rectify the Infinity Fabric Latency hit. Hopefully they fixed this with the upcoming ZEN2 design.

We'll know soon enough.

MikeMurphy said:
Low latency memory doesn't solve the problem of 32 threads competing for access to that memory. Bandwidth does.

In theory, low-latency is better when you need to access bits of memory frequently. But we already have 3 layers of cache take care of that. So yeah, bandwidth with be #1 on my watchlist too.

londiste · May 14, 2019

R0H1T said:
We don't know the layout of zen2 die, it could well be 8 cores per CCX or they might have changed the entire layout radically.

Didn't @1usmus dig out the confirmation of 4-core CCX from BIOS images?

bug · May 14, 2019

londiste said:
Didn't @1usmus dig out the confirmation of 4-core CCX from BIOS images?

AMD Ryzen 3rd Gen 'Matisse' Coming Mid 2019: Eight Core Zen 2 with PCIe 4.0 on Desktop

www.anandtech.com

That's all I know about Zen2. No idea if there's one or two CCXes on that die.

Super XP · May 14, 2019

bug said:
AMD Ryzen 3rd Gen 'Matisse' Coming Mid 2019: Eight Core Zen 2 with PCIe 4.0 on Desktop

www.anandtech.com

That's all I know about Zen2. No idea if there's one or two CCXes on that die.

By utilizing 7nm Chiplets AMD can custom build CCXs with potentially 2 or 3 or 4 or 6 or 8 cores etc.
Well that is what I read once that's it's possible all by moving everything on that 14nm I/O and keeping the CPU chipsets separate. Who knows really.

londiste · May 14, 2019

Super XP said:
By utilizing 7nm Chiplets AMD can custom build CCXs with potentially 2 or 3 or 4 or 6 or 8 cores etc.
Well that is what I read once that's it's possible all by moving everything on that 14nm I/O and keeping the CPU chipsets separate. Who knows really.

CPU Complex (CCX) is the primary arhitectural multicore building block of Zen architectures. CCX size has nothing to do with I/O core, chiplets or 7nm.

Super XP · May 14, 2019

londiste said:
CPU Complex (CCX) is the primary arhitectural multicore building block of Zen architectures. CCX size has nothing to do with I/O core, chiplets or 7nm.

Perhaps I didn't make my comment clear enough. That 14nm IO can remain the same, keeping costs down all while allowing more customization with the 7nm chiplets.

Quote:
Of particular note, the chip incorporates a new-to-AMD chiplet based design approach, using separate I/O and CPU dies to simplify manufacturing and allow for easier chip customization.

https://www.anandtech.com/show/14286/amd-7nm-navi-gpu-and-rome-cpu-to-launch-in-q3

londiste · May 14, 2019

Super XP said:
Perhaps I didn't make my comment clear enough. That 14nm IO can remain the same, keeping costs down all while allowing more customization with the 7nm chiplets.

You are not using correct terminology then.

CCX is - as its name says - a core complex. In Zen it contains 4 cores and L3 cache (when looking at it on a high level). In a Zen/Zen+ chip die there are two such CCXs connected to Scalable Data Fabric (SDF) that we can characterize as IF hub where everything in the CPU connects to - CCXs, memory controllers, IO Hub. There is only one CCX configuration, while individual cores can be disabled in it there will not be multiple CCX configurations in the same generation. There are good reasons for expecting CCXs to remain at 4 cores in Zen 2.

Cores per die and cores per package are decidedly different from architectural features.

Amount of cores in Zen 2 CCX that we do not know is important because cores inside CCX can very quickly communicate with each other but communication with cores in a different CCX (even when it is on the same die) takes longer as it goes through IF connections.

Super XP · May 14, 2019

Thanks for the explanation, I'm already aware of the inner workings of the CCX.

It's just that I read somewhere in a previous article (Pre ZEN+ release), where the author speculated that one possible reason for utilizing this ZEN2 design approach was to potentially benefit from customizable 7nm Chiplets.

Anyhow, your explanation is clarity enough. Thank You,

londiste · May 14, 2019

Super XP said:
It's just that I read somewhere in a previous article (Pre ZEN+ release), where the author speculated that one possible reason for utilizing this ZEN2 design approach was to potentially benefit from customizable 7nm Chiplets.

Chiplet design benefits from higher yields due to smaller individual dies. This makes the design much cheaper than monolithic die with the same core count. This is practically the only benefit but it is a big one.

There are some negatives as well. IF links between dies is slightly slower in its current form and does use more power. Whether chiplet design adds complexity to the package is not sure yet but it is likely. With memory controller in the I/O die, memory is inevitably further away from the CPU cores, increasing latency. How much and how AMD has mitigated that - we will see soon.

Customizable chiplets can be a huge boon for custom market - consoles primarily. Maybe (a big maybe) for laptops. On desktop as we know it, customizable chiplets in terms of adding a GPU for an APU does not look like too good of a solution. With I/O Die, memory is far away and GPU is very dependent on memory (usually more bandwidth than latency but still). AM4 does not have enough space or pins to add HBM or some direct connected RAM. TR4/SM3 are unlikely candidates for integrated GPU.

There are a lot of thoughts being shared about stacked dies but with current CPU parts (including 7nm), power density will be a huge problem.

Redwoodz · May 15, 2019

londiste said:
Chiplet design benefits from higher yields due to smaller individual dies. This makes the design much cheaper than monolithic die with the same core count. This is practically the only benefit but it is a big one.

There are some negatives as well. IF links between dies is slightly slower in its current form and does use more power. Whether chiplet design adds complexity to the package is not sure yet but it is likely. With memory controller in the I/O die, memory is inevitably further away from the CPU cores, increasing latency. How much and how AMD has mitigated that - we will see soon.

Customizable chiplets can be a huge boon for custom market - consoles primarily. Maybe (a big maybe) for laptops. On desktop as we know it, customizable chiplets in terms of adding a GPU for an APU does not look like too good of a solution. With I/O Die, memory is far away and GPU is very dependent on memory (usually more bandwidth than latency but still). AM4 does not have enough space or pins to add HBM or some direct connected RAM. TR4/SM3 are unlikely candidates for integrated GPU.

There are a lot of thoughts being shared about stacked dies but with current CPU parts (including 7nm), power density will be a huge problem.

Yes but I/O die on a cool, mature 14nm could be clocked high enough to mitigate that latency.

londiste · May 15, 2019

Redwoodz said:
Yes but I/O die on a cool, mature 14nm could be clocked high enough to mitigate that latency.

What would be clocked high enough? Memory controller is tied to memory speed but the resulting bandwidth needs to fit through IF. IF has two endpoints, the other one is in the CPU cores' chiplet. Their best bet is probably making IF wider.

Midland Dog · May 15, 2019

londiste said:
Less memory modules is better than more memory modules. Go for 2x16GB instead of 4x8GB (assuming same speeds etc).

asssuming your cpu has a 128bit imc yeah, anything more than bandwidth goes up exponentially by using all channels

Gasaraki · May 15, 2019

Motherboard BIOS options doesn't mean it's work at that speed.

Processor	Ryzen 7800X3D
Motherboard	ROG STRIX B650E-F GAMING WIFI
Memory	2x16GB G.Skill Flare X5 DDR5-6000 CL36 (F5-6000J3636F16GX2-FX5)
Video Card(s)	INNO3D GeForce RTX™ 4070 Ti SUPER TWIN X2
Storage	2TB Samsung 980 PRO, 4TB WD Black SN850X
Display(s)	42" LG C2 OLED, 27" ASUS PG279Q
Case	Thermaltake Core P5
Power Supply	Fractal Design Ion+ Platinum 760W
Mouse	Corsair Dark Core RGB Pro SE
Keyboard	Corsair K100 RGB
VR HMD	HTC Vive Cosmos

System Name	Homemade :)
Processor	Ryzen 3900X@4.35Ghz
Motherboard	Asus X570 Prime Pro
Cooling	Alphacool Eisbear 420 + 6x Silent Wings 3
Memory	2x16GB A-Data Gammix D10 3200-15-15-15-31@3733-19-18-18-38 1:1
Video Card(s)	Palit GameRock GTX1080Ti +110 core +200 mem
Storage	WD SN750 1TB + 3x Crucial MX200 500GB RAID0
Display(s)	4k TV + Benq XL2730Z
Case	Phanteks Enthoo Pro
Audio Device(s)	AIM 808
Power Supply	Seasonic Focus Plus 850W
Mouse	Logitech G603
Keyboard	Razer BlackWidow Chroma
Software	Windows 10 LTSC

System Name	RiseZEN Gaming PC
Processor	AMD Ryzen 7 5800X @ Auto
Motherboard	Asus ROG Strix X570-E Gaming ATX Motherboard
Cooling	Corsair H115i Elite Capellix AIO, 280mm Radiator, Dual RGB 140mm ML Series PWM Fans
Memory	G.Skill TridentZ 64GB (4 x 16GB) DDR4 3200
Video Card(s)	ASUS DUAL RX 6700 XT DUAL-RX6700XT-12G
Storage	Corsair MP510 480GB M.2 - 2 x WD_BLACK 1TB SN850X M.2 1TB - Lexar NQ790 M.2 2TB
Display(s)	ASUS ROG Strix 34” XG349C 144Hz 1440p + Asus ROG 27" MG278Q 144Hz WQHD 1440p
Case	Corsair Obsidian Series 450D Gaming Case
Audio Device(s)	SteelSeries 5Hv2 w/ Sound Blaster Z SE
Power Supply	Corsair RM750x Power Supply
Mouse	Razer Death-Adder + Viper 8K HZ Ambidextrous Gaming Mouse - Ergonomic Left Hand Edition
Keyboard	Logitech G910 Orion Spectrum RGB Gaming Keyboard
Software	Windows 10 Pro - 64-Bit Edition (Back to Win 10 because 11 is garbage)
Benchmark Scores	I'm the Doctor, Doctor Who. The Definition of Gaming is PC Gaming...

System Name	RiseZEN Gaming PC
Processor	AMD Ryzen 7 5800X @ Auto
Motherboard	Asus ROG Strix X570-E Gaming ATX Motherboard
Cooling	Corsair H115i Elite Capellix AIO, 280mm Radiator, Dual RGB 140mm ML Series PWM Fans
Memory	G.Skill TridentZ 64GB (4 x 16GB) DDR4 3200
Video Card(s)	ASUS DUAL RX 6700 XT DUAL-RX6700XT-12G
Storage	Corsair MP510 480GB M.2 - 2 x WD_BLACK 1TB SN850X M.2 1TB - Lexar NQ790 M.2 2TB
Display(s)	ASUS ROG Strix 34” XG349C 144Hz 1440p + Asus ROG 27" MG278Q 144Hz WQHD 1440p
Case	Corsair Obsidian Series 450D Gaming Case
Audio Device(s)	SteelSeries 5Hv2 w/ Sound Blaster Z SE
Power Supply	Corsair RM750x Power Supply
Mouse	Razer Death-Adder + Viper 8K HZ Ambidextrous Gaming Mouse - Ergonomic Left Hand Edition
Keyboard	Logitech G910 Orion Spectrum RGB Gaming Keyboard
Software	Windows 10 Pro - 64-Bit Edition (Back to Win 10 because 11 is garbage)
Benchmark Scores	I'm the Doctor, Doctor Who. The Definition of Gaming is PC Gaming...

Processor	Intel i5-12600k
Motherboard	Asus H670 TUF
Cooling	Arctic Freezer 34
Memory	2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s)	EVGA GTX 1060 SC
Storage	500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s)	Dell U3219Q + HP ZR24w
Case	Raijintek Thetis
Audio Device(s)	Audioquest Dragonfly Red :D
Power Supply	Seasonic 620W M12
Mouse	Logitech G502 Proteus Core
Keyboard	G.Skill KM780R
Software	Arch Linux + Win10

System Name	Sillicon Nightmares
Processor	Intel i7 9700KF 5ghz (5.1ghz 4 core load, no avx offset), 4.7ghz ring, 1.412vcore 1.3vcio 1.264vcsa
Motherboard	Asus Z390 Strix F
Cooling	DEEPCOOL Gamer Storm CAPTAIN 360
Memory	2x8GB G.Skill Trident Z RGB (B-Die) 3600 14-14-14-28 1t, tRFC 220 tREFI 65535, tFAW 16, 1.545vddq
Video Card(s)	ASUS GTX 1060 Strix 6GB XOC, Core: 2202-2240, Vcore: 1.075v, Mem: 9818mhz (Sillicon Lottery Jackpot)
Storage	Samsung 840 EVO 1TB SSD, WD Blue 1TB, Seagate 3TB, Samsung 970 Evo Plus 512GB
Display(s)	BenQ XL2430 1080p 144HZ + (2) Samsung SyncMaster 913v 1280x1024 75HZ + A Shitty TV For Movies
Case	Deepcool Genome ROG Edition
Audio Device(s)	Bunta Sniff Speakers From The Tip Edition With Extra Kenwoods
Power Supply	Corsair AX860i/Cable Mod Cables
Mouse	Logitech G602 Spilled Beer Edition
Keyboard	Dell KB4021
Software	Windows 10 x64
Benchmark Scores	13543 Firestrike (3dmark.com/fs/22336777) 601 points CPU-Z ST 37.4ns AIDA Memory

System Name	Unimatrix
Processor	Intel i9-9900K @ 5.0GHz
Motherboard	ASRock x390 Taichi Ultimate
Cooling	Custom Loop
Memory	32GB GSkill TridentZ RGB DDR4 @ 3400MHz 14-14-14-32
Video Card(s)	EVGA 2080 with Heatkiller Water Block
Storage	2x Samsung 960 Pro 512GB M.2 SSD in RAID 0, 1x WD Blue 1TB M.2 SSD
Display(s)	Alienware 34" Ultrawide 3440x1440
Case	CoolerMaster P500M Mesh
Power Supply	Seasonic Prime Titanium 850W
Keyboard	Corsair K75
Benchmark Scores	Really Really High