• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Ryzen 7 4700GE Memory Benchmarked: Extremely Low Latency Explains Tiny L3 Caches

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
41,062 (8.27/day)
Location
Hyderabad, India
Processor AMD Ryzen 7 2700X
Motherboard ASUS ROG Strix B450-E Gaming
Cooling AMD Wraith Prism
Memory 2x 16GB Corsair Vengeance LPX DDR4-3000
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) Creative Sound Blaster Recon3D PCIe
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Microsoft Sidewinder X4
Software Windows 10 Pro
AMD's 7 nm "Renoir" APU silicon, which features eight "Zen 2" CPU cores, has only a quarter of the L3 cache of the 8-core "Zen 2" CCD used in "Matisse," "Rome," and "Castle Peak" processors, with each of its two quad-core compute complexes (CCXs) featuring just 4 MB of it (compared to 16 MB per CCX on the 8-core "Zen 2" CCD). Chinese-language tech publication TecLab pubished a quick review of an alleged Ryzen 7 4700GE socket AM4 processor based on the "Renoir" silicon, and discovered that the chip offers significantly lower memory latencies than "Matisse," posting just 47.6 ns latency when paired with DDR4-4233 dual-channel memory.

In comparison, a Ryzen 9 3900X with these kinds of memory clocks typically posts 60-70 ns latencies, owing to the MCM design of "Matisse," where the CPU cores and memory controllers sit on separate dies, which is one of the key reasons AMD is believed to have doubled the L3 cache amount per CCX compared to previous-generation "Zeppelin" dies. TecLab tested the alleged 4700GE engineering sample on a ROG Crosshair VIII Impact X570 motherboard that has 1 DIMM per channel (the best possible memory topology).



View at TechPowerUp Main Site
 
Joined
Aug 14, 2009
Messages
194 (0.05/day)
Location
Denmark
System Name Bongfjaes
Processor AMD 3700x
Motherboard Assus Crosshair VII Hero
Cooling Dark Rock Pro 4
Memory 2x8GB G.Skill FlareX 3200MT/s CL14
Video Card(s) GTX 970
Storage Adata SX8200 Pro 1TB + Lots of spinning rust
Display(s) Viewsonic VX2268wm
Case Fractal Design R6
Audio Device(s) Creative SoundBlaster AE-5
Power Supply Seasonic TTR-1000
Mouse Pro Intellimouse
Keyboard SteelKeys 6G
I dont know man, 4333CL 14-13-13-28 doesnt really show us much ,except that IF fabric speed can go higher.

Current ryzen 3000 series desktop cpus would probably go super close to that if it wouldnt desync the fclk with the others

Would be more interesting to see what it does on 3200cl14 for example, or 3600 cl 14 at least
The amount of people that has kits that goes to 4333 cl14-13-13-28 is pretty low
 
Joined
Feb 19, 2009
Messages
1,102 (0.25/day)
Location
I live in Norway
System Name 3 sys spec seperated by "|"
Processor R9 3900x| R7 1700 @3.75 | 4800H
Motherboard Asrock X570M | AB350M Pro 4 | Asus Tuf A15
Cooling Air | Air | duh laptop
Memory 64gb G.skill SniperX @3600 CL16 | 64GB | 32GB
Video Card(s) XFX RX 6800 Speedster |V64\Quadro P4000 | RTX2060M
Storage MP510 2TB, 660P 2TB, 2x860 evo 1tb | 960 500gb Intel 660P 1tb PM871 4x256gb ++| 1TB 660+ 1tb A1000
Display(s) AOC 28" 4K something + 1440p AOC 144hz something.
Case Phanteks EvolvX M-Atx
Power Supply Corsair RM850
Mouse g502 Lightspeed
Keyboard G915
Software win10,unraid,Manjaro
Benchmark Scores 30000FS, 16300 TS. Lappy, 7000 TS.
I dont know man, 4333CL 14-13-13-28 doesnt really show us much ,except that IF fabric speed can go higher.

Current ryzen 3000 series desktop cpus would probably go super close to that if it wouldnt desync the fclk with the others

Would be more interesting to see what it does on 3200cl14 for example, or 3600 cl 14 at least
The amount of people that has kits that goes to 4333 cl14-13-13-28 is pretty low

about 5ns lower latency at jedec cl22 3200 vs matisse in my testing.
 
Joined
Jan 8, 2017
Messages
6,647 (4.19/day)
System Name Good enough
Processor AMD Ryzen R7 1700X - 4.0 Ghz / 1.350V
Motherboard ASRock B450M Pro4
Cooling Deepcool Gammaxx L240 V2
Memory 16GB - Corsair Vengeance LPX - 3333 Mhz CL16
Video Card(s) OEM Dell GTX 1080 with Kraken G12 + Water 3.0 Performer C
Storage 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) 4K Samsung TV
Case Deepcool Matrexx 70
Power Supply GPS-750C
More like the tiny L3 cache explains the low latency. Generally, the smaller the cache the less time it takes to read/write to a particular cache line and therefore the overall average memory access time goes down.
 
Joined
Nov 11, 2004
Messages
7,992 (1.33/day)
Location
Formosa
System Name Overlord Mk MXVI
Processor AMD Ryzen 7 3800X
Motherboard Gigabyte X570 Aorus Master
Cooling Corsair H115i Pro
Memory 32GB Viper Steel 3600 DDR4 @ 3800MHz 16-19-16-19-36
Video Card(s) Gigabyte RTX 2080 Gaming OC 8G
Storage 1TB WD Black NVMe (2018), 2TB Viper VPN100, 1TB WD Blue 3D NAND
Display(s) Asus PG27AQ
Case Corsair Carbide 275Q
Audio Device(s) Corsair Virtuoso SE
Power Supply Corsair RM750
Mouse Logitech G502 Lightspeed
Keyboard Wooting Two
Software Windows 10 Pro
Benchmark Scores https://valid.x86.fr/33u9si
Actually, going above 3800MHz on a Ryzen 3000 CPU would end up somewhere around 80ns+
 
Joined
Oct 28, 2012
Messages
723 (0.23/day)
Processor AMD Ryzen 3700x
Motherboard asus ROG Strix B-350I Gaming
Cooling Noctua NH-U12-S Chromax.Black
Memory crucial ballistix 32Gb DDR4
Video Card(s) RTX 3070 FE
Storage WD sn550 1To/WD ssd sata 1To /Samsung 960 evo 256 Gb/Seagate 2To/WD book 4 To back-up
Display(s) LG 25UM58
Case Cooler Master NR200p
Audio Device(s) sennheiser HD58X
Power Supply bequiet SFX L power 600w
Mouse MX master 3
Keyboard Master Key Mx
Software win 10 pro
More like the tiny L3 cache explains the low latency. Generally, the smaller the cache the less time it takes to read/write to a particular cache line and therefore the overall average memory access time goes down.
The lower latency can't be all about that. That would mean that AMD actually made a huge mistake with regular zen2 and effectivelly reduced the gaming performance with the "game cache"
 
Joined
Jan 8, 2017
Messages
6,647 (4.19/day)
System Name Good enough
Processor AMD Ryzen R7 1700X - 4.0 Ghz / 1.350V
Motherboard ASRock B450M Pro4
Cooling Deepcool Gammaxx L240 V2
Memory 16GB - Corsair Vengeance LPX - 3333 Mhz CL16
Video Card(s) OEM Dell GTX 1080 with Kraken G12 + Water 3.0 Performer C
Storage 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) 4K Samsung TV
Case Deepcool Matrexx 70
Power Supply GPS-750C
The lower latency can't be all about that. That would mean that AMD actually made a huge mistake with regular zen2 and effectivelly reduced the gaming performance with the "game cache"

Nah, cache size will always be more beneficial than slightly lower memory access time.
 
Joined
Jun 16, 2015
Messages
11 (0.01/day)
Processor Ryzen 9 5900x @4500mhz 1.16v
Motherboard Gigabyte X570 I Aeorus Pro Wifi
Cooling Noctua NH-U12A
Memory G.SKILL 32GB KIT DDR4 3600 MHz CL16 Trident Z @3666MHz tuned by Ryzen calculator
Video Card(s) ASUS GeForce TUF RTX 3070 8G GAMING @1905/8000MHz 0.9v
Storage ADATA XPG SX8200 Pro SSD 2TB
Display(s) 34" Dell AW3418DW Alienware
Case LIAN-LI TU150
Audio Device(s) Logitech G PRO
Power Supply Corsair SF750
Mouse Logitech G PRO X Superlight
Keyboard Logitech G413
Actually, going above 3800MHz on a Ryzen 3000 CPU would end up somewhere around 80ns+
Maybe there is 2100Mhz fclock ,1:1 mclock:uclock and this latency number is possible.
 
Joined
Oct 22, 2014
Messages
11,914 (4.97/day)
Location
Sunshine Coast
System Name Black Box
Processor Intel i5-9600KF
Motherboard NZXT N7 Z370 Black
Cooling Cooler Master 240 RGB AIO / Stock
Memory Thermaltake Toughram 16GB 4400MHz DDR4 or Gigabyte 16GB 3600MHz DDR4 or Adata 8GB 2133Mhz DDR4
Video Card(s) Asus Dual 1060 6GB
Storage Kingston A2000 512Gb NVME
Display(s) AOC 24" Freesync 1m.s. 75Hz
Case Corsair 450D High Air Flow.
Audio Device(s) No need.
Power Supply FSP Aurum 650W
Mouse Yes
Keyboard Of course
Software W10 Pro 64 bit
They're comparing an APU to a normal CPU, and it's the low power version too (GE).
 
Last edited:
Joined
Aug 22, 2016
Messages
122 (0.07/day)
The cache latencies arent dramatcly lower, but expected for the cache size. The memory latencies I think is just for the memcontroller beeing so close to the CPU and 7nm as well.
Here are my 3600 4.2Ghz results, with the best mem stable mem settings that matisse can do.
latencies.png
 
Joined
Apr 30, 2011
Messages
1,853 (0.51/day)
Location
Greece
Processor AMD Ryzen 5 2600X@95W
Motherboard MSI B450 Tomahawk MAX
Cooling Deepcool Gammaxx 400 Black
Memory 2*8GB PATRIOT PVS416G373C7K@3333MT_C16
Video Card(s) Sapphire Radeon RX 5700 Pulse 8GB
Storage Sandisk SSD 120GB, INTEL 540S SSDSCKKW180H6 180GB, Samsung F1 1TB, Hitachi HUS724040ALE640 4TB
Display(s) AOC 27G2U/BK IPS 144Hz
Case SHARKOON M25-W 7.1 BLACK
Audio Device(s) Realtek 7.1 onboard
Power Supply Zalman Z550
Mouse Sharkoon SHARK Force Black
Keyboard Trust GXT280
Software Win 7 sp1 64bit/Win 10 pro 64bit
Benchmark Scores CB R15 64bit: single core 173p, multicore 1306p
Now think of Zen3 having L3cache of Zen2 size with latencies matching or better than those of Renoir and clock speeds close to 5GHz.
 
Joined
Feb 3, 2017
Messages
2,970 (1.90/day)
Processor R5 5600X
Motherboard ASUS ROG STRIX B550-I GAMING
Cooling Alpenföhn Black Ridge
Memory 2*16GB DDR4-2666 VLP @3800
Video Card(s) Geforce RTX 3070 FE
Storage 1TB Samsung 970 Pro, 2TB Intel 660p
Display(s) ASUS PG279Q, Eizo EV2736W
Case Dan Cases A4-SFX
Power Supply Corsair SF600
Mouse Corsair Ironclaw Wireless RGB
Keyboard Corsair K60
Looking at the latency charts in TPU Forums (https://www.techpowerup.com/forums/...-go-memory-latency-competition-aida64.263929/) very noticeable improvment but does not seem to quite catch Intel's memory latency yet.

The closest and most comparable results to the 47.6 on the screenshot seem to be:
4200CL18 on 9600KF at 44.5
4266CL15 on 9900K at 33.6
(Keep in mind that compared to 4233CL14, 4266CL15 should be about 6% slower and 4200CL18 almost 30% slower in raw latency)
 
Last edited:
Joined
Mar 31, 2014
Messages
1,268 (0.49/day)
Location
Grunn
System Name Indis the Fair
Processor R5 3600 (PBO 150/130/130, 73c temp limit, FCLK/UCLK 1866)
Motherboard Asus Prime X470 Pro
Cooling Heatkiller VI Pro, VPP755 V.3, XT45 240mm, 2xA12x25, Arctic P14 case fans
Memory G.Skill Ripjaws V 2x16GB 4000 16-19-19 (b-die@3733 14-15/9-13-26-36 1.45v)
Video Card(s) EVGA 2080 Super Hybrid (A12x25 fan)
Storage 860EVO 500GB, 660p 1TB, WD Blue 3D 1TB, Sandisk Ultra 3D 2TB
Display(s) BenQ XL2430T, Dell P2417H
Case Phanteks Enthoo Pro M
Audio Device(s) DT770 Pro, Topping A50, Focusrite Scarlett 2i2, Røde VXLR+, Modmic 5
Power Supply Seasonic 860w Platinum
Mouse Razer Viper Mini, Razer Gigantus
Keyboard GMMK Fullsize v2 (Gateron Browns)
Software Win10 x64/Win7 x64/Ubuntu
+500MHz FCLK on top of those Anandtech results makes a difference...
The amount of people that has kits that goes to 4333 cl14-13-13-28 is pretty low
Most b-die kits will do around 4000-4400 with CAS14, but that would be at benching voltages (1.7-1.8v, iirc 1.8v is the max DRAM voltage Asus non crosshair/maximus etc boards) with maxmem. Just about any decent bin of b-die does 3666-3800 at CAS14, 14 ticks at 3800 is equivalent to 16 ticks at 4333 in terms of latency.

The frequency depends a bit more on the motherboard but many newer 8Gbit ICs don't struggle to run into the mid 4000s on recent motherboards. Stuff like Rev E, DJR, and D-die for example... I expect with normal voltages for these to land around 10ns quicker than what is currently being done on Matisse.
Nah, cache size will always be more beneficial than slightly lower memory access time.
Depends on the access patterns of the program. Ryzen's L3 also gets used differently than Intel's skylake/xcove L3 because of Ryzen using exclusive victim caching while intel has been using inclusive (to L2).
 
Last edited:
Joined
Feb 25, 2012
Messages
48 (0.01/day)
More like the tiny L3 cache explains the low latency. Generally, the smaller the cache the less time it takes to read/write to a particular cache line and therefore the overall average memory access time goes down.
Completely wrong.
Matisse and Renoir have the same L3$ associativity, that means L3$ tag check has the same latency.
 
Joined
Jan 8, 2017
Messages
6,647 (4.19/day)
System Name Good enough
Processor AMD Ryzen R7 1700X - 4.0 Ghz / 1.350V
Motherboard ASRock B450M Pro4
Cooling Deepcool Gammaxx L240 V2
Memory 16GB - Corsair Vengeance LPX - 3333 Mhz CL16
Video Card(s) OEM Dell GTX 1080 with Kraken G12 + Water 3.0 Performer C
Storage 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) 4K Samsung TV
Case Deepcool Matrexx 70
Power Supply GPS-750C
Completely wrong.
Matisse and Renoir have the same L3$ associativity, that means L3$ tag check has the same latency.
I said generally, the larger the cache and the more lines there are the more tags need to be checked.
 
Joined
Feb 25, 2012
Messages
48 (0.01/day)
I said generally, the larger the cache and the more lines there are the more tags need to be checked.
Number of tags are need to be checked depends on its associativity only. Renoir and Matisse have 16-way L3$.
Also both chips have the same 10ns L3$ access latency, it means dram access penalty is the same too.
 
Joined
Feb 19, 2009
Messages
1,102 (0.25/day)
Location
I live in Norway
System Name 3 sys spec seperated by "|"
Processor R9 3900x| R7 1700 @3.75 | 4800H
Motherboard Asrock X570M | AB350M Pro 4 | Asus Tuf A15
Cooling Air | Air | duh laptop
Memory 64gb G.skill SniperX @3600 CL16 | 64GB | 32GB
Video Card(s) XFX RX 6800 Speedster |V64\Quadro P4000 | RTX2060M
Storage MP510 2TB, 660P 2TB, 2x860 evo 1tb | 960 500gb Intel 660P 1tb PM871 4x256gb ++| 1TB 660+ 1tb A1000
Display(s) AOC 28" 4K something + 1440p AOC 144hz something.
Case Phanteks EvolvX M-Atx
Power Supply Corsair RM850
Mouse g502 Lightspeed
Keyboard G915
Software win10,unraid,Manjaro
Benchmark Scores 30000FS, 16300 TS. Lappy, 7000 TS.
The cache latencies arent dramatcly lower, but expected for the cache size. The memory latencies I think is just for the memcontroller beeing so close to the CPU and 7nm as well.
Here are my 3600 4.2Ghz results, with the best mem stable mem settings that matisse can do.
View attachment 160720

The physical difference have no major impact to memory latencies.
It's interconnect and purely interconnect which matters (Yes there is a physical difference in delay but who's counting 0.2ns or so)
however, the cpu and memory controller on the same die may allow the frequency of said interconnect at higher frequency as it's not going across a substrate to another chip and thus why it clocks higher.

Just a tiny correction, and information as many thing physical distance matters for latency and no it does not it does have massive implications to power consumption which is the drawback of chiplets :).
 
Joined
Jan 8, 2017
Messages
6,647 (4.19/day)
System Name Good enough
Processor AMD Ryzen R7 1700X - 4.0 Ghz / 1.350V
Motherboard ASRock B450M Pro4
Cooling Deepcool Gammaxx L240 V2
Memory 16GB - Corsair Vengeance LPX - 3333 Mhz CL16
Video Card(s) OEM Dell GTX 1080 with Kraken G12 + Water 3.0 Performer C
Storage 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) 4K Samsung TV
Case Deepcool Matrexx 70
Power Supply GPS-750C
Yes there is a physical difference in delay but who's counting 0.2ns or so

AMD is definitely counting those or anyone else that's making a chip. When you're accessing a cache millions of times a second you're going to start and feel those 0.2 of a nanosecond.
 
Joined
Mar 21, 2016
Messages
945 (0.50/day)
Nah, cache size will always be more beneficial than slightly lower memory access time.
Exactly being out of memory is far worse between the two. I'm wager we'll step into the 32GB minimum requirement for system memory on games before the next console generation is over and possibly cross into 64GB requirements in certain scenario's high resolutions and high AA/AF that's bound to happen. Hopefully we'll have some 64GB GPU cards by then at least the workstation level I'd anticipate it and the low end card will probably have 16GB by that point in time.
 
Top