AMD Ryzen 7 4700GE Memory Benchmarked: Extremely Low Latency Explains Tiny L3 Caches

btarunr · Jun 30, 2020

AMD's 7 nm "Renoir" APU silicon, which features eight "Zen 2" CPU cores, has only a quarter of the L3 cache of the 8-core "Zen 2" CCD used in "Matisse," "Rome," and "Castle Peak" processors, with each of its two quad-core compute complexes (CCXs) featuring just 4 MB of it (compared to 16 MB per CCX on the 8-core "Zen 2" CCD). Chinese-language tech publication TecLab pubished a quick review of an alleged Ryzen 7 4700GE socket AM4 processor based on the "Renoir" silicon, and discovered that the chip offers significantly lower memory latencies than "Matisse," posting just 47.6 ns latency when paired with DDR4-4233 dual-channel memory.

In comparison, a Ryzen 9 3900X with these kinds of memory clocks typically posts 60-70 ns latencies, owing to the MCM design of "Matisse," where the CPU cores and memory controllers sit on separate dies, which is one of the key reasons AMD is believed to have doubled the L3 cache amount per CCX compared to previous-generation "Zeppelin" dies. TecLab tested the alleged 4700GE engineering sample on a ROG Crosshair VIII Impact X570 motherboard that has 1 DIMM per channel (the best possible memory topology).

View at TechPowerUp Main Site

Fouquin · Jun 30, 2020

I feel like I'm living in some kind of split timeline where we didn't already know Renoirs specs and seen reviews of this silicon in action already.

Axaion · Jun 30, 2020

I dont know man, 4333CL 14-13-13-28 doesnt really show us much ,except that IF fabric speed can go higher.

Current ryzen 3000 series desktop cpus would probably go super close to that if it wouldnt desync the fclk with the others

Would be more interesting to see what it does on 3200cl14 for example, or 3600 cl 14 at least
The amount of people that has kits that goes to 4333 cl14-13-13-28 is pretty low

Imsochobo · Jun 30, 2020

Axaion said:
I dont know man, 4333CL 14-13-13-28 doesnt really show us much ,except that IF fabric speed can go higher.

Current ryzen 3000 series desktop cpus would probably go super close to that if it wouldnt desync the fclk with the others

Would be more interesting to see what it does on 3200cl14 for example, or 3600 cl 14 at least
The amount of people that has kits that goes to 4333 cl14-13-13-28 is pretty low

about 5ns lower latency at jedec cl22 3200 vs matisse in my testing.

Vya Domus · Jun 30, 2020

More like the tiny L3 cache explains the low latency. Generally, the smaller the cache the less time it takes to read/write to a particular cache line and therefore the overall average memory access time goes down.

TheLostSwede · Jun 30, 2020

Actually, going above 3800MHz on a Ryzen 3000 CPU would end up somewhere around 80ns+

dyonoctis · Jun 30, 2020

Vya Domus said:
More like the tiny L3 cache explains the low latency. Generally, the smaller the cache the less time it takes to read/write to a particular cache line and therefore the overall average memory access time goes down.

The lower latency can't be all about that. That would mean that AMD actually made a huge mistake with regular zen2 and effectivelly reduced the gaming performance with the "game cache"

Vya Domus · Jun 30, 2020

dyonoctis said:
The lower latency can't be all about that. That would mean that AMD actually made a huge mistake with regular zen2 and effectivelly reduced the gaming performance with the "game cache"

Nah, cache size will always be more beneficial than slightly lower memory access time.

HABO · Jun 30, 2020

TheLostSwede said:
Actually, going above 3800MHz on a Ryzen 3000 CPU would end up somewhere around 80ns+

Maybe there is 2100Mhz fclock ,1:1 mclock:uclock and this latency number is possible.

Caring1 · Jun 30, 2020

They're comparing an APU to a normal CPU, and it's the low power version too (GE).

Bruno Vieira · Jun 30, 2020

The cache latencies arent dramatcly lower, but expected for the cache size. The memory latencies I think is just for the memcontroller beeing so close to the CPU and 7nm as well.
Here are my 3600 4.2Ghz results, with the best mem stable mem settings that matisse can do.

HD64G · Jun 30, 2020

Now think of Zen3 having L3cache of Zen2 size with latencies matching or better than those of Renoir and clock speeds close to 5GHz.

londiste · Jun 30, 2020

Looking at the latency charts in TPU Forums (https://www.techpowerup.com/forums/...-go-memory-latency-competition-aida64.263929/) very noticeable improvment but does not seem to quite catch Intel's memory latency yet.

The closest and most comparable results to the 47.6 on the screenshot seem to be:
4200CL18 on 9600KF at 44.5
4266CL15 on 9900K at 33.6
(Keep in mind that compared to 4233CL14, 4266CL15 should be about 6% slower and 4200CL18 almost 30% slower in raw latency)

GorbazTheDragon · Jun 30, 2020

Fouquin said:
I feel like I'm living in some kind of split timeline where we didn't already know Renoirs specs and seen reviews of this silicon in action already.

+500MHz FCLK on top of those Anandtech results makes a difference...

Axaion said:
The amount of people that has kits that goes to 4333 cl14-13-13-28 is pretty low

Most b-die kits will do around 4000-4400 with CAS14, but that would be at benching voltages (1.7-1.8v, iirc 1.8v is the max DRAM voltage Asus non crosshair/maximus etc boards) with maxmem. Just about any decent bin of b-die does 3666-3800 at CAS14, 14 ticks at 3800 is equivalent to 16 ticks at 4333 in terms of latency.

The frequency depends a bit more on the motherboard but many newer 8Gbit ICs don't struggle to run into the mid 4000s on recent motherboards. Stuff like Rev E, DJR, and D-die for example... I expect with normal voltages for these to land around 10ns quicker than what is currently being done on Matisse.

Vya Domus said:
Nah, cache size will always be more beneficial than slightly lower memory access time.

Depends on the access patterns of the program. Ryzen's L3 also gets used differently than Intel's skylake/xcove L3 because of Ryzen using exclusive victim caching while intel has been using inclusive (to L2).

AlB80 · Jul 1, 2020

Vya Domus said:
More like the tiny L3 cache explains the low latency. Generally, the smaller the cache the less time it takes to read/write to a particular cache line and therefore the overall average memory access time goes down.

Completely wrong.
Matisse and Renoir have the same L3$ associativity, that means L3$ tag check has the same latency.

Vya Domus · Jul 1, 2020

AlB80 said:
Completely wrong.
Matisse and Renoir have the same L3$ associativity, that means L3$ tag check has the same latency.

I said generally, the larger the cache and the more lines there are the more tags need to be checked.

AlB80 · Jul 2, 2020

Vya Domus said:
I said generally, the larger the cache and the more lines there are the more tags need to be checked.

Number of tags are need to be checked depends on its associativity only. Renoir and Matisse have 16-way L3$.
Also both chips have the same 10ns L3$ access latency, it means dram access penalty is the same too.

Imsochobo · Jul 3, 2020

Bruno Vieira said:
The cache latencies arent dramatcly lower, but expected for the cache size. The memory latencies I think is just for the memcontroller beeing so close to the CPU and 7nm as well.
Here are my 3600 4.2Ghz results, with the best mem stable mem settings that matisse can do.
View attachment 160720

The physical difference have no major impact to memory latencies.
It's interconnect and purely interconnect which matters (Yes there is a physical difference in delay but who's counting 0.2ns or so)
however, the cpu and memory controller on the same die may allow the frequency of said interconnect at higher frequency as it's not going across a substrate to another chip and thus why it clocks higher.

Just a tiny correction, and information as many thing physical distance matters for latency and no it does not it does have massive implications to power consumption which is the drawback of chiplets

.

Vya Domus · Jul 3, 2020

Imsochobo said:
Yes there is a physical difference in delay but who's counting 0.2ns or so

AMD is definitely counting those or anyone else that's making a chip. When you're accessing a cache millions of times a second you're going to start and feel those 0.2 of a nanosecond.

InVasMani · Sep 30, 2020

Vya Domus said:
Nah, cache size will always be more beneficial than slightly lower memory access time.

Exactly being out of memory is far worse between the two. I'm wager we'll step into the 32GB minimum requirement for system memory on games before the next console generation is over and possibly cross into 64GB requirements in certain scenario's high resolutions and high AA/AF that's bound to happen. Hopefully we'll have some 64GB GPU cards by then at least the workstation level I'd anticipate it and the low end card will probably have 16GB by that point in time.

System Name	RBMK-1000
Processor	AMD Ryzen 7 5700G
Motherboard	Gigabyte B550 AORUS Elite V2
Cooling	DeepCool Gammax L240 V2
Memory	2x 16GB DDR4-3200
Video Card(s)	Galax RTX 4070 Ti EX
Storage	Samsung 990 1TB
Display(s)	BenQ 1440p 60 Hz 27-inch
Case	Corsair Carbide 100R
Audio Device(s)	ASUS SupremeFX S1220A
Power Supply	Cooler Master MWE Gold 650W
Mouse	ASUS ROG Strix Impact
Keyboard	Gamdias Hermes E2
Software	Windows 11 Pro

System Name	Bongfjaes
Processor	AMD 3700x
Motherboard	Assus Crosshair VII Hero
Cooling	Dark Rock Pro 4
Memory	2x8GB G.Skill FlareX 3200MT/s CL14
Video Card(s)	GTX 970
Storage	Adata SX8200 Pro 1TB + Lots of spinning rust
Display(s)	Viewsonic VX2268wm
Case	Fractal Design R6
Audio Device(s)	Creative SoundBlaster AE-5
Power Supply	Seasonic TTR-1000
Mouse	Pro Intellimouse
Keyboard	SteelKeys 6G

Processor	9800x3D\| 5800x \| 4800H \| Rog ally
Motherboard	Gb x870 Aorus Elite ice \| Asrack x470d4u \| Asus Tuf A15
Cooling	Air \| Air \| duh laptop
Memory	64gb G.skill SniperX @3600 CL16 \| 128gb \| 32GB \| 192gb
Video Card(s)	RTX 4080 \|Quadro P5000 \| RTX2060M
Storage	Many drives
Display(s)	AW3423dwf.
Case	Jonsbo D41
Power Supply	Corsair RM850x
Mouse	g502x Lightspeed
Keyboard	G913 tkl
Software	win11, proxmox

System Name	Good enough
Processor	AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard	ASRock B650 Pro RS
Cooling	2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory	32GB - FURY Beast RGB 5600 Mhz
Video Card(s)	Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage	1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s)	LG UltraGear 32GN650-B + 4K Samsung TV
Case	Phanteks NV7
Power Supply	GPS-750C

System Name	Overlord Mk MLI
Processor	AMD Ryzen 7 7800X3D
Motherboard	Gigabyte X670E Aorus Master
Cooling	Noctua NH-D15 SE with offsets
Memory	32GB Team T-Create Expert DDR5 6000 MHz @ CL30-34-34-68
Video Card(s)	Gainward GeForce RTX 4080 Phantom GS
Storage	1TB Solidigm P44 Pro, 2 TB Corsair MP600 Pro, 2TB Kingston KC3000
Display(s)	Acer XV272K LVbmiipruzx 4K@160Hz
Case	Fractal Design Torrent Compact
Audio Device(s)	Corsair Virtuoso SE
Power Supply	be quiet! Pure Power 12 M 850 W
Mouse	Logitech G502 Lightspeed
Keyboard	Corsair K70 Max
Software	Windows 10 Pro
Benchmark Scores	https://valid.x86.fr/yfsd9w

AMD Ryzen 7 4700GE Memory Benchmarked: Extremely Low Latency Explains Tiny L3 Caches

btarunr

Editor & Senior Moderator

Fouquin

Staff

Axaion

Imsochobo

Vya Domus

TheLostSwede

News Editor

dyonoctis

Vya Domus

HABO

Caring1

Bruno Vieira

HD64G

londiste

GorbazTheDragon

AlB80

Vya Domus

AlB80

Imsochobo

Vya Domus

InVasMani

Processor	AMD Ryzen 3700x
Motherboard	asus ROG Strix B-350I Gaming
Cooling	Deepcool LS520 SE
Memory	crucial ballistix 32Gb DDR4
Video Card(s)	RTX 3070 FE
Storage	WD sn550 1To/WD ssd sata 1To /WD black sn750 1To/Seagate 2To/WD book 4 To back-up
Display(s)	LG GL850
Case	Dan A4 H2O
Audio Device(s)	sennheiser HD58X
Power Supply	Corsair SF600
Mouse	MX master 3
Keyboard	Master Key Mx
Software	win 11 pro

Processor	Ryzen 9 5800X3d
Motherboard	Gigabyte X570 I Aeorus Pro Wifi
Cooling	Noctua NH-U12A
Memory	G.SKILL 32GB KIT DDR4 3600 MHz CL16 Trident Z @3666MHz tuned by Ryzen calculator
Video Card(s)	EVGA 3080Ti XC3 ULTRA@1800MHz 0.8v
Storage	Samsung 980 PRO 2 TB, ADATA XPG SX8200 Pro 2TB
Display(s)	42" LG C2 OLED
Case	Cooler Master MasterBox NR200P
Audio Device(s)	Grado
Power Supply	Corsair SF750
Mouse	Logitech G PRO X Superlight
Keyboard	custom

System Name	H7 Flow 2024
Processor	AMD 5800X3D
Motherboard	Asus X570 Tough Gaming
Cooling	Custom liquid
Memory	32 GB DDR4
Video Card(s)	Intel ARC A750
Storage	Crucial P5 Plus 2TB.
Display(s)	AOC 24" Freesync 1m.s. 75Hz
Mouse	Lenovo
Keyboard	Eweadn Mechanical
Software	W11 Pro 64 bit

Processor	AMD Ryzen 5 5600@80W
Motherboard	MSI B550 Tomahawk
Cooling	ZALMAN CNPS9X OPTIMA
Memory	2*8GB PATRIOT PVS416G400C9K@3733MT_C16
Video Card(s)	Sapphire Radeon RX 6750 XT Pulse 12GB
Storage	Sandisk SSD 128GB, Kingston A2000 NVMe 1TB, Samsung F1 1TB, WD Black 10TB
Display(s)	AOC 27G2U/BK IPS 144Hz
Case	SHARKOON M25-W 7.1 BLACK
Audio Device(s)	Realtek 7.1 onboard
Power Supply	Seasonic Core GC 500W
Mouse	Sharkoon SHARK Force Black
Keyboard	Trust GXT280
Software	Win 7 Ultimate 64bit/Win 10 pro 64bit/Manjaro Linux

Processor	Ryzen 7800X3D
Motherboard	ROG STRIX B650E-F GAMING WIFI
Memory	2x16GB G.Skill Flare X5 DDR5-6000 CL36 (F5-6000J3636F16GX2-FX5)
Video Card(s)	INNO3D GeForce RTX™ 4070 Ti SUPER TWIN X2
Storage	2TB Samsung 980 PRO, 4TB WD Black SN850X
Display(s)	42" LG C2 OLED, 27" ASUS PG279Q
Case	Thermaltake Core P5
Power Supply	Fractal Design Ion+ Platinum 760W
Mouse	Corsair Dark Core RGB Pro SE
Keyboard	Corsair K100 RGB
VR HMD	HTC Vive Cosmos

System Name	Indis the Fair (cursed edition)
Processor	11900k 5.1/4.9 undervolted.
Motherboard	MSI Z590 Unify-X
Cooling	Heatkiller VI Pro, VPP755 V.3, XSPC TX360 slim radiator, 3xA12x25, 4x Arctic P14 case fans
Memory	G.Skill Ripjaws V 2x16GB 4000 16-19-19 (b-die@3600 14-14-14 1.45v)
Video Card(s)	EVGA 2080 Super Hybrid (T30-120 fan)
Storage	970EVO 1TB, 660p 1TB, WD Blue 3D 1TB, Sandisk Ultra 3D 2TB
Display(s)	BenQ XL2546K, Dell P2417H
Case	FD Define 7
Audio Device(s)	DT770 Pro, Topping A50, Focusrite Scarlett 2i2, Røde VXLR+, Modmic 5
Power Supply	Seasonic 860w Platinum
Mouse	Razer Viper Mini, Odin Infinity mousepad
Keyboard	GMMK Fullsize v2 (Boba U4Ts)
Software	Win10 x64/Win7 x64/Ubuntu