
What are latencies like with GDDR5, GDDR5X, GDDR6 and GDDR6X?

I'm not sure if this is the proper forum to ask this in, but here goes. What are the latencies like with GDDR5, GDDR5X, GDDR6 and GDDR6X? I'm not referring to what each IC is rated at, but to what kind of latencies an end user would see when running, for example, AIDA64's memory benchmark.
 
That depends on a lot of factors: whether you can run tighter timings or increase the frequency, and even just a GPU core overclock would reduce the latencies. So this number will vary greatly depending on the card the memory is installed on.

The latency improvement between an X and a non-X version could amount to as little as 5 ns.

How much does this matter to gaming frame rates?
Generally, single-digit percentage gains.
 
I'm not sure if this is the proper forum to ask this in, but here goes. What are the latencies like with GDDR5, GDDR5X, GDDR6 and GDDR6X? I'm not referring to what each IC is rated at, but to what kind of latencies an end user would see when running, for example, AIDA64's memory benchmark.
More or less: with every generation step they increase bandwidth, but latency gets worse.
Higher clock speeds try to keep the absolute latency similar, but in practice it isn't.
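The arithmetic behind that: absolute latency in nanoseconds is timing cycles divided by the command clock, so if a newer generation doubles the clock but needs roughly twice the cycles, the nanoseconds barely move. A minimal sketch; the cycle counts and clocks below are made-up placeholders, since vendors don't publish per-grade GDDR timings:

```cuda
#include <cstdio>

// Latency in ns = timing in cycles / command clock in GHz.
static double cas_ns(double cl_cycles, double cmd_clock_ghz) {
    return cl_cycles / cmd_clock_ghz;
}

int main() {
    // Hypothetical older generation: 1.75 GHz command clock, CL 14.
    printf("older gen: %.1f ns\n", cas_ns(14.0, 1.75));  // 8.0 ns
    // Hypothetical newer generation: twice the clock, ~twice the cycles.
    printf("newer gen: %.1f ns\n", cas_ns(29.0, 3.50));  // 8.3 ns, barely moved
    return 0;
}
```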


ETH mining, for example, was faster on a 1070 Ti than on a 1080, because the 1070 Ti used GDDR5 versus the GDDR5X on the GTX 1080.

I can't find the stock values, as they aren't listed anywhere (they'd vary between cards), but there's an example here where the 1080 and 1080 Ti benefited from lowering the VRAM speeds to tighten their timings, while the 1070 cards didn't need this and out-mined them at stock:

Optimize Memory Timings on Nvidia GDDR5X GPUs With OhGodAnETHlargementPill | The Crypto Blog (medium.com)



There are no end-user VRAM latency benchmarks out there, especially these days, since we can no longer modify VRAM timings in custom BIOS files, but a few outlets have done testing on this over the years:

Measuring GPU Memory Latency – Chips and Cheese
GPU Memory Latency’s Impact, and Updated Test – Chips and Cheese

Simply changing the size of the tested region changes the latency, and various GPUs have different methods of accessing that VRAM, so you can see why this isn't a simple thing to test.
I don't fully understand the information below; I'm just posting it here as examples from the links.
[Attached charts from the Chips and Cheese articles: memory latency plotted against test region size for various GPUs and CPUs]
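For reference, tests like these are usually built on pointer chasing: each load's address depends on the previous load's result, so nothing can overlap and the elapsed time per step is the latency. A minimal CUDA sketch of the idea, my illustration rather than Chips and Cheese's actual harness:

```cuda
#include <cstdio>
#include <numeric>
#include <random>
#include <vector>
#include <cuda_runtime.h>

__global__ void chase(const unsigned *next, unsigned steps, unsigned *sink) {
    unsigned idx = 0;
    for (unsigned i = 0; i < steps; ++i)
        idx = next[idx];        // each load depends on the previous result
    *sink = idx;                // stop the compiler from removing the loop
}

int main() {
    const unsigned N = 1u << 24;   // 64 MiB of 4-byte entries: larger than most
                                   // GPUs' L2; bump it for cards with huge caches
    const unsigned STEPS = 1u << 20;

    // Sattolo's algorithm: one full-length random cycle through all entries,
    // which defeats caching and prefetching along the chain.
    std::vector<unsigned> h(N);
    std::iota(h.begin(), h.end(), 0u);
    std::mt19937 rng(42);
    for (unsigned i = N - 1; i > 0; --i) {
        unsigned j = std::uniform_int_distribution<unsigned>(0, i - 1)(rng);
        std::swap(h[i], h[j]);
    }

    unsigned *d_next, *d_sink;
    cudaMalloc(&d_next, N * sizeof(unsigned));
    cudaMalloc(&d_sink, sizeof(unsigned));
    cudaMemcpy(d_next, h.data(), N * sizeof(unsigned), cudaMemcpyHostToDevice);

    cudaEvent_t t0, t1;
    cudaEventCreate(&t0);
    cudaEventCreate(&t1);
    cudaEventRecord(t0);
    chase<<<1, 1>>>(d_next, STEPS, d_sink);   // a single thread: pure latency
    cudaEventRecord(t1);
    cudaEventSynchronize(t1);

    float ms = 0.0f;
    cudaEventElapsedTime(&ms, t0, t1);
    printf("~%.1f ns per dependent load\n", ms * 1e6f / STEPS);

    cudaFree(d_next);
    cudaFree(d_sink);
    return 0;
}
```

Sweeping the region size from a few KiB upward is what produces those stepped charts: each plateau is a cache level, and the final plateau is VRAM. A real harness also has to pin clocks (a lone thread may not boost the GPU) and account for TLB behavior, which is part of why the methodology isn't simple.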



Hah, I love that the reason for the revised article was a comment from TPU specifying a better way to test this.
 
Thanks Mussels, that was really interesting.

Those GDDR latencies certainly do look bad relative to DDR4 system RAM latencies, but they're also accessing more data than a typical DDR4 access does (I believe on x86 an entire cache line is always read from memory, but that doesn't even amount to 1 KB of data). I think a cache line on x86 is 32 bytes.
 
I was always curious about comparing GDDR6 latencies to standard DDR4/5. All of the consoles seem to use G6 for system RAM, yet tight RAM timings provide a tangible benefit to PC gaming.
 
Thanks Mussels, that was really interesting.

Those GDDR latencies certainly do look bad relative to DDR4 system RAM latencies, but they're also accessing more data than a typical DDR4 access does (I believe on x86 an entire cache line is always read from memory, but that doesn't even amount to 1 KB of data). I think a cache line on x86 is 32 bytes.
It's 64 bytes. In DDRx, it moves over the 64-bit channel in a burst of 8 transfers (4 clock cycles). DDR5 can either do the same or, alternatively, use 32-bit subchannels, in which case it takes 16 transfers (8 cycles).
It's more complicated in GPUs, I think: a cache line is wider (128B?), but Nvidia can fetch smaller units too, and AMD probably has a similar feature.
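The burst arithmetic, for reference: bytes per burst = (channel width in bits / 8) × burst length, and both configurations land on the same 64-byte cache line. A trivial sketch:

```cuda
#include <cstdio>

// Bytes delivered per burst = (channel width in bits / 8) * burst length.
int main() {
    printf("DDR4, 64-bit channel, BL8   : %d bytes\n", 64 / 8 * 8);   // 64: one line
    printf("DDR5, 32-bit subchannel, BL16: %d bytes\n", 32 / 8 * 16); // also 64
    return 0;
}
```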

But what do those latency tests actually measure? Seeing those horribly large numbers for CPUs, I assume it's random-access latency, with no sequential access at all. So it should amount to approximately the sum of the four primary DDR timings, and it does.
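As a sanity check, here's that sum worked out for a common DDR4-3200 CL16-18-18-36 kit (example numbers of mine, not from the articles); cycles convert to nanoseconds at the 1600 MHz command clock:

```cuda
#include <cstdio>

int main() {
    // DDR4-3200: 3200 MT/s -> 1600 MHz command clock.
    const double clk_mhz = 1600.0;
    // Primary timings of a typical retail kit (illustrative values).
    const int cl = 16, trcd = 18, trp = 18, tras = 36;
    double ns = (cl + trcd + trp + tras) * 1000.0 / clk_mhz;
    printf("sum of primaries: %.1f ns\n", ns);  // 88 cycles -> 55 ns, in the
                                                // ballpark of measured results
    return 0;
}
```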
 
Thanks Mussels that was really interesting.

Those GDDR latencies certainly do look bad relative to what DDR4 system RAM latencies look like but they're also accessing more data than a typical DDR4 access does (I believe in x86 arch. an entire cache line is always read from memory but that doesn't even amount to 1 KB of data). I think a cache line in the x86 arch. is 32 bytes.
Cache line size is a property of the microarchitecture; the x86 architecture doesn't specify a line size. For x86 CPUs, the cache line sizes are (you can also query your own CPU's line size, as sketched after this list):
  1. Intel 486: 16 bytes
  2. Pentium to Pentium III, AMD K5 and K6, Cyrix 6x86, and Centaur's WinChip: 32 bytes
  3. Pentium 4: 64-byte L1, 128-byte sectored L2
  4. All AMD processors since the original Athlon and all Intel processors after the Pentium 4: 64 bytes
  5. Haswell and Broadwell eDRAM L4: 128 bytes
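On Linux/glibc you can confirm the line size at runtime with POSIX sysconf; a quick host-side check (platforms that don't expose these values will report nothing):

```cuda
#include <cstdio>
#include <unistd.h>  // sysconf; glibc exposes cache geometry through it

int main() {
    long line = sysconf(_SC_LEVEL1_DCACHE_LINESIZE);
    if (line > 0)
        printf("L1D cache line: %ld bytes\n", line);  // 64 on current x86 CPUs
    else
        printf("line size not reported on this platform\n");
    return 0;
}
```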
 