
Could they not add a second bank of cheaper older memory?

A bit theoretical, but claims are made that memory (VRAM) is just expensive. I'm not buying that, but let's say it is.


Can't they make a card with, say, 6 GB of the latest, fastest GDDR6X (or heck, even HBM), and then a second layer of much cheaper (and yes, slower) GDDR5, but like 10 GB of it?

Just so that if the fast pool is filled, the card can shove data over to the slower memory and keep the bandwidth-hungry stuff in the fast pool for itself?


(Kind of like how RAM works with the CPU, versus the slower SSD/HDD.)
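To make the idea concrete, here's a toy sketch of that spill-over policy in Python. All the names, sizes, and the `bandwidth_hungry` flag are hypothetical; a real driver would manage placement per page with far smarter heuristics.

```python
# Toy sketch of the two-tier VRAM idea: a fast 6 GB pool that spills
# overflow allocations into a larger, slower 10 GB pool.

GB = 1024 ** 3

class TieredVram:
    def __init__(self, fast_bytes=6 * GB, slow_bytes=10 * GB):
        self.fast_free = fast_bytes
        self.slow_free = slow_bytes
        self.placement = {}  # buffer name -> "fast" or "slow"

    def alloc(self, name, size, bandwidth_hungry=True):
        # Prefer the fast pool for bandwidth-hungry buffers;
        # spill to the slow pool once the fast pool is full.
        if bandwidth_hungry and size <= self.fast_free:
            self.fast_free -= size
            self.placement[name] = "fast"
        elif size <= self.slow_free:
            self.slow_free -= size
            self.placement[name] = "slow"
        else:
            raise MemoryError(f"out of VRAM for {name}")
        return self.placement[name]

vram = TieredVram()
print(vram.alloc("framebuffer", 1 * GB))   # fast
print(vram.alloc("textures_hi", 5 * GB))   # fast (fills the pool)
print(vram.alloc("textures_lo", 4 * GB))   # spills to slow
```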
 
Can't they make a card with, say, 6 GB of the latest, fastest GDDR6X (or heck, even HBM), and then a second layer of much cheaper (and yes, slower) GDDR5, but like 10 GB of it?

I doubt this would be ideal. It would make the actual GPU die more complicated, since it would need both GDDR5 and GDDR6 memory controllers plus a weird bus layout, likely negating any savings from using the slower memory.
 
GTX 970 says hello :clap:

Although he's talking about using two different kinds of VRAM... The 970 had 4 GB of identical VRAM, but only 3.5 GB of it had access to the full bandwidth.
 
Although he's talking about using two different kinds of VRAM... The 970 had 4 GB of identical VRAM, but only 3.5 GB of it had access to the full bandwidth.
I knew it was slower than the main 3.5 GB; I assumed it was a different kind of VRAM, just slower. Not that I had one, or cared enough to look into it in detail, but it was big news (probably more so in the enthusiast community) at the time.
 
a second layer of much cheaper (and yes, slower) GDDR5, but like 10 GB of it
You mean like the (in)famous 970? They can't use the same bus width for that slower memory, so it wouldn't come cheap!
Also, IIRC, the reason they lost the case was the (reduced) cache.
 
I knew it was slower than the main 3.5 GB; I assumed it was a different kind of VRAM, just slower. Not that I had one, or cared enough to look into it in detail, but it was big news (probably more so in the enthusiast community) at the time.

Well, they ended up having to pay 970 owners about 30 USD per card after getting sued over it... which is actually a pretty large payout for a class-action suit.
 
VRAM costs basically nothing... the difference in manufacturing cost between the 8 GB and 16 GB models is probably 10-20 dollars for a company like NVIDIA.
 
They'd get sued again...:laugh:
This precisely.

I believe NVIDIA has officially said goodbye to asymmetrical bus nonsense, and using slower memory is really more of the same thing.


Here's the full 970 analysis. Really nice read.

If you consider this pic, it really puts things in place. The L2 cache is shared across not just part of the 3.5 GB... but ALSO with the 0.5 GB memory segment on its own; that one shares cache, so it impairs both segments/MCs. So basically, the way they wired this, it even cripples 0.5 GB of the 3.5 GB that 'has full bandwidth'. You could say this GPU starts losing after a 3 GB allocation.

[Attached image: GTX 970 memory subsystem diagram]
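For scale, a quick back-of-envelope from the published 970 specs (Python just as a calculator): eight 32-bit GDDR5 controllers at 7 Gbps, with the last 0.5 GB hanging off a single controller behind a shared crossbar port.

```python
# GTX 970 segment bandwidths from the public analysis.
gddr5_gbps_per_pin = 7            # effective GDDR5 data rate per pin
bits_per_controller = 32          # width of each memory controller

per_mc = gddr5_gbps_per_pin * bits_per_controller / 8  # GB/s per controller
print(per_mc)        # 28.0 GB/s for the lone 0.5 GB segment
print(per_mc * 7)    # 196.0 GB/s for the 3.5 GB segment
print(per_mc * 8)    # 224.0 GB/s advertised aggregate
```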
 
Can't they make a card with, say, 6 GB of the latest, fastest GDDR6X... and then a second layer of much cheaper (and yes, slower) GDDR5, but like 10 GB of it?
The memory controller won't like that (to put it mildly).
There's a reason you never see a motherboard that can operate two different memory standards (like DDR2 + DDR3, or DDR3 + DDR4) at the same time.

Making HBM + GDDR work like L2/L3 cache on a single PCB is a beyond-nightmare level of PCB trickery to make work (plus it would cost far too much to turn a profit with a wider audience).
 
Can't they make a card with, say, 6 GB of the latest, fastest GDDR6X... and then a second layer of much cheaper (and yes, slower) GDDR5, but like 10 GB of it?
They could, but I'll be blunt: I think NVIDIA has adopted a business plan of making their cards obsolete as fast as possible.

I still remember when they were interviewed by PCPer in the Maxwell era and asked how easy life must be with no decent competitor. Their response was that they are competing against themselves: it's a big job to convince owners of their older-gen products to upgrade. I think that reply was actually quite honest.

They could do modular GPUs, or add the ability to use the VRAM of a cheaper card in another PCIe slot, but they don't. Instead, if you want more VRAM, you have to buy an entire replacement GPU with more rendering performance than you need.

There is also the market segmentation issue: they have people desperate for VRAM (who also have deep pockets) buying the four-figure-priced SKUs right now. Think of incidents where, e.g., Amazon throws old stock away because they don't want to devalue a product's brand. It's kind of like that: if they stuck 16 GB on a 4070 at $600, it would decimate sales of the 4080.
 
We'll soon see something like that for system memory in servers: fast local memory in DIMM slots plus slow, high-latency, but expandable CXL memory over the PCIe bus. Of course, it won't "just work". Algorithms that determine what goes where will have to be very smart in order to utilise each type of memory to the best of its capabilities, and to prevent frequent movement of huge amounts of data between the two types.

Could a GPU's memory manager possibly be smart enough to handle a similar situation effectively?
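For a flavour of what those "what goes where" decisions look like, here's a minimal, purely hypothetical sketch of a hotness-based tiering policy. Real policies also have to weigh migration cost and add hysteresis so data doesn't ping-pong between tiers.

```python
# Track per-page access counts; keep the hottest pages in the fast
# tier and demote cold ones to the slow (e.g. CXL) tier.
from collections import Counter

FAST_CAPACITY_PAGES = 4  # tiny on purpose, for the demo

def rebalance(access_counts: Counter, fast_tier: set, slow_tier: set):
    """Move the hottest pages into the fast tier, coldest out."""
    ranked = [page for page, _ in access_counts.most_common()]
    want_fast = set(ranked[:FAST_CAPACITY_PAGES])
    promote = want_fast - fast_tier
    demote = fast_tier - want_fast
    fast_tier -= demote;  slow_tier |= demote
    fast_tier |= promote; slow_tier -= promote
    return promote, demote  # pages that had to migrate

counts = Counter({"pA": 90, "pB": 70, "pC": 60, "pD": 50, "pE": 5})
fast, slow = {"pA", "pE"}, {"pB", "pC", "pD"}
print(rebalance(counts, fast, slow))  # promotes pB/pC/pD, demotes pE
```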
 
Lotta 'if's, but there was once a chance of seeing NAND/PCM on GPUs, deployed *very* similarly to how the OP proposed.

If I had silly amounts of money to throw around, I'd have already imported a used Pro SSG from the UK and stuck 4 Optane drives in it, just to have an Artifact of What Could Have Been.
 
Lotta 'if's, but there was once a chance of seeing NAND/PCM on GPUs, deployed *very* similarly to how the OP proposed.
I've thought of that too. It's very much doable on a consumer card, on a more modest scale. AMD could integrate a 4-channel SMI SSD controller with custom firmware on the PCB, add 250 GB of TLC flash, and make it operate in permanent pseudo-SLC mode to get 80 GB of high-endurance memory, maybe getting close to 6-7 GB/s speeds.
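Rough arithmetic behind those figures (the per-channel throughput is an assumption for illustration; it depends on the NAND generation and controller):

```python
# Pseudo-SLC capacity and aggregate bandwidth, back-of-envelope.
tlc_capacity_gb = 250
pslc_capacity_gb = tlc_capacity_gb / 3   # pSLC stores 1 bit/cell vs 3
print(pslc_capacity_gb)                  # ~83 GB, close to the 80 GB figure

channels = 4
gb_per_s_per_channel = 1.6               # assumed per-channel NAND throughput
print(channels * gb_per_s_per_channel)   # ~6.4 GB/s aggregate
```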
 
I've thought of that too. It's very much doable on a consumer card, on a more modest scale. AMD could integrate a 4-channel SMI SSD controller with custom firmware on the PCB, add 250 GB of TLC flash, and make it operate in permanent pseudo-SLC mode to get 80 GB of high-endurance memory, maybe getting close to 6-7 GB/s speeds.
I believe AMD used one of their Xilinx FPGAs to accomplish something similar with 4 (striped) Samsung Gen3 NVMe M.2 modules.
(It appears common for companies to deploy developing technologies using off-the-shelf parts first, prior to integration.)

Normally, storage devices can't be used as "memory"; whatever magic programming they loaded into their FPGA allowed the NVMe drives to be addressed as 'extended VRAM'.

I think it's related IP to HBCC, but there were never any driver-side implementations beyond using system RAM as extended VRAM.

HBCC was almost what @ZoneDymo was getting at.
HBCC Memory Segment – The High Bandwidth Cache Controller (HBCC) Memory Segment allows allocation of system memory to the graphics card. This can be useful in applications that require more video memory than what is available on the graphics card.
Set HBCC Memory Segment to Enabled, then drag the HBCC Memory Size slider right or left to increase or decrease the total system memory allocated to the HBCC. Click OK to confirm.

To reset the system memory allocation back to default settings, click Perform Reset and OK to confirm.

Below is a screenshot example of these options:

[Screenshot: HBCC Memory Segment options in Radeon Settings]
I've never seen such a configuration, but: if someone could 'fake' a storage volume into 'extended system memory', you could (effectively) use HBCC to DIY a Pro SSG with an MI25, WX9100, Vega Frontier, etc.
(Edit: I suppose allocating an entire storage volume to the page file/virtual memory, then allocating the maximum (real) RAM to HBCC, would be semi-equivalent. However, I'd expect issues to arise, if only increased overall system latency.)



Edit:
Oh, and since I don't think it's been mentioned yet...
DirectStorage is almost a 'standardized software implementation' of the thread's topic.
Gen5 SSDs are going to be (at least in raw bandwidth) 'up there with' older DRAM and graphics DRAM.
 
DirectStorage is almost a 'standardized software implementation' of the thread's topic.
Yeah, especially as one of its features is decompressing from memory (even better *if* that also means from system RAM to VRAM).
NAND flash can't just serve as a "RAM extension". It may in cases where you have a lot of slow-changing data; I don't know if games qualify.
 
Sacrificing bandwidth for capacity in a bandwidth-sensitive application sounds counterproductive to me.
And I agree with what agent_x007 said. Hardware solutions that add unnecessary, and costly, complexity should generally be avoided.

They could do modular GPUs, or add the ability to use the VRAM of a cheaper card in another PCIe slot, but they don't.

Because it doesn't make sense.
PCIe bandwidth, at its best, is a fraction of even a midrange GPU's memory bandwidth. The yet-to-go-mainstream PCIe 6.0 offers about 121 GB/s at x16. An old RTX 2060 has 336 GB/s of memory bandwidth.
And don't forget that you'd need two slots, and it's even rarer to have both in x16 mode. The typical high-end "gaming" motherboard is probably still stuck with a PCIe 5.0 x8/x8 config for dual-slot usage. At those speeds, even using the system's memory may be bottlenecked.
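Putting that gap in numbers (approximate usable per-direction rates; the RTX 2060 figure is its 192-bit GDDR6 at 14 Gbps):

```python
# PCIe x16 link bandwidth vs. one older midrange GPU's local VRAM.
pcie_x16 = {"PCIe 4.0": 32, "PCIe 5.0": 64, "PCIe 6.0": 121}  # GB/s
rtx_2060_vram = 336  # GB/s

for gen, bw in pcie_x16.items():
    print(f"{gen} x16: {bw} GB/s -> {bw / rtx_2060_vram:.0%} of an RTX 2060's VRAM")
    # an x8 slot (the typical dual-GPU config) halves these figures again
```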
 