
AMD Radeon RX 9070 XT Could Get a 32 GB GDDR6 Upgrade

With the 5090, NV has released a 32GB VRAM consumer GPU, so of course AMD is going to do the same (wasn't it the same story with 24GB VRAM consumer GPUs?). The difference is that the 9070 (XT) is based on a 256-bit chip with GDDR6 at ~640 GB/s, vs the 5090's 512-bit GDDR7 at 1792 GB/s. Still fast enough.
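Those bandwidth figures fall straight out of bus width times per-pin data rate; a quick sketch (the 20 and 28 Gbps per-pin rates are assumed typical values for GDDR6 and GDDR7 here, not confirmed specs):

```python
# Peak memory bandwidth = bus width (bits) x per-pin data rate (Gbit/s) / 8.
def mem_bandwidth_gbs(bus_bits: int, pin_gbps: float) -> float:
    """Returns peak bandwidth in GB/s."""
    return bus_bits * pin_gbps / 8

print(mem_bandwidth_gbs(256, 20))  # 640.0 GB/s, 9070 XT class (256-bit GDDR6)
print(mem_bandwidth_gbs(512, 28))  # 1792.0 GB/s, 5090 class (512-bit GDDR7)
```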

AFAIK, only modded games may require more than 24GB VRAM in 4K right now, but 32GB is nice for fully offloading/hosting big-ish LLMs locally.

Regarding the CUDA/ML stack: indeed, I think of AMD GPUs only in terms of running/inferencing LLMs, not training/finetuning. I've read training is still possible and has supposedly gotten easier over the last few years, but CUDA is tier-agnostic and supports consumer, workstation, and enterprise cards alike. To close this gap, UDNA (U for unified) will replace RDNA at some point.

The 5090's idle power consumption unfortunately increased to 30W (4090: 22W), but that's still not too bad considering there are now 16 2GB memory modules instead of 12 (a purely linear increase would predict 22W × 16/12 ≈ 29.3W). Video playback scales worse than linearly, though: 54W on the 5090 vs 26W on the 4090.
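The linear-scaling estimate can be written out explicitly (a naive model only: it pins all the idle increase on module count, which real cards won't exactly follow):

```python
# Naive model: memory-related idle power scales linearly with module count.
# 4090: ~22 W idle with 12 GDDR6X modules; 5090 carries 16 GDDR7 modules.
def scaled_idle_watts(base_w: float, base_modules: int, new_modules: int) -> float:
    return base_w / base_modules * new_modules

print(round(scaled_idle_watts(22, 12, 16), 2))  # 29.33, close to the measured ~30 W
```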

For me to consider this RDNA4 32GB GPU (in no particular order):
  • DLSS 2-like upscaling quality improvement
  • Fix HDMI 2.1 48Gbit/s (FRL), aka HDMI 2.1a, on Linux
  • Back to good power scaling like in RDNA2
  • Low idle power consumption; at worst, a linear increase with the amount of VRAM compared to the 16GB model
  • Just like the 5090, the 9070 (XT) 32GB must also be a consumer GPU, so that the price increase stays minimal
So, AMD, is it 48GB VRAM consumer GPUs for the UDNA arch after RDNA4, then? That would allow fully offloading `Llama-3.3-70B-Instruct-Q4_K_M.gguf` (42.5GB) (by then we'll have a different, more capable 70B LLM, of course), or allow for much higher context.
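A back-of-envelope check of that fit (the ~4.85 effective bits/weight for Q4_K_M and the 2 GB runtime overhead are rough assumptions on my part):

```python
# Rough GGUF size: parameters (billions) x effective bits per weight / 8 -> GB.
def gguf_size_gb(params_b: float, bits_per_weight: float) -> float:
    return params_b * bits_per_weight / 8

def fits_in_vram(model_gb: float, vram_gb: float, overhead_gb: float = 2.0) -> bool:
    # Reserve a couple of GB for KV cache and runtime buffers (rough guess).
    return model_gb + overhead_gb <= vram_gb

size = gguf_size_gb(70, 4.85)      # ~42.4 GB, near the quoted 42.5 GB file
print(fits_in_vram(size, 48))      # True: fits fully in 48 GB
print(fits_in_vram(size, 32))      # False: must spill layers to RAM
```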


Yes, 24GB VRAM can't fit, e.g., the 27GB `Qwen2.5-32B-Instruct-Q6_K.gguf` SOTA LLM. The .gguf format allows offloading the remaining LLM layers to RAM, but those run much slower. Tokens-per-second rises steeply the more layers are offloaded to the GPU; I did some testing:
[attachment 384684: offloading benchmark results]
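The steep curve near full offload can be sketched with a simple two-rate model (the 40 and 4 tok/s rates below are made-up illustrative numbers, not measurements):

```python
# Two-rate model of partial GPU offload: per-token time is the sum of the
# GPU-resident and CPU-resident fractions at their respective speeds.
def tok_per_sec(gpu_frac: float, gpu_tps: float = 40.0, cpu_tps: float = 4.0) -> float:
    return 1.0 / (gpu_frac / gpu_tps + (1.0 - gpu_frac) / cpu_tps)

# Throughput stays CPU-dominated until nearly everything is on the GPU:
for f in (0.0, 0.5, 0.9, 1.0):
    print(f, round(tok_per_sec(f), 1))  # 4.0, 7.3, 21.1, 40.0
```

Half the layers on the GPU gets you nowhere near half the speed, because the slow CPU-side fraction dominates the per-token time.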
I'm not sure they will do 48GB on consumer just yet, as that would give the W7800 and W7900 workstation GPUs some competition, but we shall see.

Right now this model gives me the best performance on a 24GB VRAM GPU:

[screenshot]


Doing about 28 tok / sec

[screenshot]


Looks like this rumor is false.

[screenshot]
 
AFAIK, only modded games may require more than 24GB VRAM in 4K right now, but 32GB are nice for fully offloading/hosting big-ish LLMs locally.

Yeah spot on.
Regarding VRAM and games: I think it's hard to get games to use above 16GB, but one exception, similar to what you said about mods, is VRChat. That game is quite unusual in that you're seeing user-uploaded Unity assets, so optimization is horrible: a tragedy-of-the-commons situation where not enough individuals optimize their assets, and a single avatar can be as bad as 500MB of VRAM. Go into a populated room and there's no real upper bound on how much VRAM you'd like! And since it's a VR game, DisplayPort 2.0 is a must for future-proofing, because current-gen VR headsets already saturate what DP 1.4 can do.
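The DP 1.4 point can be sanity-checked with raw video bandwidth (the headset-class 4K120 10-bit figures are illustrative assumptions, and blanking overhead is ignored):

```python
# Uncompressed video bandwidth in Gbit/s: width x height x refresh x bits/pixel.
def video_gbps(width: int, height: int, hz: int, bpp: int = 30) -> float:
    return width * height * hz * bpp / 1e9

DP14_PAYLOAD = 25.92                # Gbit/s usable on DP 1.4 (HBR3 after 8b/10b)
need = video_gbps(3840, 2160, 120)  # headset-class 4K120 at 10-bit color
print(round(need, 1))               # 29.9 Gbit/s
print(need > DP14_PAYLOAD)          # True: DP 1.4 needs compression; DP 2.x does not
```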

Regarding VRAM and AI: if you train a LoRA for SDXL, you'll already cross the 16GB boundary, and SD 3 and Flux are going to be worse. Training speed isn't really an issue here, just VRAM.
Inference, as you say, doesn't need nearly as much VRAM, so the 9070 XT will do fine even with less of it.

But as a competitor to, say, the 5080: the 5080 just doesn't have enough VRAM. I think 24GB is enough and 32GB is a bonus, but 16GB just isn't enough for these admittedly obscure tasks.
 
Not sure the cards need it; it's mostly to serve the AI crowd and grab more money, leaving fewer GPUs available for gamers.
 
A smart move if they manage to offer that 32GB iteration as a workstation GPU: it would take demand off the gaming series from whoever (non-gamers) needs more VRAM for apps, and they can sell it at higher profit margins, which should help AMD keep the gaming GPUs at normal pricing.
 
Look at the update, guys. There's no 32 GB gamer card, but maybe a Radeon Pro workstation card coming later.
 
Not sure the cards need it; it's mostly to serve the AI crowd and grab more money, leaving fewer GPUs available for gamers.
It won't as it's not going to be a 4K card.

The 4080 isn't for 4K either, more like 1440p240.
 
It won't as it's not going to be a 4K card.

The 4080 isn't for 4K either, more like 1440p240.
I agree at 4k you really want a 4090 or a 5090.

The 7900 XTX / 4080 tier is better at 1440p or 1440 ultrawide resolutions. 4K60 is doable at this tier, but personally I prefer my fps up in the 100-144 range without upscaling or frame generation.
 
I like the 32GB version and would like GPU water-cooling support in a single-slot design. I'd have no issue going for this over the Nvidia FE.
 
unless you play at 4K native + ultra settings + RT + AA

which can and will easily use over 16GB in many games
Will be interesting to see if a 9070 XT can even do that at acceptable framerates.

In any case, looks like the rumor was false.
 
It won't as it's not going to be a 4K card.

The 4080 isn't for 4K either, more like 1440p240.

no one is going to game on that thing
 
News article said:
Update 20:55 UTC: AMD's Frank Azor on X debunked rumors of the 32 GB SKU coming to gamers. So, this will not happen. Instead, we could be looking at a prosumer-oriented AMD Radeon Pro GPU with 32 GB of memory.
Rumor false? You mean false until it's out in 6-12 months? AMD wants people to buy their much more expensive prosumer SKUs while NV is offering 32GB in the consumer space? AMD never misses an opportunity to miss an opportunity.
 
That's not what is written. In fact it clearly says no 9070 XT 32GB, which does not rule out a 9070 XTX 32GB, a 9075 XT 32GB, or any other naming. It's just corpo speak...

No higher SKU announced, though. Not even a faint rumor from CN forums, and AMD themselves have already admitted nothing above Navi 48 was developed, with the 9070 XT having the full configuration already. But I'll let you hit the hopium as much as you want :D

Rumor false? You mean false until it's out in 6-12 months? AMD want ppl to buy their much more expensive prosumer SKUs when NV is offering 32GB in the consumer space? AMD never misses an opportunity to miss an opportunity.

AMD has done prosumer once: Vega Frontier, and it was a complete disaster. You'll never guess who once had one :rolleyes:

But not really, not in this case. This GPU just doesn't have the performance chops for 32 GB at anything, and LLMs would only run faster because they are almost always VRAM capacity bottlenecked.
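The VRAM-bottleneck point has a bandwidth side too: a dense LLM streams all its weights once per generated token, so an upper bound on single-stream tok/s is bandwidth divided by model size (a rough rule of thumb that ignores compute and KV-cache traffic):

```python
# Upper bound on single-stream tok/s for a dense LLM:
# every weight is read once per token, so tps <= bandwidth / model_size.
def max_tok_per_sec(bandwidth_gbs: float, model_gb: float) -> float:
    return bandwidth_gbs / model_gb

print(round(max_tok_per_sec(640, 27), 1))   # ~23.7 t/s: 27 GB model at 640 GB/s
print(round(max_tok_per_sec(1792, 27), 1))  # ~66.4 t/s at 5090-class bandwidth
```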
 
No higher SKU announced, though. Not even a faint rumor from CN forums, and AMD themselves has already admitted nothing above Navi 48 was developed, with the 9070 XT having the full configuration already. But I'll let you hit the hopium as much as you want :D



AMD has done prosumer once: Vega Frontier, and it was a complete disaster. You'll never guess who once had one :rolleyes:

But not really, not in this case. This GPU just doesn't have the performance chops for 32 GB at anything, and LLMs would only run faster because they are almost always VRAM capacity bottlenecked.
Probably 9070 WS though :)

 
At least AMD is officially responding to the rumors. A rumor mill left untamed will run wild.

Of course the best way to deal with all this is to RELEASE THE DAMN CARDS ALREADY!
 
Yeah, a Radeon Pro version will 100% have 32 GB, that much is expected. I also expect NV to launch an RTX 6000 Blackwell Generation with 64 gigs too.
I know they announced a 96GB version of Blackwell for workstation cards, so a 64GB model is probably coming as well.

 
I know they announced a 96GB version of Blackwell for workstation cards so a 64GB model probably coming aswell.


Ooh, the 3 GB GDDR7 chips are ready. I guess the GDDR6 ones won't release after all, or maybe they'll see some limited use for AMD, which is still using the old standard in its cards?
 
David McAfee confirmed no 32GB 9070 XT. Just saw it.
 
Ooo, 3 GB G7 chips are ready. The G6 ones I guess will not release after all, or maybe might see some limited use for AMD that's still using the old standard in their cards?
Other than going HBM way back, AMD generally doesn't go with expensive top-end memory, so they will probably stay with current memory. We'll have to wait and see what they do with the UDNA generation.
 
[...]
For me to consider this RDNA4 32GB GPU (in no particular order):
  • DLSS 2-like upscaling quality improvement
  • Fix HDMI 2.1 48Gbit/s (FRL), aka HDMI 2.1a, on Linux
  • Back to good power scaling like in RDNA2
  • Low idle power consumption; at worst, a linear increase with the amount of VRAM compared to the 16GB model
  • Just like the 5090, the 9070 (XT) 32GB must also be a consumer GPU, so that the price increase stays minimal
So, AMD, is it 48GB VRAM consumer GPUs for the UDNA arch after RDNA4, then? That would allow fully offloading `Llama-3.3-70B-Instruct-Q4_K_M.gguf` (42.5GB) (by then we'll have a different, more capable 70B LLM, of course), or allow for much higher context.
[...]
I must add that when AMD releases a 48GB VRAM consumer GPU (the release may be accelerated by AI / LLM self-hosting being a thing now, but it's 2-3 years / next gen at the earliest), I'd like them to use GDDR7 by that point, otherwise the speeds may be too slow (the ~30% speed increase from GDDR7 is worth it, and by then GDDR7 should be cheaper too).

Oh, please don't tell me, I'm painfully aware of how slow things can get when you run LLMs outside of a GPU :D
May I ask you to post here?
I thought so. I guess I don't want to create a new topic just to post my benchmark results of RAM vs VRAM offloading speeds :)
I'm going to post there soon.

This is not what is written, in fact it clearly says no 9070XT 32Gb which does not state not 9070 XTX 32Gb or 9075XT 32Gb or any other naming. It is just corpo speech.....
Makes sense. The same name would only confuse end consumers (though hiding Zen 2 under Zen 3 CPU names already does that, to name just one example). Naming dilemma: same performance but different VRAM amounts, especially 16GB vs 32GB, may deserve not just an added "AI" in the name (they already add "AI" to product names) but maybe a (slightly) different name altogether. (Though NV has a 4060 Ti 8GB and a 4060 Ti 16GB; let's see if they repeat that end-consumer confusion for the GeForce 50 series.)

I'm not sure they will do 48GB on consumer just yet as that will give the W7800 and W7900 workstation gpu's some competition but we shall see.

[...]
Indeed, but it would be interesting to know what percentage of users buy workstation GPUs solely for the VRAM amount vs actually needing workstation features. I don't expect an RDNA4 48GB consumer GPU ever, but with AI / LLM self-hosting being a thing now (it has only just started, and many more people may demand cheap, high-VRAM consumer GPUs), maybe next generation / in 2-3 years.
 
Will be interesting to see if a 9070Xt can even do that at acceptable framerates.

In any case, looks like the rumor was false.
My 7900 XT can, depending on the game, at 60fps; the 9070 XT is stronger, so it will as well.

Also, a bit of future-proofing in regards to VRAM doesn't hurt anyone.
 