
AMD Radeon RX 9070 XT Could Get a 32 GB GDDR6 Upgrade

AMD themselves only support 7900 series desktop GPUs.
Incorrect. ROCm supports RDNA 3 & 2.


 
I posted Windows and you are posting Linux :)

Christ. ROCm doesn’t run on Windows. A Software Development Kit isn’t a Platform. Pay attention please.

The HIP SDK for Windows brings a subset of the ROCm platform to Windows.



No debugger, no tools, no AI, no communications. Jesus, Make doesn’t even work on Windows. Tell me again about all the compute apps for AMD desktop GPUs?

Edit: the porting tools have been “Coming Soon” for a year. And AMD is all “We’re a software company now”. What a fucking joke.


Amateurs
 
What's the bloody point?

As follows:

1. It helps them mask the regression in VRAM capacity in their uppermost segment GPU when compared to the previous generation by introducing this variant
2. It helps them shift cards to AI inference customers and whatever cryptocurrency miners are left, as well as milking some extra profits from the one-off gamer that will purchase this
3. It helps them increase margins by pricing this model significantly higher, while quietly reducing the supply of the 16 GB model with far less backlash from the customer base
4. It helps them state "If Nvidia can build a 32 GB gaming card, so can we!" to shareholders
5. The loyal fanbase will not question any of the four points above and will attempt to spin whatever is convenient (specifically, #2) as a positive

unless you play at 4k native + ultra settings + RT + AA

which can and will use over 16GB easily in many games

Not only does it generally not (16 GB is perfectly adequate for this performance level), but the 9070 XT isn't powerful enough to pull this off anyway. I don't think the 5090 really is either, depending on the game we're talking about.
 
As follows:

1. It helps them mask the regression in VRAM capacity in their uppermost segment GPU when compared to the previous generation by introducing this variant
That "uppermost" segment is the $599 x70 level. Nvidia offers 12 GB here, so even AMD's "regression" still gives you more. Hmm...

2. It helps them shift cards to AI inference customers and whatever cryptocurrency miners are left, as well as milking some extra profits from the one-off gamer that will purchase this
Definitely this.

3. It helps them increase margins by pricing this model significantly higher, while quietly reducing the supply of the 16 GB model with far less backlash from the customer base
It'll definitely be priced higher to milk AI customers, but reducing 16 GB supply is pure speculation with no basis in real life.

4. It helps them state "If Nvidia can build a 32 GB gaming card, so can we!" to shareholders
Yes. Not that it affects buyers with half a brain, though.
 
It'll definitely be priced higher to milk AI customers, but reducing 16 GB supply is pure speculation with no basis in real life.

Only one problem with that: there is no reason to prioritize a $599 16 GB version if they can sell the same chip in a $999 32 GB version that could arguably still be spun as a bargain to many
 
Only one problem with that: there is no reason to prioritize a $599 16 GB version if they can sell the same chip in a $999 32 GB version that could arguably still be spun as a bargain to many
There is: company reputation and market share in gaming GPUs.
 
There is: company reputation and market share in gaming GPUs.
Maybe that will become more of a lucrative market segment going forward, with AI supposedly becoming "more expensive to train and cheaper to run"
 
Maybe that will become more of a lucrative market segment going forward, with AI supposedly becoming "more expensive to train and cheaper to run"
I'm not saying it's not more lucrative. But I can't see AMD, or even Nvidia abandoning the gaming GPU market as of now. Why have any midrange GPUs otherwise? Why not just sell all for AI people?
 
I'm not saying it's not more lucrative. But I can't see AMD, or even Nvidia abandoning the gaming GPU market as of now. Why have any midrange GPUs otherwise? Why not just sell all for AI people?
It would be interesting to see the architectural modifications. Maybe these are sort of positioned as 9900X3D / 9950X3D equivalents.
 
They are, however, stingy with their features compared to NVIDIA. And that's where AMD's problem is, or most of it.

Utter waste of VRAM. No game uses that much VRAM, and any VRAM not being used just sits idle doing nothing. All this does is artificially inflate the cost of the card with zero performance improvement. Maybe they'd do better with faster memory than more of it.

Ok. And what about non-game applications? Is that also an utter waste for them?

The general Radeon performance for compute is extremely poor.

Modded Skyrim (esp. VR) takes >20GB VRAM; with Nvidia still limiting VRAM, I'd welcome it

This is not the card for you, then. Buy the RX 7900 XTX instead!

Lots of VRAM, not enough horsepower.. cool!

Not enough AMD software support, either.
 
It would be interesting to see the architectural modifications. Maybe these are sort of positioned as 9900X3D / 9950X3D equivalents.
We're talking about GPUs here.
 
Equivalents in the GPU world, i.e. "best of both worlds"
Ah, I get your point. Perhaps you're right. It probably won't make any sense purely for gaming, just like the 9950X3D doesn't.
 
Ah, I get your point. Perhaps you're right. It probably won't make any sense purely for gaming, just like the 9950X3D doesn't.
My fault, I tend to assume things and be unclear.

True, but there seem to be a lot of people excited for both! Interested to see if we get a reason for dual V-Cache on Zen6
 
My fault, I tend to assume things and be unclear.

True, but there seem to be a lot of people excited for both! Interested to see if we get a reason for dual V-Cache on Zen6
Personally, I'd prefer a single 12-core CCD with V-cache (not that I need it, but that would feel more of an upgrade over my 7800X3D).
 
Lots of VRAM, not enough horsepower.. cool!
It is so for gaming, where the GPU has to work pretty hard for each frame. In the LLM world, when running a pre-trained model (aka inferencing), you will run into memory capacity and then memory bandwidth bottlenecks; lack of compute is not the issue. So a 32GB 9070 XT can run some bigger LLM models faster than a 24GB 7900 XTX or 4090.
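A rough back-of-the-envelope sketch of that point, assuming fully VRAM-resident weights and approximate bandwidth figures (~640 GB/s for the 9070 XT, ~960 GB/s for the 7900 XTX, ~1008 GB/s for the 4090); decode speed is then capped by how fast the weights can be streamed from VRAM:

```python
# Rough ceiling on decode speed for a fully VRAM-resident LLM: each generated
# token streams (roughly) the entire set of weights once, so
# tokens/s ~= memory bandwidth / model size. Numbers are approximate.

def decode_ceiling(bandwidth_gb_s: float, model_size_gb: float) -> float:
    return bandwidth_gb_s / model_size_gb

model_gb = 27.0  # e.g. a ~27 GB Q6_K 32B GGUF

for name, bw in [("9070 XT 32GB, ~640 GB/s", 640),
                 ("7900 XTX 24GB, ~960 GB/s", 960),
                 ("RTX 4090 24GB, ~1008 GB/s", 1008)]:
    print(f"{name}: ~{decode_ceiling(bw, model_gb):.0f} tok/s max")
```

The faster 24GB cards can't hold that 27GB model in VRAM at all, though, so in practice they fall back to partial CPU offload and land far below their ceiling, which is the whole point.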
 
Useless for gaming, kind of useful as a bone thrown to the AI crowd, but, as noted above, it would have to be priced very aggressively to compensate for the lack of NV amenities like CUDA.

AMD pays a very high price today for not having developed a wide range of software compatible with its GPUs, as Nvidia did.

AMD has always been the type of company that just developed the hardware and expected software developers to optimize their apps for AMD hardware on their own.

On the other hand, Nvidia has always had a very close relationship with software developers to optimize all types of software for its hardware.
 
32GB would be a day 1 buy for me, with all of the AMD crap driver quality that entails.
Been itching to have something faster than a 3090 that doesn't skimp on VRAM, has DisplayPort 2, and doesn't have power circuitry design flaws that pose a fire hazard.
I just want to VR and AI faster than I can today.
 
With the 5090, NV has released a 32GB VRAM consumer GPU, so ofc AMD is going to do the same (wasn't it the same with 24GB VRAM consumer GPUs?). The difference is that the 9070 (XT) is based on a 256-bit chip using GDDR6 (~640 GB/s) vs 512-bit GDDR7 (1792 GB/s) for the 5090. Still fast enough.
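A quick sanity check on those bandwidth figures; the per-pin data rates below (20 Gbps GDDR6 for the 9070 (XT), 28 Gbps GDDR7 for the 5090) are assumptions based on the commonly quoted specs:

```python
# Memory bandwidth = (bus width in bits / 8) * per-pin data rate in Gbps.
def bandwidth_gb_s(bus_width_bits: int, data_rate_gbps: float) -> float:
    return bus_width_bits / 8 * data_rate_gbps

print(bandwidth_gb_s(256, 20))  # 9070 (XT), GDDR6:  640.0 GB/s
print(bandwidth_gb_s(512, 28))  # 5090, GDDR7:      1792.0 GB/s
```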

AFAIK, only modded games may require more than 24GB VRAM in 4K right now, but 32GB are nice for fully offloading/hosting big-ish LLMs locally.

Regarding the CUDA/ML stack, indeed, I think of AMD GPUs only in terms of running/inferencing LLMs, not training/finetuning, though I read it's still possible and supposedly got easier over the last few years. CUDA, by contrast, is tier-agnostic and supports consumer GPUs, workstation GPUs and enterprise cards. To improve this, UDNA (U for unified) will replace RDNA at some point.

The 5090's idle power consumption unfortunately increased to 30W (4090: 22W), but it's still not too bad (it's more than linear in video playback: 54W 5090 vs 26W 4090) considering there are 16 2GB modules (linear increase: 22W[4090]/12[GDDR6X]*16[GDDR7] = 29.33W).

For me to consider this RDNA4 32GB GPU (in no particular order):
  • DLSS 2-like upscaling quality improvement
  • Fix HDMI 2.1 48 Gbps, aka HDMI 2.1a, on Linux
  • Back to good power scaling like in RDNA2
  • Low idle power consumption: at worst a linear increase with the amount of VRAM compared to the 16GB model
  • Just like the 5090, the 9070 (XT) 32GB must also be a consumer GPU, so that the price increase is minimal
So, AMD, is it 48GB VRAM consumer GPUs for the UDNA arch after RDNA4 then as well? That would allow fully offloading `Llama-3.3-70B-Instruct-Q4_K_M.gguf` (42.5GB) (by then we will have a different and more capable 70B LLM, ofc), or allow for much higher context.
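As a rough fit check for that scenario, the model file size plus some headroom for KV cache and compute buffers has to stay under the VRAM amount; the 4 GB overhead below is just an assumption and grows with context length:

```python
# Very rough check whether a GGUF can be fully offloaded to VRAM.
# overhead_gb (KV cache + buffers) is an assumed figure; it depends on context.
def fits_in_vram(model_file_gb: float, vram_gb: float, overhead_gb: float = 4.0) -> bool:
    return model_file_gb + overhead_gb <= vram_gb

print(fits_in_vram(42.5, 48))  # Llama-3.3-70B Q4_K_M on a 48 GB card -> True (barely)
print(fits_in_vram(42.5, 32))  # ...on a 32 GB card -> False
```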

Try to run a 30GB model on a fancy 24 or 16GB RTX with CUDA and compare the experience against a Radeon with 32GB VRAM.
VRAM is valuable real-estate!
Yes, 24GB VRAM can't fit, e.g., a 27GB `Qwen2.5-32B-Instruct-Q6_K.gguf` SOTA LLM, but the .gguf format allows offloading the rest of the LLM layers to RAM, though it will run much slower. The tokens-per-second speed increases sharply the more layers are offloaded to the GPU. I did some testing:
[Chart: tokens per second vs. number of layers offloaded to the GPU]
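For reference, a minimal sketch of that kind of layer-offload test using the llama-cpp-python bindings; the model path, prompt and layer counts are placeholders, and `n_gpu_layers=-1` offloads every layer:

```python
# Measure tokens/s at different GPU offload levels with llama-cpp-python.
from llama_cpp import Llama
import time

MODEL = "Qwen2.5-32B-Instruct-Q6_K.gguf"  # placeholder path

for n_gpu_layers in (0, 16, 32, 48, -1):  # -1 = offload all layers
    llm = Llama(model_path=MODEL, n_gpu_layers=n_gpu_layers, verbose=False)
    start = time.time()
    out = llm("Explain GDDR6 in one paragraph.", max_tokens=128)
    tokens = out["usage"]["completion_tokens"]
    print(f"n_gpu_layers={n_gpu_layers}: {tokens / (time.time() - start):.1f} tok/s")
    del llm  # release VRAM before the next run
```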
 
Yes, 24GB VRAM can't fit, e.g., a 27GB `Qwen2.5-32B-Instruct-Q6_K.gguf` SOTA LLM, but the .gguf format allows offloading the rest of the LLM layers to RAM, though it will run much slower. The tokens-per-second speed increases sharply the more layers are offloaded to the GPU. I did some testing:
Oh, please don't tell me, I am painfully aware how slow things can get when you run LLMs outside of a GPU :D
May I ask you to post here?
 