I'm just wondering how they're doing it: how they prioritize what goes into on-board memory and what goes into system RAM. Is it game dependent or fully on-the-fly? Do they have to make game profiles? This is the stuff I'm wondering about most. Because if they can achieve this fully on-the-fly without any profiles, with just really intelligent algorithms (maybe assisted by driver-updatable algorithms at the software level), that could be really sweet.
Resource streaming has to be implemented in the game engine.
Prefetching in CPUs works by finding access patterns, e.g. access of the block at address x, then x + k, then x + 2k (a minimal sketch of such a stride detector follows this list), but it has three requirements:
- The data needs to be laid out in memory in the same order it's going to be accessed.
- There have to be several accesses before a pattern can emerge, which means several cache misses, which in turn means stutter or missing resources.
- The patterns have to occur over a relatively short time, and there is no way to look for patterns across hundreds of thousands of memory accesses. A CPU, for comparison, looks through an instruction window of up to 224 instructions; for GPUs we have queues of up to several thousand commands, and the driver is not going to analyze queues spanning several frames to find patterns, maintain a giant lookup table, and resolve all of that instantly.
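For illustration, here's a minimal software sketch of the stride-detection idea. The table structure, stream keying, and confidence threshold are illustrative assumptions, not any real prefetcher's design:

```cpp
// Minimal sketch of a stride-detecting prefetcher table, as found
// (in hardware form) in CPU prefetch units. Illustrative only.
#include <cstdint>
#include <cstdio>
#include <unordered_map>

struct StrideEntry {
    uint64_t last_addr = 0;  // previous address seen for this stream
    int64_t  stride    = 0;  // last observed delta
    int      hits      = 0;  // consecutive repeats of the same stride
};

class StridePrefetcher {
    std::unordered_map<uint64_t, StrideEntry> table_;  // keyed by stream id
public:
    // Feed one access; returns a predicted next address once the same
    // stride has repeated, otherwise 0.
    uint64_t on_access(uint64_t stream_id, uint64_t addr) {
        StrideEntry& e = table_[stream_id];
        int64_t delta = static_cast<int64_t>(addr - e.last_addr);
        if (e.last_addr != 0 && delta == e.stride) {
            ++e.hits;
        } else {
            e.stride = delta;  // new candidate stride; confidence resets
            e.hits = 0;
        }
        e.last_addr = addr;
        // Note the cost: at least two misses (x, x + k) must happen
        // before the first useful prediction (x + 2k) can be issued.
        return e.hits >= 1 ? addr + e.stride : 0;
    }
};

int main() {
    StridePrefetcher p;
    for (uint64_t a : {0x1000ull, 0x1040ull, 0x1080ull, 0x10C0ull}) {
        uint64_t next = p.on_access(/*stream_id=*/1, a);
        std::printf("access %#llx -> prefetch %#llx\n",
                    (unsigned long long)a, (unsigned long long)next);
    }
}
```

Notice the detector stays silent for the first two accesses: that's the second requirement above showing up directly in code.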
The only game data that would benefit from this is landscape data, and even that still needs to be laid out in a specific pattern, which is something developers usually don't control. Also, this kind of caching would only work as long as the camera keeps moving in straight lines over time.
Resource streaming can be very successful when it's implemented properly in the rendering engine itself.
RX 480 and RX 580 might have 8 GB, but that's all they have. RX Vega with HBC can address all the memory in the system; in my case that would be 16+ GB of always-free RAM on top of the 8 GB on-board. Not even a GTX 1080 Ti or Titan X Pascal has that.
FYI: sharing of memory between CPU and GPU has been available in CUDA for years, so the idea is not new. It does, however, have very limited use cases.
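For reference, CUDA's unified memory (cudaMallocManaged) is one form of this: a single allocation visible to both CPU and GPU, with pages migrated on demand. A minimal sketch:

```cpp
// Minimal CUDA unified-memory sketch: one pointer usable from host
// and device code. Pages migrate on demand, which is also where the
// cost hides: a GPU touch of a CPU-resident page stalls on migration.
#include <cstdio>
#include <cuda_runtime.h>

__global__ void scale(float* data, int n, float factor) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] *= factor;
}

int main() {
    const int n = 1 << 20;
    float* data = nullptr;
    cudaMallocManaged(&data, n * sizeof(float));  // shared allocation

    for (int i = 0; i < n; ++i) data[i] = 1.0f;   // CPU writes

    scale<<<(n + 255) / 256, 256>>>(data, n, 2.0f);  // GPU reads/writes
    cudaDeviceSynchronize();                          // wait for kernel

    std::printf("data[0] = %f\n", data[0]);       // CPU reads result
    cudaFree(data);
}
```

The convenience is real, but the migration stalls are exactly why this only pays off in limited cases.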
The problem with texture streaming is that you're essentially doing VRAM+HDD or VRAM+SSD instead of something a lot faster. Vega's HBC with VRAM+RAM (+SSD) could address that far better, in the same way a CPU addresses its memory hierarchically: VRAM acts as the L1 cache, RAM as L2, and an SSD can be L3. Texture streaming the way current game engines do it (VRAM+HDD) still causes hitching, stuttering and frame-rate lag, because it's pulling textures from a really slow medium. A sketch of that hierarchical fall-through is below.
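Here's a runnable toy version of that VRAM/RAM/SSD analogy: serve a tile from the fastest tier that holds it, then promote it one tier up so repeated accesses get cheaper. The tier contents and promotion policy are illustrative assumptions, not how Vega's HBC actually works:

```cpp
// Toy three-tier residency hierarchy mirroring the VRAM/RAM/SSD analogy.
#include <array>
#include <cstdio>
#include <unordered_set>

enum Tier { VRAM = 0, RAM = 1, SSD = 2, TIERS = 3 };
static const char* kTierName[TIERS] = {"VRAM", "RAM", "SSD"};

struct Hierarchy {
    std::array<std::unordered_set<int>, TIERS> resident;  // tile ids per tier

    Tier access(int tile) {
        for (int t = 0; t < TIERS; ++t) {
            if (resident[t].count(tile)) {
                if (t > 0) {                   // promote one tier toward VRAM
                    resident[t].erase(tile);
                    resident[t - 1].insert(tile);
                }
                return static_cast<Tier>(t);
            }
        }
        resident[SSD].insert(tile);            // fault in at the slowest tier
        return SSD;
    }
};

int main() {
    Hierarchy h;
    h.resident[SSD].insert(42);
    for (int i = 0; i < 3; ++i)
        std::printf("access tile 42 -> served from %s\n",
                    kTierName[h.access(42)]);
    // Prints SSD, then RAM, then VRAM: each hit moves the tile up.
}
```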
As someone who has implemented texture streaming with a three-level hierarchy, I can tell you the problem is prediction. With HBC, each access to RAM is still going to be very slow, so the data has to be prefetched; HBC is not going to make the accesses "better" on its own.
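To make the prediction problem concrete, here's a hedged sketch of how a streamer might guess what to prefetch: extrapolate the camera ahead and request tiles at the mip level the predicted position will need. The camera model, lead time, and mip formula are illustrative assumptions, not anyone's shipping engine:

```cpp
// Sketch of prediction-driven prefetch in a streamed-texture system.
#include <cmath>
#include <cstdio>

struct Vec3 { float x, y, z; };

// Extrapolate the camera a fixed lead time ahead so the tile request
// can be issued early enough to hide the RAM (or SSD) latency.
Vec3 predict_camera(Vec3 pos, Vec3 vel, float lead_seconds) {
    return { pos.x + vel.x * lead_seconds,
             pos.y + vel.y * lead_seconds,
             pos.z + vel.z * lead_seconds };
}

// Pick a mip level from distance: nearby tiles need full resolution
// (mip 0), distant tiles can stay coarse and cheap.
int needed_mip(Vec3 cam, Vec3 tile, float base_distance) {
    float dx = cam.x - tile.x, dy = cam.y - tile.y, dz = cam.z - tile.z;
    float d = std::sqrt(dx * dx + dy * dy + dz * dz);
    return static_cast<int>(std::log2(std::fmax(d / base_distance, 1.0f)));
}

int main() {
    Vec3 pos{0, 0, 0}, vel{10, 0, 0};  // camera moving along +x
    Vec3 tile{64, 0, 0};
    // Request against the *predicted* position; if the player turns,
    // the prediction is wrong and the miss stalls the frame anyway --
    // which is why prediction, not raw capacity, is the hard part.
    Vec3 future = predict_camera(pos, vel, 0.5f);
    std::printf("prefetch tile at mip %d (now it would be mip %d)\n",
                needed_mip(future, tile, 16.0f),
                needed_mip(pos, tile, 16.0f));
}
```

However big the addressable pool is, this guess is the part that decides whether you hitch or not.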