Thursday, September 26th 2024
NVIDIA GeForce RTX 5090 and RTX 5080 Specifications Surface, Showing Larger SKU Segmentation
Thanks to the renowned NVIDIA hardware leaker kopite7Kimi on X, we are getting information about the final versions of NVIDIA's first upcoming wave of GeForce RTX 50 series "Blackwell" graphics cards. The two leaked GPUs are the GeForce RTX 5090 and RTX 5080, which now feature a more significant gap between xx80 and xx90 SKUs. For starters, we have the highest-end GeForce RTX 5090. NVIDIA has decided to use the GB202-300-A1 die and enabled 21,760 FP32 CUDA cores on this top-end model. Accompanying the massive 170 SM GPU configuration, the RTX 5090 has 32 GB of GDDR7 memory on a 512-bit bus, with each GDDR7 die running at 28 Gbps. This translates to 1,568 GB/s memory bandwidth. All of this is confined to a 600 W TGP.
When it comes to the GeForce RTX 5080, NVIDIA has decided to further separate its xx80 and xx90 SKUs. The RTX 5080 has 10,752 FP32 CUDA cores paired with 16 GB of GDDR7 memory on a 256-bit bus. With GDDR7 running at 28 Gbps, the memory bandwidth is also halved at 784 GB/s. This SKU uses a GB203-400-A1 die, which is designed to run within a 400 W TGP power envelope. For reference, the RTX 4090 has 68% more CUDA cores than the RTX 4080. The rumored RTX 5090 has around 102% more CUDA cores than the rumored RTX 5080, which means that NVIDIA is separating its top SKUs even more. We are curious to see at what price point NVIDIA places its upcoming GPUs so that we can compare generational updates and the difference between xx80 and xx90 models and their widened gaps.
Sources:
kopite7kimi (RTX 5090), kopite7kimi (RTX 5080)
When it comes to the GeForce RTX 5080, NVIDIA has decided to further separate its xx80 and xx90 SKUs. The RTX 5080 has 10,752 FP32 CUDA cores paired with 16 GB of GDDR7 memory on a 256-bit bus. With GDDR7 running at 28 Gbps, the memory bandwidth is also halved at 784 GB/s. This SKU uses a GB203-400-A1 die, which is designed to run within a 400 W TGP power envelope. For reference, the RTX 4090 has 68% more CUDA cores than the RTX 4080. The rumored RTX 5090 has around 102% more CUDA cores than the rumored RTX 5080, which means that NVIDIA is separating its top SKUs even more. We are curious to see at what price point NVIDIA places its upcoming GPUs so that we can compare generational updates and the difference between xx80 and xx90 models and their widened gaps.
181 Comments on NVIDIA GeForce RTX 5090 and RTX 5080 Specifications Surface, Showing Larger SKU Segmentation
When I'm saying soldered memory I mean soldered to PCB (and wired by pcb tracks) not die to die soldering, direct bonding or any form of advanced packaging.
I think we are bit closer to agrement now.
When I'm saying decoupled memory with optical interface - I mean (affordable) dynamic memory not static one.
Low latency static memory or even HBM memory are quite different categories for the sake of (high) costs per bit.
I'm sure in 5 years timeframe decoupled memory will be competitive to GDDR7 soldered to pcb. ( GDDR7 as chiplets is quite different story ).
But of course I can be wrong and few more years we will have to waite for this fundamental changes on market.
But even if I'm wrong it still have minor impact on validity of my conclusion - at that fundamentally changed market today 5090 with their soldered GDDR7 ram will looks like a toy. That is my point.
If a 5090 is still able to be competitive with the status quo 5+ years from now, something wrong happened along the way.
But it is totally false picture.
The similiar picture were painted not so far ago in space industry - access to orbit must be expensive. But Musk show us otherwise.
edit
There are more factors than pure Moore law which keeping progress at fast pace now like arms race, US -China rivalry, etc
So goverments trying to stimulate their high-tech to stimulate their expansion plans and pace of progress as well.
Marketing departments trying to fool us in every possible way but we should be aware - what today looks like a bargain it wont be after a year or two so we should be more carefull which way we are spending our money cos future bargains coming to us (despite mainstream media outlets are mostly silent )- like decoupled memories - so we should be a bit more patient.
I have a 4090 because I play at 4K but when I see how it struggles with Next-Gen games at 4K already I don't even want to know how badly it will age! Ray Tracing and mostly Path Tracing are making games too hard to run, and Developers barely optimize their games anymore, so we have to use DLSS and Frame Generation to get decent performance! What a joke...
Sure I enjoy being able to play Cyberpunk 2077, Alan Wake 2, Black Myth: Wukong, etc. with Path Tracing but without DLSS and FG the games run around 25fps at Native 4K lol.
So even if the 5090 was able to 2x performance vs 4090 it would still be below 60fps... meaning we will need to wait for the 6090 to do that, and by then games will be a lot more demanding... it's a never ending story lol. 8K@240Hz ? Even DP 2.1 80Gbps with DSC won't be enough... We'll probably have to wait for DP 3.0 to do that lol.
But 8K@120Hz should be doable with a DP 2.1 80Gbps w/ DSC since it can do 4K@240Hz aka 8K@60Hz without DSC. You'll have to wait for the RTX 5090 and DP 2.1 port though. For Professionals yeah NVLink is a blessing compared to PCI-Express, but for Gamers even the PCIe 3.0 is not fully saturated yet...so PCIe 6.0 and 7.0 will be more useful for SSDs than GPUs.