• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD "Navi 31" Rumored to Feature 384-bit GDDR6 Memory Interface

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
46,439 (7.66/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
AMD has historically thrown brute memory bus width at solving memory-management problems in its graphics architectures, but the Infinity Cache technology launched with RDNA2 proved to be a game changer, as GPUs with narrow 256-bit memory interfaces could compete with NVIDIA's offerings that have 384-bit wide memory interfaces and faster GDDR6X memory types. It looks like the competition between NVIDIA "Ada" and AMD RDNA3 graphics architectures is about to heat up, as rumors are emerging of AMD giving its biggest next-gen ASIC, the "Navi 31," a 384-bit wide memory interface.

This 50 percent increase in memory bus width, runs in concert with two associated rumors—one, that the company will use faster 20 Gbps GDDR6 memory chips; and two, that AMD may increase the size of the on-die Infinity Cache memory. Samsung is already mass-producing 20 Gbps and 24 Gbps GDDR6 memory chips. These are regular GDDR6 memory chips with JEDEC-standard signaling, and not GDDR6X, an exclusive memory type innovated by NVIDIA and Micron Technology, which leverages PAM4 signaling to increase data-rates. A theoretical "Navi 31" with 20 Gbps GDDR6 memory speeds would enjoy 960 GB/s of memory bandwidth, a massive 87.5 percent bandwidth increase over the RX 6900 XT. The on-die Infinity Cache operates at speeds measured in several TB/s. The increased bus width could also signal an increase in memory sizes, with the RX 6950 XT successor featuring at least 24 GB of memory.



View at TechPowerUp Main Site | Source
 
Joined
Oct 2, 2015
Messages
2,997 (0.95/day)
Location
Argentina
System Name Ciel
Processor AMD Ryzen R5 5600X
Motherboard Asus Tuf Gaming B550 Plus
Cooling ID-Cooling 224-XT Basic
Memory 2x 16GB Kingston Fury 3600MHz@3933MHz
Video Card(s) Gainward Ghost 3060 Ti 8GB + Sapphire Pulse RX 6600 8GB
Storage NVMe Kingston KC3000 2TB + NVMe Toshiba KBG40ZNT256G + HDD WD 4TB
Display(s) AOC Q27G3XMN + Samsung S22F350
Case Cougar MX410 Mesh-G
Audio Device(s) Kingston HyperX Cloud Stinger Core 7.1 Wireless PC
Power Supply Aerocool KCAS-500W
Mouse EVGA X15
Keyboard VSG Alnilam
Software Windows 11
Sounds like a beast for emulation.
...
On Linux.
 
Joined
May 31, 2016
Messages
4,329 (1.49/day)
Location
Currently Norway
System Name Bro2
Processor Ryzen 5800X
Motherboard Gigabyte X570 Aorus Elite
Cooling Corsair h115i pro rgb
Memory 16GB G.Skill Flare X 3200 CL14 @3800Mhz CL16
Video Card(s) Powercolor 6900 XT Red Devil 1.1v@2400Mhz
Storage M.2 Samsung 970 Evo Plus 500MB/ Samsung 860 Evo 1TB
Display(s) LG 27UD69 UHD / LG 27GN950
Case Fractal Design G
Audio Device(s) Realtec 5.1
Power Supply Seasonic 750W GOLD
Mouse Logitech G402
Keyboard Logitech slim
Software Windows 10 64 bit
Lets wait and see. Anyway, I'm more concerned about the prices of gpu's...
I'm sure the cards will perform very well but as you have said, I'm also more concerned about the price than the performance.
 
Joined
May 3, 2018
Messages
2,335 (1.06/day)
Lets wait and see. Anyway, I'm more concerned about the prices of gpu's...
7900XT will probably hit $1599 vut they argue you can buy a 7800XT for probably $999 that can destroy a 6900XT or a $699 7700XT than can still easily beat a 6900XT. You just have to decide now how fast is fast enough.
 

Keullo-e

S.T.A.R.S.
Joined
Dec 16, 2012
Messages
11,142 (2.68/day)
Location
Finland
System Name 4K-gaming
Processor AMD Ryzen 7 5800X up to 5.05GHz
Motherboard Gigabyte B550M Aorus Elite
Cooling Custom loop (CPU+GPU, 240 & 120 rads)
Memory 32GB Kingston HyperX Fury @ DDR4-3466
Video Card(s) PowerColor RX 6700 XT Fighter OC/UV
Storage ~4TB SSD + 6TB HDD
Display(s) Acer XV273K 4K120 + Lenovo L32p-30 4K60
Case Corsair 4000D Airflow White
Audio Device(s) Asus TUF H3 Wireless
Power Supply EVGA Supernova G2 750W
Mouse Logitech MX518
Keyboard Roccat Vulcan 121 AIMO
VR HMD Oculus Rift CV1
Software Windows 11 Pro
Benchmark Scores It runs Crysis remastered at 4K
What a funny coincidence, as HD 7900 series were the first AMD cards to have a 384-bit memory bus. And the only one so far (I don't count HD 8900 and R9 280 series as new products).
 
Joined
Apr 22, 2021
Messages
249 (0.22/day)
The biggest retailers/OEMs in the world shares have crashed very hard recently (for many, decades lows) and this is just the beginning, mainly due to their own selfish greed, overpricing, and scalping of their products (looking at you Newegg, Amazon, MSI, etc.) which has contributed to today's global inflation that will turn into a global recession soon.

Just imagine if they do dare to attempt another deliberate price hike 2.0 for this next-gen of GPUs (but this time, though, there won't be any extra stimulus checks, etc. for them to bet on receiving to increase sales) and they mass produce cards with super inflated prices. How would the wounded global economy respond to that?
 

Keullo-e

S.T.A.R.S.
Joined
Dec 16, 2012
Messages
11,142 (2.68/day)
Location
Finland
System Name 4K-gaming
Processor AMD Ryzen 7 5800X up to 5.05GHz
Motherboard Gigabyte B550M Aorus Elite
Cooling Custom loop (CPU+GPU, 240 & 120 rads)
Memory 32GB Kingston HyperX Fury @ DDR4-3466
Video Card(s) PowerColor RX 6700 XT Fighter OC/UV
Storage ~4TB SSD + 6TB HDD
Display(s) Acer XV273K 4K120 + Lenovo L32p-30 4K60
Case Corsair 4000D Airflow White
Audio Device(s) Asus TUF H3 Wireless
Power Supply EVGA Supernova G2 750W
Mouse Logitech MX518
Keyboard Roccat Vulcan 121 AIMO
VR HMD Oculus Rift CV1
Software Windows 11 Pro
Benchmark Scores It runs Crysis remastered at 4K
The biggest retailers/OEMs in the world stock have crashed very hard recently (for many, decades lows) and this is just the beginning, mainly due to their own selfish greed, overpricing, and scalping of their products (looking at you Newegg, Amazon, MSI, etc.) which has contributed to today's global inflation that will turn into a global recession soon.

Just imagine if they do dare to attempt another deliberate price hike 2.0 for this next-gen of GPUs (but this time, though, there won't be any extra stimulus checks, etc. for them to bet on receiving to increase sales) and they mass produce cards with super inflated prices. How would the wounded global economy respond to that?
Is play money mining still profitable? If it is, and mining isn't limited on the coming GPUs, then the prices will surely stay high or go even higher.
 
Joined
Dec 30, 2010
Messages
2,105 (0.43/day)
What a funny coincidence, as HD 7900 series were the first AMD cards to have a 384-bit memory bus. And the only one so far (I don't count HD 8900 and R9 280 series as new products).

They initially opted for a 256 bits bus. Less power required. Less space on PCB needed. Less stress on the IMC. Compensated that with Cache and you have a product that is competing with Nvidia's high end offerings of over 2000$. Mission accomplished i should say.

But with cache you can only go to such extend. At some point you need more bandwidth to supply all the available cores fast enough.
 
Joined
May 8, 2018
Messages
1,498 (0.68/day)
Location
London, UK
AMD used to put 512 bit memory for top end gpus, last gen was 256 bit for top end gpus, lets hope AMD does 512 bit memory again + infinity cache.
 

Keullo-e

S.T.A.R.S.
Joined
Dec 16, 2012
Messages
11,142 (2.68/day)
Location
Finland
System Name 4K-gaming
Processor AMD Ryzen 7 5800X up to 5.05GHz
Motherboard Gigabyte B550M Aorus Elite
Cooling Custom loop (CPU+GPU, 240 & 120 rads)
Memory 32GB Kingston HyperX Fury @ DDR4-3466
Video Card(s) PowerColor RX 6700 XT Fighter OC/UV
Storage ~4TB SSD + 6TB HDD
Display(s) Acer XV273K 4K120 + Lenovo L32p-30 4K60
Case Corsair 4000D Airflow White
Audio Device(s) Asus TUF H3 Wireless
Power Supply EVGA Supernova G2 750W
Mouse Logitech MX518
Keyboard Roccat Vulcan 121 AIMO
VR HMD Oculus Rift CV1
Software Windows 11 Pro
Benchmark Scores It runs Crysis remastered at 4K
AMD used to put 512 bit memory for top end gpus, last gen was 256 bit for top end gpus, lets hope AMD does 512 bit memory again + infinity cache.
The only consumer ones with 512-bit were HD 2900 and R9 290 series, they weren't that common.
 

Keullo-e

S.T.A.R.S.
Joined
Dec 16, 2012
Messages
11,142 (2.68/day)
Location
Finland
System Name 4K-gaming
Processor AMD Ryzen 7 5800X up to 5.05GHz
Motherboard Gigabyte B550M Aorus Elite
Cooling Custom loop (CPU+GPU, 240 & 120 rads)
Memory 32GB Kingston HyperX Fury @ DDR4-3466
Video Card(s) PowerColor RX 6700 XT Fighter OC/UV
Storage ~4TB SSD + 6TB HDD
Display(s) Acer XV273K 4K120 + Lenovo L32p-30 4K60
Case Corsair 4000D Airflow White
Audio Device(s) Asus TUF H3 Wireless
Power Supply EVGA Supernova G2 750W
Mouse Logitech MX518
Keyboard Roccat Vulcan 121 AIMO
VR HMD Oculus Rift CV1
Software Windows 11 Pro
Benchmark Scores It runs Crysis remastered at 4K
Joined
Oct 12, 2005
Messages
682 (0.10/day)
A larger bus mean a more complex PCB with more component.

But, latest rumors state a single compute CCD with 6 64 bit memory controller/64 MB Infinity cache chiplets on a cheaper nodes. This way they don't have the problem of 2 GPU on a same card like MI250 and they can maximize the compute CCD on the premium node as much as possible.

Can't wait to see how it will end, that will probably be interesting for sure. There was already an association between infinity cache and each 32 bit memory controller in Navi 2X. It really look each block cache the memory attached to the memory controller. A simple way to make it coherent.
 
Joined
Aug 6, 2020
Messages
729 (0.53/day)
AMD used to put 512 bit memory for top end gpus, last gen was 256 bit for top end gpus, lets hope AMD does 512 bit memory again + infinity cache.


No, that was out of desperation for bandwidth! With improvements to compression,they were able to slim the 3090 performance down to 256-bit

They only moved Vega 64 to hBM2 to make power-consumption more manageable on such a high-end card ( because if the Vega 64 had used GTX 1080 Ti width plus memory capacity, it would have had even lower margins/ higher power, all for a card with GTX 1080 performance!)

But with RDNA they finally have comparable memory efficiency / power consumption to Nvidia. So now would be a good time to go back for a second iteration!
 
Joined
Oct 12, 2005
Messages
682 (0.10/day)
No, that was out of desperation for bandwidth! With improvements to compression,they were able to slim the 3090 performance down to 256-bit

They only moved Vega 64 to hBM2 to make power-consumption more manageable on such a high-end card ( because if the Vega 64 had used GTX 1080 Ti width plus memory capacity, it would have had even lower margins/ higher power, all for a card with GTX 1080 performance!)

But with RDNA they finally have comparable memory efficiency / power consumption to Nvidia. So now would be a good time to go back for a second iteration!
The reason RDNA 2 is more efficient with memory is the addition of the last level cache AKA the Infinity cache, not due to much better compression than Nvidia. They probably be on par in that aspect.

That is also one of the drawback of the cache, the hit ratio get lower at higher resolutions and will probably also get lower on much more complex scene where each pixel will require more data to be processed. This is one of the reason RDNA 2 seems to scale less at 4K than Ampere.

The rumors are for a tripling of that cache so that may be enough to get good scaling at 4k.

But there are other benefits than bandwidth saving for the cache. Since it's much faster (in both bandwidth and latency), it help the frequency scaling since each compute units have to wait less to get the data. One of the reason GPU are clocked lower is to try to hide the memory latency as much as possible. else you just get wasted clocks
 
Joined
Oct 26, 2008
Messages
2,251 (0.40/day)
System Name Budget AMD System
Processor Threadripper 1900X @ 4.1Ghz (100x41 @ 1.3250V)
Motherboard Gigabyte X399 Aorus Gaming 7
Cooling EKWB X399 Monoblock
Memory 4x8GB GSkill TridentZ RGB 14-14-14-32 CR1 @ 3266
Video Card(s) XFX Radeon RX Vega₆⁴ Liquid @ 1,800Mhz Core, 1025Mhz HBM2
Storage 1x ADATA SX8200 NVMe, 1x Segate 2.5" FireCuda 2TB SATA, 1x 500GB HGST SATA
Display(s) Vizio 22" 1080p 60hz TV (Samsung Panel)
Case Corsair 570X
Audio Device(s) Onboard
Power Supply Seasonic X Series 850W KM3
Software Windows 10 Pro x64
I really wish they would go back to HBM2* for their high end GPUs. Less power consumed, smaller PCBs which means less demand on PCB manufacturing.. Yes, its more expensive but hell, with the price of GPUs now on the high end.. We should be getting something like HBM for our money. lol

I'm still rocking the Vega64 and its been running perfectly fine at 1000-1050 on the HBM2 for over 530GB/s bandwidth.
 
Joined
Aug 6, 2020
Messages
729 (0.53/day)
The reason RDNA 2 is more efficient with memory is the addition of the last level cache AKA the Infinity cache, not due to much better compression than Nvidia. They probably be on par in that aspect.

disagree - RDNA 1 had bus-for-bus performance match-up, all before Infinity Cache


Here at 14Gbps 192-bit bus (same as 2060), it matches the 2060 pretty closely!



They only added Infinity cache to be able to improve power efficiency, plus increase performance

They really benefited, with a 2x performance increase over the old model:

 
Last edited:
Joined
Oct 12, 2005
Messages
682 (0.10/day)
disagree - RDNA 1 had bus-for-bus performance match-up, all before Infinity Cache


They only added Infinity cache to be able to improve power efficiency, plus increase performance

So explain to me why being more efficient doesn't mean improved memory frequency plus increase performance ? What more efficient mean then

Explain why on RDNA1 with no infinity cache, They require the same bus width to be able to compete where on RDNA 2, a 256 bit GPU by example (6900xt) compete with GPU that have 368 bus width (3090). Same for 6700xt (192 bit) with 3070 (256 bit) etc..
 
Joined
Aug 6, 2020
Messages
729 (0.53/day)
So explain to me why being more efficient doesn't mean improved memory frequency plus increase performance ? What more efficient mean then

Explain why on RDNA1 with no infinity cache, They require the same bus width to be able to compete where on RDNA 2, a 256 bit GPU by example (6900xt) compete with GPU that have 368 bus width (3090). Same for 6700xt (192 bit) with 3070 (256 bit) etc..


Memory compression is the reason the 128-bit Maxwell GTx 960 was usually as fast as the 384-bit 7950



RDNA1 closed that gap with Turing

Once your architectures are matched on memory compression, then it becomes a question of how much extra die space you want to waste with all that cache? And is it cheaper than spending money on custom GDDR6X?
 
Joined
Mar 10, 2010
Messages
11,878 (2.30/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Corsair Vengeance Rgb pro 3600cas14 16Gb in four sticks./16Gb/16GB
Video Card(s) Powercolour RX7900XT Reference/Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores 8726 vega 3dmark timespy/ laptop Timespy 6506
The biggest retailers/OEMs in the world shares have crashed very hard recently (for many, decades lows) and this is just the beginning, mainly due to their own selfish greed, overpricing, and scalping of their products (looking at you Newegg, Amazon, MSI, etc.) which has contributed to today's global inflation that will turn into a global recession soon.

Just imagine if they do dare to attempt another deliberate price hike 2.0 for this next-gen of GPUs (but this time, though, there won't be any extra stimulus checks, etc. for them to bet on receiving to increase sales) and they mass produce cards with super inflated prices. How would the wounded global economy respond to that?
My middle fingers ready.

I expect to pay, but I am not paying silly money even with a 384bit bus.
 
Joined
Oct 12, 2005
Messages
682 (0.10/day)
Memory compression is the reason the 128-bit Maxwell GTx 960 was usually as fast as the 384-bit 7950



RDNA1 closed that gap with Turing

Once your architectures are matched on memory compression, then it becomes a question of how much extra die space you want to waste with all that cache? And is it cheaper than spending money on custom GDDR6X?
Ok, i am not sure why you argue with me.

The reason RDNA 2 is more efficient with memory is the addition of the last level cache AKA the Infinity cache, not due to much better compression than Nvidia. They probably be on par in that aspect.


I said that memory compression is probably on par with NVidia and the reason RDNA 2 is more efficient (same performance with lower bandwidth) is the infinity cache and not the compression and I see you don't really contradict that. You stated that AMD was able to match Nvidia 386 bit bus with a 256 bit bus due to memory compression.


As for GDDR6X vs Infinity Cache, it's really up to cost and power usage. More cost on the silicon or more cost on the board and components. The fact that Nvidia drastically increase the cache on Lovelace is probably a good hint that just pushing the memory like crazy (with large bus and high power consumption) isn't the best approach.
 
Top