Thursday, May 19th 2022

AMD "Navi 31" Rumored to Feature 384-bit GDDR6 Memory Interface

AMD has historically thrown brute memory bus width at solving memory-management problems in its graphics architectures, but the Infinity Cache technology launched with RDNA2 proved to be a game changer, as GPUs with narrow 256-bit memory interfaces could compete with NVIDIA's offerings that have 384-bit wide memory interfaces and faster GDDR6X memory types. It looks like the competition between NVIDIA "Ada" and AMD RDNA3 graphics architectures is about to heat up, as rumors are emerging of AMD giving its biggest next-gen ASIC, the "Navi 31," a 384-bit wide memory interface.

This 50 percent increase in memory bus width, runs in concert with two associated rumors—one, that the company will use faster 20 Gbps GDDR6 memory chips; and two, that AMD may increase the size of the on-die Infinity Cache memory. Samsung is already mass-producing 20 Gbps and 24 Gbps GDDR6 memory chips. These are regular GDDR6 memory chips with JEDEC-standard signaling, and not GDDR6X, an exclusive memory type innovated by NVIDIA and Micron Technology, which leverages PAM4 signaling to increase data-rates. A theoretical "Navi 31" with 20 Gbps GDDR6 memory speeds would enjoy 960 GB/s of memory bandwidth, a massive 87.5 percent bandwidth increase over the RX 6900 XT. The on-die Infinity Cache operates at speeds measured in several TB/s. The increased bus width could also signal an increase in memory sizes, with the RX 6950 XT successor featuring at least 24 GB of memory.
Sources: Greymon55 (Twitter), VideoCardz
Add your own comment

22 Comments on AMD "Navi 31" Rumored to Feature 384-bit GDDR6 Memory Interface

#1
GoldenX
Sounds like a beast for emulation.
...
On Linux.
Posted on Reply
#2
ixi
Lets wait and see. Anyway, I'm more concerned about the prices of gpu's...
Posted on Reply
#3
ratirt
ixiLets wait and see. Anyway, I'm more concerned about the prices of gpu's...
I'm sure the cards will perform very well but as you have said, I'm also more concerned about the price than the performance.
Posted on Reply
#4
Minus Infinity
ixiLets wait and see. Anyway, I'm more concerned about the prices of gpu's...
7900XT will probably hit $1599 vut they argue you can buy a 7800XT for probably $999 that can destroy a 6900XT or a $699 7700XT than can still easily beat a 6900XT. You just have to decide now how fast is fast enough.
Posted on Reply
#5
Keullo-e
S.T.A.R.S.
What a funny coincidence, as HD 7900 series were the first AMD cards to have a 384-bit memory bus. And the only one so far (I don't count HD 8900 and R9 280 series as new products).
Posted on Reply
#6
GunShot
The biggest retailers/OEMs in the world shares have crashed very hard recently (for many, decades lows) and this is just the beginning, mainly due to their own selfish greed, overpricing, and scalping of their products (looking at you Newegg, Amazon, MSI, etc.) which has contributed to today's global inflation that will turn into a global recession soon.

Just imagine if they do dare to attempt another deliberate price hike 2.0 for this next-gen of GPUs (but this time, though, there won't be any extra stimulus checks, etc. for them to bet on receiving to increase sales) and they mass produce cards with super inflated prices. How would the wounded global economy respond to that?
Posted on Reply
#7
Keullo-e
S.T.A.R.S.
GunShotThe biggest retailers/OEMs in the world stock have crashed very hard recently (for many, decades lows) and this is just the beginning, mainly due to their own selfish greed, overpricing, and scalping of their products (looking at you Newegg, Amazon, MSI, etc.) which has contributed to today's global inflation that will turn into a global recession soon.

Just imagine if they do dare to attempt another deliberate price hike 2.0 for this next-gen of GPUs (but this time, though, there won't be any extra stimulus checks, etc. for them to bet on receiving to increase sales) and they mass produce cards with super inflated prices. How would the wounded global economy respond to that?
Is play money mining still profitable? If it is, and mining isn't limited on the coming GPUs, then the prices will surely stay high or go even higher.
Posted on Reply
#8
GunShot
LenneIs play money mining still profitable? If it is, and mining isn't limited on the coming GPUs, then the prices will surely stay high or go even higher.
GPU mining is almost dead and POS is coming in fast. Miners are selling their cards, fast, as we speak extremely low-priced.
Posted on Reply
#9
Jism
LenneWhat a funny coincidence, as HD 7900 series were the first AMD cards to have a 384-bit memory bus. And the only one so far (I don't count HD 8900 and R9 280 series as new products).
They initially opted for a 256 bits bus. Less power required. Less space on PCB needed. Less stress on the IMC. Compensated that with Cache and you have a product that is competing with Nvidia's high end offerings of over 2000$. Mission accomplished i should say.

But with cache you can only go to such extend. At some point you need more bandwidth to supply all the available cores fast enough.
Posted on Reply
#10
Metroid
AMD used to put 512 bit memory for top end gpus, last gen was 256 bit for top end gpus, lets hope AMD does 512 bit memory again + infinity cache.
Posted on Reply
#11
Keullo-e
S.T.A.R.S.
MetroidAMD used to put 512 bit memory for top end gpus, last gen was 256 bit for top end gpus, lets hope AMD does 512 bit memory again + infinity cache.
The only consumer ones with 512-bit were HD 2900 and R9 290 series, they weren't that common.
Posted on Reply
#12
Metroid
LenneThe only consumer ones with 512-bit were HD 2900 and R9 290 series, they weren't that common.
Add Radeon R9 390X to that list in which it was the last one.
Posted on Reply
#13
Keullo-e
S.T.A.R.S.
MetroidAdd Radeon R9 390X to that list in which it was the last one.
I don't count rebrands.
Posted on Reply
#14
Punkenjoy
A larger bus mean a more complex PCB with more component.

But, latest rumors state a single compute CCD with 6 64 bit memory controller/64 MB Infinity cache chiplets on a cheaper nodes. This way they don't have the problem of 2 GPU on a same card like MI250 and they can maximize the compute CCD on the premium node as much as possible.

Can't wait to see how it will end, that will probably be interesting for sure. There was already an association between infinity cache and each 32 bit memory controller in Navi 2X. It really look each block cache the memory attached to the memory controller. A simple way to make it coherent.
Posted on Reply
#15
defaultluser
MetroidAMD used to put 512 bit memory for top end gpus, last gen was 256 bit for top end gpus, lets hope AMD does 512 bit memory again + infinity cache.
No, that was out of desperation for bandwidth! With improvements to compression,they were able to slim the 3090 performance down to 256-bit

They only moved Vega 64 to hBM2 to make power-consumption more manageable on such a high-end card ( because if the Vega 64 had used GTX 1080 Ti width plus memory capacity, it would have had even lower margins/ higher power, all for a card with GTX 1080 performance!)

But with RDNA they finally have comparable memory efficiency / power consumption to Nvidia. So now would be a good time to go back for a second iteration!
Posted on Reply
#16
Punkenjoy
defaultluserNo, that was out of desperation for bandwidth! With improvements to compression,they were able to slim the 3090 performance down to 256-bit

They only moved Vega 64 to hBM2 to make power-consumption more manageable on such a high-end card ( because if the Vega 64 had used GTX 1080 Ti width plus memory capacity, it would have had even lower margins/ higher power, all for a card with GTX 1080 performance!)

But with RDNA they finally have comparable memory efficiency / power consumption to Nvidia. So now would be a good time to go back for a second iteration!
The reason RDNA 2 is more efficient with memory is the addition of the last level cache AKA the Infinity cache, not due to much better compression than Nvidia. They probably be on par in that aspect.

That is also one of the drawback of the cache, the hit ratio get lower at higher resolutions and will probably also get lower on much more complex scene where each pixel will require more data to be processed. This is one of the reason RDNA 2 seems to scale less at 4K than Ampere.

The rumors are for a tripling of that cache so that may be enough to get good scaling at 4k.

But there are other benefits than bandwidth saving for the cache. Since it's much faster (in both bandwidth and latency), it help the frequency scaling since each compute units have to wait less to get the data. One of the reason GPU are clocked lower is to try to hide the memory latency as much as possible. else you just get wasted clocks
Posted on Reply
#17
Flyordie
I really wish they would go back to HBM2* for their high end GPUs. Less power consumed, smaller PCBs which means less demand on PCB manufacturing.. Yes, its more expensive but hell, with the price of GPUs now on the high end.. We should be getting something like HBM for our money. lol

I'm still rocking the Vega64 and its been running perfectly fine at 1000-1050 on the HBM2 for over 530GB/s bandwidth.
Posted on Reply
#18
defaultluser
PunkenjoyThe reason RDNA 2 is more efficient with memory is the addition of the last level cache AKA the Infinity cache, not due to much better compression than Nvidia. They probably be on par in that aspect.
disagree - RDNA 1 had bus-for-bus performance match-up, all before Infinity Cache


Here at 14Gbps 192-bit bus (same as 2060), it matches the 2060 pretty closely!



They only added Infinity cache to be able to improve power efficiency, plus increase performance

They really benefited, with a 2x performance increase over the old model:

Posted on Reply
#19
Punkenjoy
defaultluserdisagree - RDNA 1 had bus-for-bus performance match-up, all before Infinity Cache


They only added Infinity cache to be able to improve power efficiency, plus increase performance
So explain to me why being more efficient doesn't mean improved memory frequency plus increase performance ? What more efficient mean then

Explain why on RDNA1 with no infinity cache, They require the same bus width to be able to compete where on RDNA 2, a 256 bit GPU by example (6900xt) compete with GPU that have 368 bus width (3090). Same for 6700xt (192 bit) with 3070 (256 bit) etc..
Posted on Reply
#20
defaultluser
PunkenjoySo explain to me why being more efficient doesn't mean improved memory frequency plus increase performance ? What more efficient mean then

Explain why on RDNA1 with no infinity cache, They require the same bus width to be able to compete where on RDNA 2, a 256 bit GPU by example (6900xt) compete with GPU that have 368 bus width (3090). Same for 6700xt (192 bit) with 3070 (256 bit) etc..
Memory compression is the reason the 128-bit Maxwell GTx 960 was usually as fast as the 384-bit 7950



RDNA1 closed that gap with Turing

Once your architectures are matched on memory compression, then it becomes a question of how much extra die space you want to waste with all that cache? And is it cheaper than spending money on custom GDDR6X?
Posted on Reply
#21
TheoneandonlyMrK
GunShotThe biggest retailers/OEMs in the world shares have crashed very hard recently (for many, decades lows) and this is just the beginning, mainly due to their own selfish greed, overpricing, and scalping of their products (looking at you Newegg, Amazon, MSI, etc.) which has contributed to today's global inflation that will turn into a global recession soon.

Just imagine if they do dare to attempt another deliberate price hike 2.0 for this next-gen of GPUs (but this time, though, there won't be any extra stimulus checks, etc. for them to bet on receiving to increase sales) and they mass produce cards with super inflated prices. How would the wounded global economy respond to that?
My middle fingers ready.

I expect to pay, but I am not paying silly money even with a 384bit bus.
Posted on Reply
#22
Punkenjoy
defaultluserMemory compression is the reason the 128-bit Maxwell GTx 960 was usually as fast as the 384-bit 7950



RDNA1 closed that gap with Turing

Once your architectures are matched on memory compression, then it becomes a question of how much extra die space you want to waste with all that cache? And is it cheaper than spending money on custom GDDR6X?
Ok, i am not sure why you argue with me.
PunkenjoyThe reason RDNA 2 is more efficient with memory is the addition of the last level cache AKA the Infinity cache, not due to much better compression than Nvidia. They probably be on par in that aspect.
I said that memory compression is probably on par with NVidia and the reason RDNA 2 is more efficient (same performance with lower bandwidth) is the infinity cache and not the compression and I see you don't really contradict that. You stated that AMD was able to match Nvidia 386 bit bus with a 256 bit bus due to memory compression.


As for GDDR6X vs Infinity Cache, it's really up to cost and power usage. More cost on the silicon or more cost on the board and components. The fact that Nvidia drastically increase the cache on Lovelace is probably a good hint that just pushing the memory like crazy (with large bus and high power consumption) isn't the best approach.
Posted on Reply
Add your own comment
May 18th, 2024 10:18 EDT change timezone

New Forum Posts

Popular Reviews

Controversial News Posts