AMD Radeon "Navi 3x" Could See 50% Increase in Shaders, Double the Cache Memory

btarunr · Dec 20, 2021

AMD's next generation Radeon "Navi 3x" line of GPUs could see a 50% increase in shaders and a doubling Infinity Cache memory size, according to some educated-guesswork and intelligence by Greymon55, a reliable source with GPU leaks. The Navi 31, Navi 32, and Navi 33 chips are expected to debut the new RDNA3 graphics architecture, and succeed the 6 nm optical-shrinks of existing Navi 2x chips that AMD is rumored to be working on.

The top Navi 31 part allegedly features 60 workgroup processors (WGPs), or 120 compute units. Assuming an RDNA3 CU still holds 64 stream processors, you're looking at 7,680 stream processors, a 50% increase over Navi 21. The Navi 32 silicon features 40 WGPs, and exactly the same number of shaders as the current Navi 21, at 5,120. The smallest of the three, the Navi 33, packs 16 WGPs, or 2,048 shaders. There is a generational doubling in cache memory, with 256 MB on the Navi 31, 192 MB on the Navi 32, and 64 MB on the Navi 33. Interestingly, the memory sizes and bus widths are unchanged, but AMD could leverage faster GDDR6 memory types. 2022 will see the likes of Samsung ship GDDR6 chips with data-rates as high as 24 Gbps.

View at TechPowerUp Main Site

davideneco · Dec 20, 2021

It's literally the same specifications than the last time he just made a other tweet for like

Website are desperate

Chomiq · Dec 20, 2021

And triple the price.

Pumper · Dec 20, 2021

Chomiq said:
And triple the price.

With 50% of current supply.

AlwaysHope · Dec 20, 2021

When these hit the market, maybe my next gpu upgrade unless Intel come up with something before?
Love the speculation about supply & prices into the future, so entertaining! :laugh:

ARF · Dec 20, 2021

When?

7689 shaders or double that up to 15360 shaders?

AMD Next-Gen RDNA 3 & RDNA 4 GPU Rumors: Over 50% Performance Increase, Increased Radeon RX 7000 Pricing & Launch In 2H 2022 (wccftech.com)

Jeager · Dec 20, 2021

Chomiq said:
And triple the price.

Should only be x2.5, x0.5 for the 50% shader increase and x2 for the double cache

Chrispy_ · Dec 20, 2021

It only matters if people can buy it, otherwise they may as well just stop calling it Navi and start calling it CMP.

ARF · Dec 20, 2021

Jeager said:
Should only be x2.5, x0.5 for the 50% shader increase and x2 for the double cache

AMD doesn't want to bankrupt its GPU division, does it?

Punkenjoy · Dec 20, 2021

ARF said:
When?

7689 shaders or double that up to 15360 shaders?

View attachment 229535
AMD Next-Gen RDNA 3 & RDNA 4 GPU Rumors: Over 50% Performance Increase, Increased Radeon RX 7000 Pricing & Launch In 2H 2022 (wccftech.com)

Still have to see what they will do witch case, is it 256 MB per die (512 MB total) or 128 MB per die (256 MB total) the fact that it's now MCM seem to send rumors all over the place.

I hope it's 256 MB per die, or at least 512 MB total in the case of those cache would be in a third I/O or connector die.

There are also no rumors on architectural change. Just a compute unit count. AMD use their regular units to do raytracing vs Nvidia that have dedicated units. We will see how RDNA3 perform in raytracing workload. Indeed the increased performance (and maybe cache) will help but will it be enough to compete in those workload with Ampere ? we will see.

As for the performance increase, it could be this

- 50% more core counts
- Increased frequency due to better nodes (3GHz?)
- More infinity cache (and faster too if 3GHz reachable)
- Higher memory bandwidth
- increased IPC

is 2.5x possible ? i do not know but we will need to have all those checked to be able to achieve the 2.5x people are speculating about.

ARF · Dec 20, 2021

Punkenjoy said:
Still have to see what they will do witch case, is it 256 MB per die (512 MB total) or 128 MB per die (256 MB total) the fact that it's now MCM seem to send rumors all over the place.

I hope it's 256 MB per die, or at least 512 MB total in the case of those cache would be in a third I/O or connector die.

There are also no rumors on architectural change. Just a compute unit count. AMD use their regular units to do raytracing vs Nvidia that have dedicated units. We will see how RDNA3 perform in raytracing workload. Indeed the increased performance (and maybe cache) will help but will it be enough to compete in those workload with Ampere ? we will see.

As for the performance increase, it could be this

- 50% more core counts
- Increased frequency due to better nodes (3GHz?)
- More infinity cache (and faster too if 3GHz reachable)
- Higher memory bandwidth
- increased IPC

is 2.5x possible ? i do not know but we will need to have all those checked to be able to achieve the 2.5x people are speculating about.

The speculations come from similar articles:

AMD & NVIDIA Next-Gen Flagship GPUs Detailed: RDNA 3 Radeon RX 7900 XT With 15360 Cores, Ada Lovelace GeForce RTX 4090 With 18432 Cores (wccftech.com)

The details are not that interesting. It is obvious that they have found a way with significant architectural improvements to make the performance better.

The question is when the market will see it. Because if it is 2023, then Navi 21 would be more than two years old already!

Also, they are not optimistic at all about the shortages and the scalper pricings.

Oberon · Dec 20, 2021

Punkenjoy said:
Still have to see what they will do witch case, is it 256 MB per die (512 MB total) or 128 MB per die (256 MB total) the fact that it's now MCM seem to send rumors all over the place.

I hope it's 256 MB per die, or at least 512 MB total in the case of those cache would be in a third I/O or connector die.

256 MB on the IOD, none on the GCDs.

The speculations come from similar articles:

View attachment 229577
AMD & NVIDIA Next-Gen Flagship GPUs Detailed: RDNA 3 Radeon RX 7900 XT With 15360 Cores, Ada Lovelace GeForce RTX 4090 With 18432 Cores (wccftech.com)

The details are not that interesting. It is obvious that they have found a way with significant architectural improvements to make the performance better.

The question is when the market will see it. Because if it is 2023, then Navi 21 would be more than two years old already!

Also, they are not optimistic at all about the shortages and the scalper pricings.

Hopper isn't even a consumer graphics part...

mechtech · Dec 20, 2021

And optimized for mining

aka the legal way to get your own money printing press

Vayra86 · Dec 20, 2021

ARF said:
The speculations come from similar articles:

View attachment 229577
AMD & NVIDIA Next-Gen Flagship GPUs Detailed: RDNA 3 Radeon RX 7900 XT With 15360 Cores, Ada Lovelace GeForce RTX 4090 With 18432 Cores (wccftech.com)

The details are not that interesting. It is obvious that they have found a way with significant architectural improvements to make the performance better.

The question is when the market will see it. Because if it is 2023, then Navi 21 would be more than two years old already!

Also, they are not optimistic at all about the shortages and the scalper pricings.

x2,2 ? Ah, yes of the unobtanium full die that never gets into the Geforce stack. Gotcha. Where'd that Volta thing go anyway ?

Meanwhile, the realistic gen-to-gen per-tier perf increase is and has always been 20-50%. 50% being the absolute jaw droppers, like Pascal.

ARF · Dec 20, 2021

Vayra86 said:
x2,2 ? Ah, yes of the unobtanium full die that never gets into the Geforce stack. Gotcha. Where'd that Volta thing go anyway ?

Meanwhile, the realistic gen-to-gen per-tier perf increase is and has always been 20-50%. 50% being the absolute jaw droppers, like Pascal.

Well, architectural improvements, more shaders, die shrink, up the wattage to 450-500 watts and call it a hey-day.

mastrdrver · Dec 20, 2021

ARF said:
The speculations come from similar articles:

View attachment 229577
AMD & NVIDIA Next-Gen Flagship GPUs Detailed: RDNA 3 Radeon RX 7900 XT With 15360 Cores, Ada Lovelace GeForce RTX 4090 With 18432 Cores (wccftech.com)

The details are not that interesting. It is obvious that they have found a way with significant architectural improvements to make the performance better.

The question is when the market will see it. Because if it is 2023, then Navi 21 would be more than two years old already!

Also, they are not optimistic at all about the shortages and the scalper pricings.

That graph from WCCFtech is fanciful thinking. At best there will be a 50% increase in performance. There is no way they're going to get a 100%+ increase in performance over the previous generation. You can't increase wattage enough to get that kind of generational uplift.

Even when you had 5870 (which was a doubling of 4870) you didn't even see a 100% increase over the previous generation. You only saw (at best) a 50% increase in performance but that was at 2560x1600 with was on a $1,000 USD monitor that very few had. 40% increase at 1080p was the reality of that card. Even with in the same generation, a halving of the gpu power never produces half the performance. It's because there's always an issue with feeding the gpu enough data.

TheoneandonlyMrK · Dec 20, 2021

mastrdrver said:
That graph from WCCFtech is fanciful thinking. At best there will be a 50% increase in performance. There is no way they're going to get a 100%+ increase in performance over the previous generation. You can't increase wattage enough to get that kind of generational uplift.

Even when you had 5870 (which was a doubling of 4870) you didn't even see a 100% increase over the previous generation. You only saw (at best) a 50% increase in performance but that was at 2560x1600 with was on a $1,000 USD monitor that very few had. 40% increase at 1080p was the reality of that card. Even with in the same generation, a halving of the gpu power never produces half the performance. It's because there's always an issue with feeding the gpu enough data.

Firstly the node shrink and architectural changes can account for some of that 50% (possibly not proven)
Second they provided proof already that infinity cache works they just need to improve/enlarge.

londiste · Dec 20, 2021

Oberon said:
256 MB on the IOD, none on the GCDs.

Would it make sense to only have cache on IOD? Impact depends on what packaging they use but IOD is always a hop away.

bug · Dec 21, 2021

At this point, I believe buyers would be happy with a meager performance improvement, but actual availability and reasonable prices (e.g. $200 for FHD gaming) instead.

mastrdrver · Dec 21, 2021

TheoneandonlyMrK said:
Firstly the node shrink and architectural changes can account for some of that 50% (possibly not proven)
Second they provided proof already that infinity cache works they just need to improve/enlarge.

Historical trends say that it's not going to happen.

6900 XT is about 40% faster then 5700 XT. 25% of that is from just the difference in clock speeds. There is no president for a 100%+ increase in performance (if that's what you're arguing for). If there was some simple way to do it, both nVidia and AMD (ATI) would have done it a long time ago.

Remember that even 30% increase over generation is an impressive feat these days. So, I do think a 50% increase is in a realistic possibility.

With Infinity Cache (IFC), they're increasing the size because they're putting two dies together. From the speculation I've read, each die has a two 64-bit memory controllers on them and they use the IFC (128MB each die) as the "crossbar" between the two. They already have a 58% hit rate according to AMD. I know CPU architectures are different, but even Tim Keller said there is a deminising return for increasing hit rate in caches as you have to increase the transistor count exponentially to receive a increase (though reduced) hit rate. I'm sure the same applied to GPU caches otherwise they would have already done it by now. Likewise, in RDNA2, the ICF is there to reduce the hits to memory. So not only are you looking to increase the hitrate and save memory transfers, but you're also looking have a good hit rate so the two dies don't have to hit memory that often either. Maybe that's what they plan on using the other 40% that's not being as efficient.

TheoneandonlyMrK · Dec 21, 2021

mastrdrver said:
Historical trends say that it's not going to happen.

6900 XT is about 40% faster then 5700 XT. 25% of that is from just the difference in clock speeds. There is no president for a 100%+ increase in performance (if that's what you're arguing for). If there was some simple way to do it, both nVidia and AMD (ATI) would have done it a long time ago.

Remember that even 30% increase over generation is an impressive feat these days. So, I do think a 50% increase is in a realistic possibility.

With Infinity Cache (IFC), they're increasing the size because they're putting two dies together. From the speculation I've read, each die has a two 64-bit memory controllers on them and they use the IFC (128MB each die) as the "crossbar" between the two. They already have a 58% hit rate according to AMD. I know CPU architectures are different, but even Tim Keller said there is a deminising return for increasing hit rate in caches as you have to increase the transistor count exponentially to receive a increase (though reduced) hit rate. I'm sure the same applied to GPU caches otherwise they would have already done it by now. Likewise, in RDNA2, the ICF is there to reduce the hits to memory. So not only are you looking to increase the hitrate and save memory transfers, but you're also looking have a good hit rate so the two dies don't have to hit memory that often either. Maybe that's what they plan on using the other 40% that's not being as efficient.

Tim Keller?!

There's a whole lineup planned, I expect the top end with more than one chip could double performance alone without the other upgrades.

Depends on many things really they'll be two to four SKUs I hope I'm more optimistic then you but it's far from proven , I wouldn't debate my stance too hard, it's rumours.

Punkenjoy · Dec 21, 2021

londiste said:
Would it make sense to only have cache on IOD? Impact depends on what packaging they use but IOD is always a hop away.

From what i seen from Dieshot of Navi 22 and Navi 23, it look like the Cache is tied to the memory controller/memory bus and they had the option to put less or more cache per "lane"

So from that point of view, it would make sense.

But that would mean the infinity fabric link between the I/O die and the chiplet is huge. right now, on die, AMD state that it's 16 x 64b for NAVI21. it would mean probably at least 12 x 2 x 64b for Navi 31. Not undoable but i wonder how it will be expensive to make with an interposer.

I think at 2 GHz, infinity cache Bandwidth is around 1.9 TB/s.

mastrdrver · Dec 21, 2021

TheoneandonlyMrK said:
Tim Keller?!

There's a whole lineup planned, I expect the top end with more than one chip could double performance alone without the other upgrades.

Depends on many things really they'll be two to four SKUs I hope I'm more optimistic then you but it's far from proven , I wouldn't debate my stance too hard, it's rumours.

Plus too I think they're expecting a 40%+ efficiency increase? I'm very skeptical of the more then 50% increase, but it will be amazing if true. Wouldn't have seen those improvement since the late 90s and 3Dfx SLI setups.

Prima.Vera · Dec 21, 2021

hopefully they won't follow nGredia recepy:
more expensive, less yields, sold to black market first, etc...

Oberon · Dec 21, 2021

mastrdrver said:
Even when you had 5870 (which was a doubling of 4870) you didn't even see a 100% increase over the previous generation. You only saw (at best) a 50% increase in performance but that was at 2560x1600 with was on a $1,000 USD monitor that very few had. 40% increase at 1080p was the reality of that card.

That's a 58.7% increase...

System Name	RBMK-1000
Processor	AMD Ryzen 7 5700G
Motherboard	Gigabyte B550 AORUS Elite V2
Cooling	DeepCool Gammax L240 V2
Memory	2x 16GB DDR4-3200
Video Card(s)	Galax RTX 4070 Ti EX
Storage	Samsung 990 1TB
Display(s)	BenQ 1440p 60 Hz 27-inch
Case	Corsair Carbide 100R
Audio Device(s)	ASUS SupremeFX S1220A
Power Supply	Cooler Master MWE Gold 650W
Mouse	ASUS ROG Strix Impact
Keyboard	Gamdias Hermes E2
Software	Windows 11 Pro

Processor	Ryzen 7 5800X3D
Motherboard	Gigabyte X570 Aorus Elite
Cooling	Thermalright Phantom Spirit 120 SE
Memory	2x16 GB Crucial Ballistix 3600 CL16 Rev E @ 3600 CL14
Video Card(s)	RTX3080 Ti FE
Storage	SX8200 Pro 1 TB, Plextor M6Pro 256 GB, WD Blue 2TB
Display(s)	LG 34GN850P-B
Case	SilverStone Primera PM01 RGB
Audio Device(s)	SoundBlaster G6 \| Fidelio X2 \| Sennheiser 6XX
Power Supply	SeaSonic Focus Plus Gold 750W
Mouse	Endgame Gear XM1R
Keyboard	Wooting Two HE

Processor	Ryzen 9 3900x
Motherboard	MSI B550 Gaming Plus
Cooling	be quiet! Dark Rock Pro 4
Memory	32GB GSkill Ripjaws V 3600CL16
Video Card(s)	3060Ti FE 0.9v
Storage	Samsung 970 EVO 1TB, 2x Samsung 840 EVO 1TB
Display(s)	ASUS ProArt PA278QV
Case	be quiet! Pure Base 500
Audio Device(s)	Edifier R1850DB
Power Supply	Super Flower Leadex III 650W
Mouse	A4Tech X-748K
Keyboard	Logitech K300
Software	Win 10 Pro 64bit

Processor	5700X
Motherboard	ASRock Gaming X
Memory	2x8 3600 CL14
Video Card(s)	1080 SEAHAWK
Display(s)	ViewSonic XG2703-GS

System Name	Bragging Rights
Processor	Atom Z3735F 1.33GHz
Motherboard	It has no markings but it's green
Cooling	No, it's a 2.2W processor
Memory	2GB DDR3L-1333
Video Card(s)	Gen7 Intel HD (4EU @ 311MHz)
Storage	32GB eMMC and 128GB Sandisk Extreme U3
Display(s)	10" IPS 1280x800 60Hz
Case	Veddha T2
Audio Device(s)	Apparently, yes
Power Supply	Samsung 18W 5V fast-charger
Mouse	MX Anywhere 2
Keyboard	Logitech MX Keys (not Cherry MX at all)
VR HMD	Samsung Oddyssey, not that I'd plug it into this though....
Software	W10 21H1, barely
Benchmark Scores	I once clocked a Celeron-300A to 564MHz on an Abit BE6 and it scored over 9000.

AMD Radeon "Navi 3x" Could See 50% Increase in Shaders, Double the Cache Memory

btarunr

Editor & Senior Moderator

davideneco

Chomiq

Pumper

AlwaysHope

ARF

Jeager

Chrispy_

ARF

Punkenjoy

ARF

Attachments

Oberon

mechtech

Vayra86

ARF

mastrdrver

TheoneandonlyMrK

londiste

bug

mastrdrver

TheoneandonlyMrK

Punkenjoy

mastrdrver

Prima.Vera

Oberon

Processor	AMD Ryzen 5900X
Motherboard	MSI MAG X570 Tomahawk
Cooling	Dual custom loops
Memory	4x8GB G.SKILL Trident Z Neo 3200C14 B-Die
Video Card(s)	AMD Radeon RX 6800XT Reference
Storage	ADATA SX8200 480GB, Inland Premium 2TB, various HDDs
Display(s)	MSI MAG341CQ
Case	Meshify 2 XL
Audio Device(s)	Schiit Fulla 3
Power Supply	Super Flower Leadex Titanium SE 1000W
Mouse	Glorious Model D
Keyboard	Drop CTRL, lubed and filmed Halo Trues

Processor	Ryzen 5700x
Motherboard	Gigabyte X570S Aero G R1.1 BiosF5g
Cooling	Noctua NH-C12P SE14 w/ NF-A15 HS-PWM Fan 1500rpm
Memory	Micron DDR4-3200 2x32GB D.S. D.R. (CT2K32G4DFD832A)
Video Card(s)	AMD RX 6800 - Asus Tuf
Storage	Kingston KC3000 1TB & 2TB & 4TB Corsair MP600 Pro LPX
Display(s)	LG 27UL550-W (27" 4k)
Case	Be Quiet Pure Base 600 (no window)
Audio Device(s)	Realtek ALC1220-VB
Power Supply	SuperFlower Leadex V Gold Pro 850W ATX Ver2.52
Mouse	Mionix Naos Pro
Keyboard	Corsair Strafe with browns
Software	W10 22H2 Pro x64

System Name	Tiny the White Yeti
Processor	7800X3D
Motherboard	MSI MAG Mortar b650m wifi
Cooling	CPU: Thermalright Peerless Assassin / Case: Phanteks T30-120 x3
Memory	32GB Corsair Vengeance 30CL6000
Video Card(s)	ASRock RX7900XT Phantom Gaming
Storage	Lexar NM790 4TB + Samsung 850 EVO 1TB + Samsung 980 1TB + Crucial BX100 250GB
Display(s)	Gigabyte G34QWC (3440x1440)
Case	Lian Li A3 mATX White
Audio Device(s)	Harman Kardon AVR137 + 2.1
Power Supply	EVGA Supernova G2 750W
Mouse	Steelseries Aerox 5
Keyboard	Lenovo Thinkpad Trackpoint II
VR HMD	HD 420 - Green Edition ;)
Software	W11 IoT Enterprise LTSC
Benchmark Scores	Over 9000

System Name	Money Hole
Processor	Core i7 970
Motherboard	Asus P6T6 WS Revolution
Cooling	Noctua UH-D14
Memory	2133Mhz 12GB (3x4GB) Mushkin 998991
Video Card(s)	Sapphire Tri-X OC R9 290X
Storage	Samsung 1TB 850 Evo
Display(s)	3x Acer KG240A 144hz
Case	CM HAF 932
Audio Device(s)	ADI (onboard)
Power Supply	Enermax Revolution 85+ 1050w
Mouse	Logitech G602
Keyboard	Logitech G710+
Software	Windows 10 Professional x64

System Name	RyzenGtEvo/ Asus strix scar II
Processor	Amd R5 5900X/ Intel 8750H
Motherboard	Crosshair hero8 impact/Asus
Cooling	360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory	Gskill Trident Z 3900cas18 32Gb in four sticks./16Gb/16GB
Video Card(s)	Asus tuf RX7900XT /Rtx 2060
Storage	Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s)	Samsung UAE28"850R 4k freesync.dell shiter
Case	Lianli 011 dynamic/strix scar2
Audio Device(s)	Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply	corsair 1200Hxi/Asus stock
Mouse	Roccat Kova/ Logitech G wireless
Keyboard	Roccat Aimo 120
VR HMD	Oculus rift
Software	Win 10 Pro
Benchmark Scores	laptop Timespy 6506

Processor	Ryzen 7800X3D
Motherboard	ROG STRIX B650E-F GAMING WIFI
Memory	2x16GB G.Skill Flare X5 DDR5-6000 CL36 (F5-6000J3636F16GX2-FX5)
Video Card(s)	INNO3D GeForce RTX™ 4070 Ti SUPER TWIN X2
Storage	2TB Samsung 980 PRO, 4TB WD Black SN850X
Display(s)	42" LG C2 OLED, 27" ASUS PG279Q
Case	Thermaltake Core P5
Power Supply	Fractal Design Ion+ Platinum 760W
Mouse	Corsair Dark Core RGB Pro SE
Keyboard	Corsair K100 RGB
VR HMD	HTC Vive Cosmos

Processor	Intel i5-12600k
Motherboard	Asus H670 TUF
Cooling	Arctic Freezer 34
Memory	2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s)	EVGA GTX 1060 SC
Storage	500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s)	Dell U3219Q + HP ZR24w
Case	Raijintek Thetis
Audio Device(s)	Audioquest Dragonfly Red :D
Power Supply	Seasonic 620W M12
Mouse	Logitech G502 Proteus Core
Keyboard	G.Skill KM780R
Software	Arch Linux + Win10

Processor	Intel® Core™ i7-13700K
Motherboard	Gigabyte Z790 Aorus Elite AX
Cooling	Noctua NH-D15
Memory	32GB(2x16) DDR5@6600MHz G-Skill Trident Z5
Video Card(s)	KUROUTOSHIKOU RTX 5080 GALAKURO
Storage	2TB SK Platinum P41 SSD + 4TB SanDisk Ultra SSD + 500GB Samsung 840 EVO SSD
Display(s)	Acer Predator X34 3440x1440@100Hz G-Sync
Case	NZXT PHANTOM410-BK
Audio Device(s)	Creative X-Fi Titanium PCIe
Power Supply	Corsair 850W
Mouse	Logitech Hero G502 SE
Software	Windows 11 Pro - 64bit
Benchmark Scores	30FPS in NFS:Rivals