Tuesday, May 7th 2019
AMD Radeon RX 3080 XT "Navi" to Challenge RTX 2070 at $330
Rumors of AMD's next-generation performance-segment graphics card are gaining traction following a leak of what is possibly its PCB. Tweaktown put out a boatload of information on the so-called Radeon RX 3080 XT graphics card, bound for an E3 2019 launch shortly after a Computex unveiling. Based on the 7 nm "Navi 10" GPU, the RX 3080 XT will feature 56 compute units based on the faster "Navi" architecture (3,584 stream processors), and 8 GB of GDDR6 memory across a 256-bit wide memory bus.
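The leaked figures are internally consistent, which is worth a quick sanity check. The sketch below assumes the GCN convention of 64 stream processors per compute unit and a typical 2019-era GDDR6 per-pin data rate of 14 Gbps; the data rate is our assumption, not part of the leak.

```python
# Back-of-the-envelope check of the leaked RX 3080 XT specs.

def stream_processors(compute_units, sp_per_cu=64):
    """GCN-style GPUs pack 64 stream processors per compute unit."""
    return compute_units * sp_per_cu

def memory_bandwidth_gbs(bus_width_bits, data_rate_gbps):
    """Peak bandwidth in GB/s: (bus width in bits / 8) x per-pin data rate."""
    return bus_width_bits / 8 * data_rate_gbps

print(stream_processors(56))             # 3584, matching the leak
print(memory_bandwidth_gbs(256, 14.0))   # 448.0 GB/s (assuming 14 Gbps GDDR6)
```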
The source makes two very sensational claims: one, that the RX 3080 XT performs competitively with NVIDIA's $499 GeForce RTX 2070; and two, that AMD could start a price war against NVIDIA by aggressively pricing the card around the $330 mark, or about two-thirds the price of the RTX 2070. If either claim holds true, let alone both, AMD will fire up the performance segment once again, forcing NVIDIA to revisit the RTX 2070 and RTX 2060.
Source:
Tweaktown
213 Comments on AMD Radeon RX 3080 XT "Navi" to Challenge RTX 2070 at $330
AMD didn't develop its Bulldozer architecture the way it could have, had it chosen to go for high performance. We have no idea what a Keller-level talent could have done with it, let alone what more ordinary engineers could have done had AMD chosen to follow Piledriver with a successor on a high-performance node (e.g. 22nm IBM or even 32nm GF), designed with the things Piledriver was missing: better micro-op caching, more capable individual cores, better AVX performance (e.g. fixing the regression from Bulldozer) plus AVX2 support, and an L3 cache with decent performance. I have also heard anecdotally that Linux runs Piledriver much more efficiently than Windows when tuned for the architecture, so there may have been a Windows performance obstacle that could have been overcome.
People praised SMT and condemned CMT, but we've seen enough examples recently of Intel not even enabling SMT in CPUs that offer good performance. I think it's therefore dubious to assume that SMT is needed for high performance, making the "SMT is vastly superior to CMT" argument questionable. I wonder if it's possible/worthwhile to do the opposite of what AMD did and have two FPU units for every integer unit.
One of the worst things about Bulldozer is that we'll never know what the architecture could have been had it been developed more effectively. It should have never been released in its original state ("Bulldozer") and Piledriver wasn't enough of an improvement either. 8 core consumer CPUs were also premature considering the primitiveness of Windows and most software.
Vega 56 has 410 GB/s memory bandwidth.
Vega 64 LC OC+UV at ~1750 MHz yields similar results to the VII, despite the VII having 2X the memory bandwidth of the Vega 64 LC.
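The bandwidth figures in the comments above follow from the HBM2 bus widths and per-pin data rates. A rough comparison, using the published pin rates (treat them as approximate):

```python
# Peak HBM2 bandwidth in GB/s: (bus width in bits / 8) x per-pin rate in Gbps.
def hbm_bandwidth_gbs(bus_width_bits, pin_rate_gbps):
    return bus_width_bits / 8 * pin_rate_gbps

vega56 = hbm_bandwidth_gbs(2048, 1.6)    # ~410 GB/s, as stated above
vega64 = hbm_bandwidth_gbs(2048, 1.89)   # ~484 GB/s
vii    = hbm_bandwidth_gbs(4096, 2.0)    # ~1024 GB/s

print(vii / vega64)  # ~2.1x, the "2X" ratio referred to above
```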
NAVI has memory compression improvements (per r/Amd/comments/9du2w4).
Fact remains, the RTX 2080 Ti has 88 ROPs and six GPC blocks (each GPC with at least one raster engine), a superiority over the VII's 64 ROPs and four raster engines.
TFLOPS is nothing without raster engines and ROPs (the graphics read/write units). Note why AMD is pushing the compute shader path, i.e. using TMUs as read/write units.
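The ROP advantage shows up directly in peak pixel fillrate, which scales as ROP count times clock. A minimal sketch; the boost clocks below are approximate typical values we've assumed, not figures from the thread:

```python
# Peak pixel fillrate in Gpix/s: ROP count x clock in GHz
# (each ROP can retire one pixel per clock).
def pixel_fillrate_gpix(rops, clock_ghz):
    return rops * clock_ghz

rtx_2080_ti = pixel_fillrate_gpix(88, 1.545)  # ~136 Gpix/s (assumed ~1545 MHz boost)
radeon_vii  = pixel_fillrate_gpix(64, 1.75)   # ~112 Gpix/s (assumed ~1750 MHz boost)

print(rtx_2080_ti, radeon_vii)
```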
You compared cards which are not the same as those I mentioned and then went on to waffle about things which are less relevant to most.
Piledriver was a much more efficient version of Bulldozer, which did significantly increase the overall performance. AMD had no choice but to do this, at least for the Desktop Gaming segment.
Bulldozer → Piledriver → Steamroller → Excavator → Zen → Zen+ → Zen 2...
EDITED.
I got my CEOs confused and made corrections.
Where did you see me mentioning RBE/ROP performance? Fermi was performant not simplistically due to the GS yielding >50% perf/clock, but due to the follow-on uarch benefits of the PolyMorph engines allowing decoupling of the front end, resulting in far greater extraction of parallelism. This gave better utilization and fewer bubbles/stalls in the pipeline. The GF silicon implementation didn't match the expected RTL, but each iteration since has led to improvements.

Does that also extend to die area? ;) It's a repurposed MI50, whattayagonnado? As a low-volume gaming SKU, it's probably the bottom of the barrel of working 7 nm chips, which might be marginal under thermals/load. The cost to package it as a lower frame-buffer/bandwidth SKU might be marginal, and the full spec can be exploited by marketing vs. the competition.

There's a simple metric really: TU102 at 18B transistors outperforms Vega 20 at 13B transistors, because the silicon is deployed in a much better uarch; e.g. Vega's 3.3 TFLOPs of FP64 is no benefit to gamers. The traditional GS/HS/DS geometry stages may well be deprecated in favor of more flexible and performant primitive/mesh shaders, but don't conflate GF→TU with GCN 1→9. It's not just the ROPs/TMUs in NV's favour; it's the decoupling of the front end and the ability to extract much more parallelism that allows higher utilization from lower peak FLOPs. We also need to consider better bandwidth utilization, data reuse (registers/cache), etc.
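The utilization argument above can be made concrete with peak FP32 throughput, which is 2 ops per FMA times shader count times clock. At roughly equal peak TFLOPS, TU102 outperforms Vega 20 in games, so the gap must come from utilization rather than raw math rate. The boost clocks below are approximate assumed values:

```python
# Peak FP32 throughput in TFLOPS: 2 ops per FMA x shaders x clock in GHz / 1000.
def peak_fp32_tflops(shaders, clock_ghz):
    return 2 * shaders * clock_ghz / 1000

tu102  = peak_fp32_tflops(4352, 1.545)  # ~13.4 TFLOPS (RTX 2080 Ti, assumed clock)
vega20 = peak_fp32_tflops(3840, 1.75)   # ~13.4 TFLOPS (Radeon VII, assumed clock)

# Near-identical peak numbers; the real-world gaming gap is utilization.
print(tu102, vega20)
```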
Ah, ok then.