
NVIDIA GeForce RTX 40 Series "AD104" Could Match RTX 3090 Ti Performance

I was hoping the 4070 Ti would be the cut-down GA103 with a 256-bit bus. 192-bit is a joke; my unrealistic expectations are shattered. But perhaps a Super.

I could be wrong, but isn't the CUDA core count calculated by adding the FP32 cores (7680) to the FP64 cores (3840) for a total of 11,520 cores? That's 10% more cores than the 3090 Ti at 12% lower power. I mean, it's not great, but it's not bad either.

I remember all the rumors not knowing how to count CUDA cores before the 3000 series launch. Looks like that might be the case again.

No, FP64 (double precision) is executed at a 1:64 rate; it is not additive, as far as I understand.

The confusion with GA102 (Ampere) is that 5376 CUDA cores can only do FP32, and another 5376 can execute either FP32 or INT32, but not at the same time, for a total of 10752. When INT32 work is running, FP32 throughput is greatly reduced.

This could change in Ada: 7680 FP32 AND 3840 INT32, separate rather than shared = 11,520.
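A rough back-of-the-envelope sketch of that difference, using a simple model where shared lanes split their cycles between FP32 and INT32. The GA102 figures are the ones from this post; the Ada split is this thread's speculation, not a confirmed spec:

```python
# Back-of-the-envelope model of shared vs. dedicated FP32/INT32 datapaths.
# GA102 figures are from the post above; the "Ada" split is this thread's
# speculation, not a confirmed spec.

def effective_fp32_lanes(dedicated_fp32, shared_fp32_int32, int32_fraction):
    # Shared lanes spend a fraction of their cycles on INT32 work,
    # so only the remainder contributes to FP32 throughput.
    return dedicated_fp32 + shared_fp32_int32 * (1.0 - int32_fraction)

# GA102 (Ampere): 5376 FP32-only + 5376 shared FP32/INT32 = "10752 CUDA cores"
print(effective_fp32_lanes(5376, 5376, 0.30))  # ~9139: INT32 work eats into FP32 throughput

# Speculated Ada split from this thread: 7680 FP32 plus 3840 separate INT32 units.
# If the INT32 units really are separate, FP32 throughput doesn't drop under INT32 load:
print(effective_fp32_lanes(7680, 0, 0.30))     # 7680 regardless of the INT32 mix
```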
 
No, FP64 (double precision) is executed at a 1:64 rate; it is not additive, as far as I understand.

The confusion with GA102 (Ampere) is that 5376 CUDA cores can only do FP32, and another 5376 can execute either FP32 or INT32, but not at the same time, for a total of 10752. When INT32 work is running, FP32 throughput is greatly reduced.

This could change in Ada: 7680 FP32 AND 3840 INT32, separate rather than shared = 11,520.
Thank you for the clarification. I guess I mixed up FP and int terminology.
 
So, the new 400W model matches the old 450W model? Considering they moved from the "bad" 8nm Samsung node to the "great" 5nm TSMC node, it's not exactly a breathtaking result.
I'd expect either higher performance or much lower wattage. 50 W less for the same performance looks like too little to me.
Samsung's 8nm is quite a good node, much better than Nvidia fanboys like to tell everyone. Ampere has a high TDP because Ampere is inefficient.
 
Ampere has a high TDP because Ampere is inefficient.
Not quite. Ampere has a high TDP because it is doing hardware real-time raytracing, which is a VERY complex and compute-heavy type of task. When RTRT is not being performed, Ampere GPUs are good on power. Turing is/was no different. AMD's RTRT functionality is no different: turn on raytracing and power usage takes a big bump.
 
Not quite. Ampere has a high TDP because it is doing hardware real-time raytracing, which is a VERY complex and compute-heavy type of task. When RTRT is not being performed, Ampere GPUs are good on power. Turing is/was no different. AMD's RTRT functionality is no different: turn on raytracing and power usage takes a big bump.
AFAIK, TPU power graphs and perf/W graphs are done without RT enabled. I think these commenters are referring to those graphs when commenting on efficiency.
 
Samsung's 8nm is quite a good node, much better than Nvidia fanboys like to tell everyone. Ampere has a high TDP because Ampere is inefficient.
It's not that simple. Just look at the A4000. A big part of efficiency just depends on the performance you target in terms of clock speed and the yields you target. Higher clock speeds mean less efficiency. It's possible bad yields resulted in the 3000 series being relatively inefficient.

Just look at how efficient the A4000 is: less power than a 6600 XT while performance is equal to a 3060 Ti. Just the result of a bigger chip with lower clocks, possibly binned for low voltages.
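To put rough numbers behind that, here's a toy model of dynamic power, which scales roughly with frequency times voltage squared. The clocks and voltages below are illustrative guesses, not measured A4000 or 6600 XT figures:

```python
# Toy illustration of why a wide chip at low clocks beats a narrow chip pushed hard.
# Dynamic power ~ frequency * voltage^2; all numbers are made up for illustration.

def relative_dynamic_power(freq_ghz, voltage_v, base_freq=2.0, base_voltage=1.05):
    return (freq_ghz / base_freq) * (voltage_v / base_voltage) ** 2

print(relative_dynamic_power(1.56, 0.85))  # ~0.51: low clocks and low voltage, about half the power
print(relative_dynamic_power(2.60, 1.15))  # ~1.56: chasing clocks costs power quickly
```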

 
With electricity costs going up all over the world, how come the 4070 Ti draws more than double the power of a 1080 (180 W)? New generations should bring more performance at the same power. We are getting expensive space heaters instead.
Add to that 400 W of heat in your room plus the cost of air conditioning, and it's becoming quite an expensive hobby.
On second thought, the RTX 3070 Ti has awful performance per watt, especially compared to the RTX 3070 and RX 6800, so there's hope my intended upgrade, the RTX 4070, will perform close to the Ti version but at much lower power draw.
Once again it needs to be stated that if an extra 200-300 watts makes you pucker at electricity costs, you can't afford a PC anyway. Look into how much power your HVAC takes, or those new "green" electric cars.

Power draw is going up because capability is going up and there is demand for more powerful cards. Economy 101. If you don't want high power draw, you can still buy the mid-range cards and benefit from the major perf/watt improvements, or undervolt to dramatically reduce power use while only losing <5% performance.
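A toy estimate of what undervolting alone can buy, holding the clock and only dropping the voltage (dynamic power scales roughly with voltage squared; the voltages here are made up for illustration, not taken from any specific card):

```python
# Same clock, lower voltage: performance stays roughly flat, dynamic power drops with V^2.
stock_v, undervolt_v = 1.081, 0.900  # illustrative voltages, not a specific card's
savings = 1 - (undervolt_v / stock_v) ** 2
print(f"~{savings:.0%} less dynamic power at the same clock")  # ~31%
```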

This too!
Everyone always assumes Samsung's process sucked just because Nvidia OCed their parts too high to dominate said graphs out of the box. Below 2 GHz, the 8nm node is great.

What I wouldn't give to find one of those A2000s to upgrade my SFF box.

Nvidia was sandbagging with Maxwell. So much overclocking potential.
Honestly though, that was a good thing: it resulted in Maxwell gaining the reputation for being WAY more efficient than AMD, a badge it still wears today, and it also allowed us enthusiasts to have tons of fun OCing them to the moon.
 
@AleksandarK
Rumors before the 30-series launch were underestimating the CUDA count by a large number.
It's likely they're overestimating performance this time to reach an equilibrium.

The only impressive leak was the cooler, which wasn't from kopite7kimi anyway.

Look at his numbers:
[attached screenshot: kopite7kimi's leaked spec table]

He predicted half the CUDA cores for the 3090: leaked 5248 vs. the real 10496.
His 4352 for the 3080 (the same core count the 2080 Ti had) is less than half the actual 8960.
His leak is exactly half for the 3070 and 3070 Ti.
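A quick sanity check of that halving, using only the figures quoted above (the doubling lines up with Ampere's second FP32 datapath per SM partition, which the pre-launch leaks apparently didn't count):

```python
# Leaked vs. real CUDA core counts as quoted in this post.
leaked = {"RTX 3090": 5248, "RTX 3080": 4352}
real = {"RTX 3090": 10496, "RTX 3080": 8960}

for card in leaked:
    print(f"{card}: real/leaked = {real[card] / leaked[card]:.2f}")
# RTX 3090: real/leaked = 2.00 -> exactly half
# RTX 3080: real/leaked = 2.06 -> a bit less than half, as noted above
```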


And here are his crazy 20 GB rumors about the 3080 Ti:

[attached screenshot: kopite7kimi's 20 GB RTX 3080 Ti rumor]
 
Right there with you. I'd like to find a 12GB version but a 6GB would do.
I'd be fine with either. My RX 560 is getting long in the tooth, and 4 GB of VRAM just doesn't do it anymore for certain games. The 6400 and 6500 were major disappointments, and the 3050 is too power hungry. The Intel card, if it ever gets here, will likely be slower than what I have.

I've debated trying to build something into a bk 623 and get rid of a SFF case altogether.
 
You need to take a closer look.

That's an assumption on your part. The devil is in the details.
I did take a closer look, and TechPowerUp measures gaming power consumption with ray tracing disabled:

Gaming: Cyberpunk 2077 is running at 2560x1440 with Ultra settings and ray tracing disabled. We ensure the card is heated up properly, which ensures a steady-state result instead of short-term numbers that won't hold up in long-term usage.
 
I did take a closer look, and TechPowerUp measures gaming power consumption with ray tracing disabled:

Gaming: Cyberpunk 2077 is running at 2560x1440 with Ultra settings and ray tracing disabled. We ensure the card is heated up properly, which ensures a steady-state result instead of short-term numbers that won't hold up in long-term usage.
And what did you see when you looked at the RTRT enabled numbers? Hmm?

I'm not going to hand-hold anyone. Pick a card review and be diligent when reading it.
 
And what did you see when you looked at the RTRT enabled numbers? Hmm?

I'm not going to hand-hold anyone. Pick a card review and be diligent when reading it.

Don't know, because the graphs are shown with ray tracing disabled. That was the point, wasn't it?

Pick any of the card reviews you want on this site, go to the power consumption page then click on the "Power Consumption Details" portion. That's as much hand holding as I'm willing to give.
 
Wonder what they mean by "beat it"? Maybe in ray tracing using the latest DLSS.
 
There is clearly a barrier neither Nvidia nor AMD can pass: they cannot increase performance in a meaningful way on a two-year cycle without going crazy on power draw, so there's no point in beating a dead horse. Skip this generation if you don't agree with the way things are going, or deal with it.
People want big performance leaps and 4K 200 Hz on a two-year cycle at the same power draw, and they are being as unrealistic as Nvidia and AMD.

Set a wattage limit, stay within that limit no matter what they release, or shut up about it.
Don't tell us to shut up about it if you can't handle proper criticism.
 
Don't know, because the graphs are shown with ray tracing disabled. That was the point, wasn't it?

Pick any of the card reviews you want on this site, go to the power consumption page then click on the "Power Consumption Details" portion. That's as much hand holding as I'm willing to give.
Did you cherrypick this review or did you look at others?

Then there's this review where they tested everything.

Owning a 2080 myself, I can tell you the power usage goes up when RTRT is enabled vs. disabled.
 
Did you cherrypick this review or did you look at others?
It appears to me you are cherry picking here. Go to the home page, click the Reviews link, and set the category to Graphics Cards. Pick a random one, go to the Power Consumption page, and click the drop-down for "Power Consumption Testing Details". I had no idea before whether those measurements were taken with or without ray tracing, so I followed your advice earlier and decided to read closer. Found out that power consumption measurements on TPU are done without ray tracing.
 
And what did you see when you looked at the RTRT enabled numbers? Hmm?

I'm not going to hand-hold anyone. Pick a card review and be diligent when reading it.
What are you talking about? A commenter said Nvidia is inefficient. You said something about RT. I said the inefficiency referred to was general rasterization. You told me to look closer. Another commenter said RT was disabled for power tests. Please just stop!
 
It appears to me you are cherry picking here. Go to the home page, click the Reviews link, and set the category to Graphics Cards. Pick a random one, go to the Power Consumption page, and click the drop-down for "Power Consumption Testing Details". I had no idea before whether those measurements were taken with or without ray tracing, so I followed your advice earlier and decided to read closer. Found out that power consumption measurements on TPU are done without ray tracing.
Please just stop!
If you are not going to READ and understand context, take your own advice.
 
If you are not going to READ and understand context, take your own advice.
You got called out on an inaccurate statement. Reviews here on TPU show the power measurement numbers with ray tracing disabled. In a conversation / argument, what you are doing is called misdirection. And by the way, it was your advice I followed.
 
Here are kopite7kimi's leaks about the 3070 Ti.
Just see how wrong he is:


7424 cores instead of the real 6144
320-bit instead of the real 256-bit
10 GB instead of the real 8 GB

If he doesn't tweet, many YouTubers will have no way to buy bread.
 
I heard we pay too much attention to this attention whore.

Don't tell us to shut up about it if you can't handle proper criticism.

Criticize away, I don't mind; it will still not change facts and tech limitations.
 