A hint of the new king: GeForce GTX 790 and its calculated performance

Joined
Jun 9, 2011
Messages
32 (0.01/day)
Location
South Korea
(Contact : leedaeguen [at] kaist.ac.kr)


AMD and NVIDIA have been evenly matched since Q4 2013 with their newest flagship silicon, Hawaii and GK110. This standoff might last a couple more quarters, since the transition to the 20nm fabrication process, which is possibly the key to next-generation flagship GPUs such as Maxwell, is being delayed at TSMC. So it is more likely that each company's dual-GPU SKU, rather than a next-generation SKU, will be the nearest 'balance breaker' in their competition. Yesterday, NVIDIA threw its first hook for that 'balance breaker': GeForce GTX 790, a dual-GK110 SKU replacing NVIDIA's current king, GTX 690.

According to VideoCardz, the GTX 790 includes:
- 2 x 2496 CUDA cores (13 SMX per GK110 GPU, 1 fewer than TITAN / 1 more than GTX 780)
- 2 x 320-bit GDDR5 memory interface (which strongly implies that each GPU has only 40 ROPs, not 48)
- 10GB of memory
(Source : http://videocardz.com/48610/nvidia-maxwell-details-revealed-ces-2014)
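For anyone who wants to check the arithmetic behind these rumored specs, here is a small Python sketch. It relies on GK110's 192 CUDA cores per SMX and on the pairing of 8 ROPs with each 64-bit memory controller; the function and constant names are made up for illustration.

```python
# GK110 building blocks: 192 CUDA cores per SMX, and ROPs tied to
# memory controllers in clusters of 8 ROPs per 64-bit interface.
CORES_PER_SMX = 192
ROPS_PER_CLUSTER = 8
BUS_BITS_PER_CLUSTER = 64


def gk110_config(smx_count, bus_bits):
    """Derive core and ROP counts from SMX count and memory bus width.

    Hypothetical helper, not part of any real tool.
    """
    if bus_bits % BUS_BITS_PER_CLUSTER != 0:
        raise ValueError("bus width must be a multiple of 64 bits")
    clusters = bus_bits // BUS_BITS_PER_CLUSTER
    return {
        "cuda_cores": smx_count * CORES_PER_SMX,
        "rop_imc_clusters": clusters,
        "rops": clusters * ROPS_PER_CLUSTER,
    }


# One GPU of the rumored GTX 790: 13 SMX on a 320-bit bus.
per_gpu = gk110_config(smx_count=13, bus_bits=320)
print(per_gpu)
```

A 320-bit bus corresponds to 5 ROP-IMC clusters, which is where the 40 ROPs figure comes from, and 13 SMX gives the 2496 cores quoted per GPU.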

In this post I'll estimate its performance with a 'VGA calculator' I designed. The calculator is actually a simple multivariable fractional equation whose variables are the SP/TMU/ROP counts and the GPU/VRAM frequencies. Each term represents the 'simulated' GPU's shader, texture mapping, or rendering performance, and the terms are combined harmonically (that is, each term takes a '1/n' form), so we can easily see past spec-sheet bluffs to each GPU's real performance.

For example, Radeon HD 5830 has more SPs/TMUs than HD 4890 and the same number of ROPs, while their clock rates are 800 and 850MHz respectively, so it is quite natural to expect the 5830 to be faster than the 4890. But it isn't. This is because the 5830's ROP partition is (a bit) slower than the 4890's, which hurts gaming performance despite its clear advantage in SPs/TMUs. With this calculator, however, I correctly predicted that the 5830 wouldn't overwhelm the 4890. Indeed, more than 3 years on, it still works for recent GPUs such as the Volcanic Islands/Kepler families: prior to Hawaii's release, I predicted that Hawaii would be faster than GTX TITAN/780 but would not compete with a full-blown GK110, which has now been released as GTX 780 Ti. (See: http://udteam.tistory.com/535)
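The calculator's exact equation is not given in this post, so the following is only a rough Python sketch of what a harmonic '1/n' model could look like. The class and function names, the equal weights, and the round clock values (1000MHz core, 6000MHz effective memory) are assumptions for illustration, not the real formula or real clocks.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gpu:
    name: str
    sp: int          # shader processors (CUDA cores)
    tmu: int         # texture mapping units
    rop: int         # render output units
    bus_bits: int    # memory interface width
    core_mhz: float  # assumed core clock
    mem_mhz: float   # assumed effective memory data rate


def harmonic_score(gpu, weights=(1.0, 1.0, 1.0, 1.0)):
    """Combine per-stage throughputs harmonically (sum of w/n terms).

    The weights are placeholders; a real calculator would tune them.
    """
    w_sp, w_tmu, w_rop, w_mem = weights
    shader = gpu.sp * gpu.core_mhz
    texture = gpu.tmu * gpu.core_mhz
    raster = gpu.rop * gpu.core_mhz
    bandwidth = gpu.bus_bits / 8 * gpu.mem_mhz
    cost = (w_sp / shader + w_tmu / texture
            + w_rop / raster + w_mem / bandwidth)
    return 1.0 / cost


# Same hypothetical clocks for both, so only the unit counts differ.
half_790 = Gpu("half of GTX 790", 2496, 208, 40, 320, 1000.0, 6000.0)
gtx_780 = Gpu("GTX 780", 2304, 192, 48, 384, 1000.0, 6000.0)

for gpu in (half_790, gtx_780):
    print(f"{gpu.name}: {harmonic_score(gpu):.0f}")
```

Because every stage contributes a reciprocal term, the scarcest resource dominates the sum, so a part with extra shaders but fewer ROPs can still score lower. In this toy version the half GTX 790 lands below GTX 780 for exactly that reason.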


Well, enough of the lecture. The results are below:
(Assume that the GPU/VRAM clock rates remain unchanged. The GPU clock reflects the max boost frequency.)

EGbXYAw.png


▲ It's obvious that the new card bests all of its predecessors, including GTX 690, a dual-GK104 SKU. The margin over GTX 690 seems to be roughly 25%, and the card is almost half again as fast as GTX 780 Ti. In terms of SLI scaling, however, it's a bit disappointing when we compare it to actual SLI configurations. Let's look at a 2 x GTX 780 Ti config.

IJ4dnL7.png


▲ See what I mean? GTX 790 doesn't actually exhaust the full potential of two monstrous GPUs. Let's figure this out component by component. First, let's compare one half of GTX 790 to the other single-GK110 SKUs.

GSzna6e.png


▲ It seems that one half of GTX 790 doesn't even compete with GTX 780, though it has 1 more SMX. The only other difference is the ROP count and memory interface, which are bound together (8 ROPs and a 64-bit interface form one block in GK110), so it is reasonable to think the performance gap originates there. Let's try to prove this.

nQRM107.png


▲ The result above is for one half of GTX 790 plus 1 more ROP-IMC cluster (meaning +8 ROPs and a +64-bit GDDR5 memory controller). The extra ROP-IMC cluster yields an almost 12% increase in performance, so the card overtakes GTX 780 and comes very close to GTX TITAN.

Let's look at the opposite case: full-blown SPs/TMUs (the same number as GTX 780 Ti) and a cut-down ROP-IMC.

yPjjw3V.png


▲ It becomes obvious that the ROP-IMC part affects performance more than the SPs/TMUs do.
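The two what-if configurations above can be mimicked with a toy harmonic model in Python. This is a sketch under stated assumptions, not the actual calculator: the equal weighting of the four terms and the default clocks are hypothetical, so the percentages it prints should not be expected to match the charts, only the direction of the effect.

```python
def harmonic_score(sp, tmu, rop, bus_bits,
                   core_mhz=1000.0, mem_mhz=6000.0):
    """Toy model: reciprocal of summed per-stage 1/n terms.

    Equal weights and the default clocks are assumptions.
    """
    cost = (1.0 / (sp * core_mhz)
            + 1.0 / (tmu * core_mhz)
            + 1.0 / (rop * core_mhz)
            + 1.0 / (bus_bits / 8 * mem_mhz))
    return 1.0 / cost


# Baseline: one half of the rumored GTX 790 (13 SMX, 5 ROP-IMC clusters).
half_790 = harmonic_score(sp=2496, tmu=208, rop=40, bus_bits=320)

# What-if 1: add one ROP-IMC cluster (+8 ROPs, +64-bit interface).
plus_cluster = harmonic_score(sp=2496, tmu=208, rop=48, bus_bits=384)

# What-if 2: full 15 SMX like GTX 780 Ti, ROP-IMC still cut down.
full_shaders = harmonic_score(sp=2880, tmu=240, rop=40, bus_bits=320)

rop_gain = plus_cluster / half_790 - 1.0
shader_gain = full_shaders / half_790 - 1.0
print(f"+1 ROP-IMC cluster:   {rop_gain:+.1%}")
print(f"+2 SMX (full SP/TMU): {shader_gain:+.1%}")
```

Even in this simplified form, restoring the sixth ROP-IMC cluster helps more than restoring the two disabled SMX units, which is the same pattern the charts show.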

So, my conclusions are as follows:
- GTX 790 will take the crown but will not be faster than an SLI config of today's highest-end single-GPU SKU.
- GTX 790 will actually be slower than even 2 x GTX 780, let alone 2 x 780 Ti or TITAN, because of its lack of ROPs.

Well, the post is over. Thanks for reading. Have a nice day :-)
 
Jesus Christ!

And another $1000+ GPU coming up!
 
I would find this VGA highly illogical, as GK110 is already at the limit of the 28nm node. Take 2 GPUs capable of 200+ watts of power consumption and slap them on the same PCB. I think at that point you start to run into the limitations of a 12-inch PCB in terms of how much you would be able to pack onto it.
 
If it's even gonna be true then I'll say I smell another GTX 590.
 
I disagree. If it is true, I do not believe NVidia would make the same mistake twice, but I have been proven wrong many times before.

I would think they would need to have a pretty hefty PCB for this type of card though.
 
The minimum price I would expect for this would be $1499, and not many would be able to afford it. Plus, the power requirements and size of the PCB put it closer to $1699, unless they want to slow down the cores by a few hundred MHz to improve heat and power consumption, and then we are only talking about 40-60% more performance than a single overclocked card for much more $$$.

http://www.techspot.com/review/139-geforce-gtx-295/page2.html

They would be better off trying this again before putting two on a single PCB, or dropping the PLX bridge, using x8 lanes each, and requiring PCIe 2.0 as a minimum along with special drivers.
 
"I could not verify this information, thus I decided not to make a separate post, but if these specs are even slightly true then I think you should know them."

Forgive me for being a party pooper, but if NVidia intend to create a card and then sell it for more than the current 780 Ti, then they can go f*** themselves and give up trying to innovate. Innovation should also demand the clause "widely available". 10GB of VRAM is a joke. NVidia have always had less VRAM than AMD, and when they have more/equal amounts you end up paying a tonne for it.

Probably fake or horribly wrong, either way.

I'm sorry, what? Not very true (the "ALWAYS" having less VRAM). G80 and GF100 would like a word with you...
 
Could be an awesome GPU for a $1k price tag =)
 
Well, NVidia seem to be making the same play with less VRAM over the last few generations, and consistently have AIBs put out higher-VRAM models for huge price increases.
 
Pencil my bet down for $1199

It's gonna have less VRAM, 6 Gbps GDDR5, and run at just around 1GHz. Optimistically, I'd say it'll take the still-ridiculous $999 price point and push other prices down.
 
A bigger cooler, two GPU dies, twice as many VRM and circuitry, PLX bridge chip, 6GB memory.

http://www.techpowerup.com/reviews/NVIDIA/GeForce_GTX_690/

Historically they have been twice as much as the single card on which they are based, and assuming a 780 Ti die it will be $1399 minimum, and likely higher by the time retailers get done with them.
 
Horrible investment.
 
Not one to get drawn into mass speculation. Rumour says fewer cores than Titan, so you could argue for a base price similar to the 780 (non-Ti). So double that value, as opposed to the Ti version. All this being said, why bother when you can get 780 SLI... (or 290 CrossFire).
 
Fast card for those wanting to invest in such a thing, and it will probably still be quieter than a reference 290 too.
 
They should have tested against two non ref 780 OC cards, because the reality is, two of those for as little as $940, are going to be hard to beat.
 
Well I guess it's an investment in entertainment value. I can purchase a card for half the price and get the same entertainment value. I suppose the other half of the money would go towards vanity or something.
 