
NVIDIA GeForce Kepler Packs Radically Different Number Crunching Machinery

Discussion in 'News' started by btarunr, Feb 10, 2012.

  1. btarunr

    btarunr Editor & Senior Moderator Staff Member

    Joined:
    Oct 9, 2007
    Messages:
    28,742 (11.14/day)
    Thanks Received:
    13,676
    Location:
    Hyderabad, India
    NVIDIA is set to kick off its graphics processor lineup competing with AMD's Southern Islands Radeon HD 7000 series with GeForce Kepler 104 (GK104). We are learning through reliable sources that NVIDIA will implement a radically different design (by NVIDIA's standards, anyway) for its CUDA core machinery, while retaining a basic component hierarchy similar to Fermi's. The new design would ensure greater parallelism. The latest version of GK104's specifications looks like this:

    SIMD Hierarchy
    • 4 Graphics Processing Clusters (GPC)
    • 4 Streaming Multiprocessors (SM) per GPC = 16 SM
    • 96 Stream Processors (SP) per SM = 1536 CUDA cores

    TMU / Geometry Domain
    • 8 Texture Units (TMU) per SM = 128 TMUs
    • 32 Raster Operation Units (ROPs)

    Memory
    • 256-bit wide GDDR5 memory interface
    • 2048 MB (2 GB) memory amount standard

    Clocks/Other
    • 950 MHz core/CUDA core (no hot-clocks)
    • 1250 MHz actual (5.00 GHz effective) memory, 160 GB/s memory bandwidth
    • 2.9 TFLOP/s single-precision floating point compute power
    • 486 GFLOP/s double-precision floating point compute power
    • Estimated die area 340 mm²
    Source: 3DCenter.org
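
    The derived figures in the list above can be cross-checked from the unit counts and clocks. The sketch below is only a sanity check of the rumored numbers, not a statement about the final chip. Two values are assumptions that do not appear in the list: each SP is taken to retire one fused multiply-add (2 FLOPs) per clock, and the double-precision rate is taken to be 1/6 of single precision, a ratio inferred purely from the 486 GFLOP/s and 2.9 TFLOP/s figures. The function name is illustrative.

```python
# Sanity check of the rumored GK104 figures listed above.
GPCS = 4                      # Graphics Processing Clusters
SMS_PER_GPC = 4               # Streaming Multiprocessors per GPC
SPS_PER_SM = 96               # Stream Processors (CUDA cores) per SM
TMUS_PER_SM = 8               # Texture units per SM
CORE_CLOCK_GHZ = 0.950        # core clock, no hot-clocks
MEM_BUS_BITS = 256            # GDDR5 interface width
MEM_ACTUAL_MHZ = 1250         # actual memory clock
GDDR5_TRANSFERS_PER_CLOCK = 4 # 1250 MHz actual -> 5.00 GHz effective

# Assumptions (not stated in the list):
FLOPS_PER_SP_PER_CLOCK = 2    # one fused multiply-add per SP per clock
DP_TO_SP_RATIO = 1 / 6        # inferred from 486 GFLOP/s vs 2.9 TFLOP/s


def derive_gk104_specs():
    """Recompute the derived figures from the base unit counts and clocks."""
    sm_count = GPCS * SMS_PER_GPC
    cuda_cores = sm_count * SPS_PER_SM
    tmus = sm_count * TMUS_PER_SM
    effective_mtps = MEM_ACTUAL_MHZ * GDDR5_TRANSFERS_PER_CLOCK
    # bytes per transfer * million transfers per second -> MB/s, then GB/s
    bandwidth_gb_s = (MEM_BUS_BITS / 8) * effective_mtps / 1000
    sp_gflops = cuda_cores * FLOPS_PER_SP_PER_CLOCK * CORE_CLOCK_GHZ
    dp_gflops = sp_gflops * DP_TO_SP_RATIO
    return {
        "sm_count": sm_count,
        "cuda_cores": cuda_cores,
        "tmus": tmus,
        "bandwidth_gb_s": bandwidth_gb_s,
        "sp_gflops": sp_gflops,
        "dp_gflops": dp_gflops,
    }


if __name__ == "__main__":
    for name, value in derive_gk104_specs().items():
        print(f"{name}: {value}")
```

    Under those assumptions the list is internally consistent: 16 SMs, 1536 CUDA cores, 128 TMUs, 160 GB/s, roughly 2.9 TFLOP/s single precision and roughly 486 GFLOP/s double precision.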
     
  2. phanbuey

    phanbuey

    Joined:
    Nov 13, 2007
    Messages:
    5,205 (2.05/day)
    Thanks Received:
    975
    Location:
    Miami
    wow... that is definitely different...
     
  3. Live OR Die

    Live OR Die

    Joined:
    May 19, 2007
    Messages:
    3,997 (1.47/day)
    Thanks Received:
    411
    I bet your mommy always told you to eat your greens ;)
     
  4. ViperXTR

    ViperXTR

    Joined:
    Jan 31, 2011
    Messages:
    1,394 (1.02/day)
    Thanks Received:
    407
    It's looking like an AMD specification now hehe (wait, 32 ROPs? D: )
     
  5. puma99dk|

    puma99dk|

    Joined:
    Aug 29, 2005
    Messages:
    3,449 (1.03/day)
    Thanks Received:
    775
    I just hope they are serious about that 2048 MB of memory; if not, it will be a shame.
     
  6. EpicShweetness

    EpicShweetness

    Joined:
    Dec 1, 2011
    Messages:
    307 (0.29/day)
    Thanks Received:
    30
    Location:
    Ft Gordon
    These specs are definitely strange for an Nvidia chip. 1536 CUDA cores is triple that of the GTX 580, yet the fabrication process only shrinks by 30%, and GK104 is also smaller than GF110. This can only indicate a few things: a "nerf" on the CUDA core itself, or an architecture that is much more "cluster based". Very interesting, I'll be following this closely.
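
    The density argument in this post can be put into rough numbers. The sketch below compares the ideal area gain from the process shrink with the core density the rumored specs imply. Several inputs are assumptions not given in the thread: the 40 nm and 28 nm node names (consistent with the "30% reduction" mentioned above), the simplification that density scales with the square of the node ratio, and a commonly reported die area of about 520 mm² for GF110.

```python
# Rough density comparison, GF110 (GTX 580) vs the rumored GK104.
OLD_NODE_NM = 40          # assumption: Fermi process node
NEW_NODE_NM = 28          # assumption: Kepler process node
GF110_CORES = 512         # one third of 1536, per the "triple" remark
GF110_AREA_MM2 = 520      # assumption: commonly reported figure, not from the thread
GK104_CORES = 1536        # rumored
GK104_AREA_MM2 = 340      # rumored estimate


def ideal_density_gain(old_nm, new_nm):
    """Idealized density gain if area shrinks with the square of the node ratio."""
    return (old_nm / new_nm) ** 2


def observed_density_gain():
    """Ratio of CUDA cores per mm² implied by the rumored specs."""
    old_density = GF110_CORES / GF110_AREA_MM2
    new_density = GK104_CORES / GK104_AREA_MM2
    return new_density / old_density


if __name__ == "__main__":
    print(f"ideal gain from shrink: {ideal_density_gain(OLD_NODE_NM, NEW_NODE_NM):.2f}x")
    print(f"implied cores/mm² gain: {observed_density_gain():.2f}x")
```

    With these inputs the shrink alone accounts for roughly a 2x density gain, while the rumored specs imply well over 4x cores per mm², which is the gap the post attributes to simpler cores or a reorganized architecture.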
     
  7. LAN_deRf_HA

    LAN_deRf_HA

    Joined:
    Apr 4, 2008
    Messages:
    4,557 (1.90/day)
    Thanks Received:
    952
    It's a lot more shaders but they're running much slower too. Seems it'd even out on the heat front.
     
  8. ViperXTR

    ViperXTR

    Joined:
    Jan 31, 2011
    Messages:
    1,394 (1.02/day)
    Thanks Received:
    407
    just like what the HD 2000 and the present 7000 cards are doing, moar shaders but lower clocks (or rather clocks are tied with the TMU/ROP clocks)
     
  9. radrok

    radrok

    Joined:
    Oct 26, 2011
    Messages:
    2,990 (2.72/day)
    Thanks Received:
    803
    Location:
    Italy
    My massive loop is waiting for the heat :rockout:
     
  10. hardcore_gamer

    hardcore_gamer

    Joined:
    Jan 25, 2011
    Messages:
    388 (0.28/day)
    Thanks Received:
    174
    Location:
    Fabry Perot cavity,AlGaAs-GaAs Heterojunction
    Die size is very close to that of 7970 (365mm2). Interesting:cool:
     
  11. radarblade New Member

    Joined:
    Feb 8, 2012
    Messages:
    48 (0.05/day)
    Thanks Received:
    1
    Location:
    In the middle of Nowhere
    Seems like Nvidia's pretty prepped up to wipe AMD off the slate! But what would be the TDP on these things? Preferably lower than the earlier 480 and 580 heaters. :)
     
  12. theoneandonlymrk

    theoneandonlymrk

    Joined:
    Mar 10, 2010
    Messages:
    3,411 (2.01/day)
    Thanks Received:
    572
    Location:
    Manchester uk
    Interested in how this is going to be 50% faster than a 7970; they seem similar in shader layout.
     
  13. NC37

    NC37

    Joined:
    Oct 30, 2008
    Messages:
    1,198 (0.55/day)
    Thanks Received:
    266
    The end of NV's monolithic GPU era is at hand...was about to say...Bout freaken time! ATI was slower at first when they switched but I knew eventually NV would have to change too.

    Very interested to see how well NV does at ATI's own game.
     
  14. gaximodo

    gaximodo New Member

    Joined:
    Sep 7, 2008
    Messages:
    268 (0.12/day)
    Thanks Received:
    62
    Location:
    Canberra Australia
    this isn't supposed to be NV's flagship anywayz.
     
  15. Xaser04

    Xaser04

    Joined:
    May 15, 2007
    Messages:
    743 (0.27/day)
    Thanks Received:
    106
    GK104 so GTX560Ti replacement (ish).

    Considering this is 1536 shaders it would be logical to assume that the full fat model would have 2048 shaders, after all the GTX560TI was - in simplistic terms - roughly 75% of a GTX580.
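
    The 2048-shader guess follows directly from the 75% ratio. A minimal sketch of that extrapolation is below, assuming the GTX 560 Ti to GTX 580 core ratio (384 to 512) carries over to Kepler unchanged, which is this post's speculation rather than a known fact. The function name is illustrative.

```python
# Extrapolate a hypothetical full-size Kepler part from the Fermi ratio.
GTX_560_TI_CORES = 384
GTX_580_CORES = 512
GK104_CORES = 1536  # rumored


def projected_flagship_cores(midrange_cores, midrange_to_flagship_ratio):
    """Core count of a flagship if the mid-range part is a fixed fraction of it."""
    return round(midrange_cores / midrange_to_flagship_ratio)


fermi_ratio = GTX_560_TI_CORES / GTX_580_CORES  # 0.75

if __name__ == "__main__":
    print(f"Fermi mid-range/flagship ratio: {fermi_ratio:.2f}")
    print(f"projected flagship cores: {projected_flagship_cores(GK104_CORES, fermi_ratio)}")
```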

    The shader count itself is very interesting.

    The increase in shaders (384-1536 if we assume a GTX560TI replacement) would suggest that each Kepler shader is less complex than its Fermi contemporary.

    If we also assume similar performance to the HD7950 (doesn't seem too unrealistic) then clock for clock GCN and Kepler could be quite evenly matched (HD7950 has more shaders but a lower core clock).

    Should be very interesting.
     
  16. Crap Daddy

    Crap Daddy

    Joined:
    Oct 29, 2010
    Messages:
    2,744 (1.88/day)
    Thanks Received:
    1,046
    This is not going to be 50% faster than 7970. Judging by the specs it should fall between 7950 and 7970 at a rumored 300$.
    GK110 will probably be the Tahiti killer. At a price...
     
  17. Red_Machine

    Red_Machine

    Joined:
    Oct 1, 2010
    Messages:
    1,733 (1.16/day)
    Thanks Received:
    380
    Location:
    Marlow, ENGLAND
    At this rate, I will feel compelled to replace my 580. GK110 will likely be 70-80% faster...
     
  18. pantherx12

    pantherx12 New Member

    Joined:
    Jan 2, 2009
    Messages:
    9,714 (4.57/day)
    Thanks Received:
    1,699
    Location:
    ENGLAND-LAND-LAND
    I reckon it will be half that, at best. :p
     
  19. Benetanegia

    Benetanegia New Member

    Joined:
    Sep 11, 2009
    Messages:
    2,683 (1.43/day)
    Thanks Received:
    694
    Location:
    Reaching your left retina.
    I assume these specs have been judged legit, since Btarunr did post them, unlike most others.

    Ah crap, they are too different; impossible to guesstimate the performance based on them (I don't know how other people are so sure). I'll try to make my analysis anyway.

    At a first glance it looks like they doubled GF104's shader domain (128 TMU, 4 GPCs, etc.) and then doubled the shader amount per SM because abandoning hot clocks allows for that. Performance wise the end result should be similar.

    Based on die size this chip must contain twice the amount of transistors on GF104, while retaining the 256 bit bus, so there's no compelling reason to assume the shaders are any less capable than they were in Fermi. They could have just as easily gone with 768 SPs and hot-clocks within the same die size.

    And finally efficiency. That's the key to knowing the performance. We don't know how well they will be able to use all those SP. I'd assume they are using 6x16 SP wide superscalar shader multiprocessors, but with how many schedulers? GF104 had 2. So now they have 4? Or since shaders run at half the speed the schedulers are just issuing the same amount of ops-per-cycle? (in reality cycles-per-op)

    So many questions but I had fun. Based on raw specs this chip has the potential to crush any other card on the market, think 2x GTX560 Ti, at least at 1080/1200p. But efficiency/scaling is the key factor and that's completely unknown to us.

    EDIT: As you can see, I changed my mind completely as I was writing this post. I first thought they were very different and came to realize that they are pretty much the same. If you think about Fermi based GF104/114 as a 768 SP chip with no hot-clocks, they just doubled the amount of GPCs.
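
    The hot-clock equivalence used in this post can be written out as simple arithmetic. The sketch below treats raw shader throughput as SP count times shader clock, and assumes a Fermi-style hot clock of exactly twice the core clock. It deliberately ignores scheduler count and utilization, which the post flags as the real unknowns. Function names and the SP·MHz unit are illustrative.

```python
# Raw shader throughput with and without hot clocks, in SP * MHz.
CORE_CLOCK_MHZ = 950
HOT_CLOCK_MULTIPLIER = 2  # assumption: shader clock = 2x core clock, Fermi-style


def raw_throughput(sp_count, core_clock_mhz, shader_multiplier=1):
    """SP count times shader clock; says nothing about real-world efficiency."""
    return sp_count * core_clock_mhz * shader_multiplier


def equivalent_sps_without_hot_clock(sp_count, shader_multiplier):
    """How many core-clocked SPs match hot-clocked SPs in raw throughput."""
    return sp_count * shader_multiplier


if __name__ == "__main__":
    wide = raw_throughput(1536, CORE_CLOCK_MHZ)                       # rumored GK104
    narrow = raw_throughput(768, CORE_CLOCK_MHZ, HOT_CLOCK_MULTIPLIER)
    print(wide == narrow)  # same raw throughput either way
    # GF104/114 viewed as a chip without hot clocks, then doubled:
    gf104_equiv = equivalent_sps_without_hot_clock(384, HOT_CLOCK_MULTIPLIER)
    print(gf104_equiv, gf104_equiv * 2)
```

    On raw numbers the two designs come out identical, which is why the post concludes that the layout looks like a doubled GF104 rather than something fundamentally new.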
     
    Last edited: Feb 10, 2012
  20. Filiprino New Member

    Joined:
    Dec 12, 2011
    Messages:
    77 (0.07/day)
    Thanks Received:
    16
    NVIDIA seems to have come up with something very similar to AMD's GCN. But after all it's NVIDIA and the successor to Fermi, so we'll have to wait and see performance numbers.
     
  21. General Lee New Member

    Joined:
    Apr 27, 2011
    Messages:
    51 (0.04/day)
    Thanks Received:
    5
    I wouldn't take them without a big grain of salt, but it's always fun to do some what iffing.

    The specs look similar to what AMD has now, so given the estimated die size and unit counts, I'd say it would reach 580/7950 level performance. I doubt they'll price it at 300$ if 7950 is at 470$. More likely it's at best 50$ cheaper; that's enough to get the ball rolling. It's not really difficult to undercut the 7900 series in price, so regardless of performance it shouldn't be hard for Nvidia to claim a perf/$ crown simply because 7900 is sold at a premium currently. Of course AMD should respond to that, and I think this is the scenario we all hope for.
     
  22. xenocide

    xenocide

    Joined:
    Mar 24, 2011
    Messages:
    2,149 (1.63/day)
    Thanks Received:
    463
    Location:
    Burlington, VT
    A lot of people are holding out for Nvidia just to see prices level out. If they sell a card on par with the 7950 for $100 less, they'll make up the difference in volume. I guarantee they would sell twice as many cards as if they priced it around $450.
     
  23. jamsbong New Member

    Joined:
    Mar 17, 2010
    Messages:
    83 (0.05/day)
    Thanks Received:
    7
    Confirmed Nvidia is doing an ATI!
    The specs look so identical that if I rename these specs as say....

    HD7870:
    256bit GDDR5 2GB memory
    1536 CU, 128TMU, 32ROP, small 340mm^2 die size, no hot clocks.

    It looks totally believable! Has Nvidia been hiring lots of ATI engineers? Or have they reverse engineered ATI's Cayman?

    Jokes aside, some rational observations:
    The specs themselves look like a mid-to-high-end card, which will be very competitive price-wise as it uses 256-bit memory and a small die. I won't be surprised if it is only faster than Cayman by 10-20%. It will be on par with GTX580 at best.
    I believe Nvidia is working on a high end card which has yet to show itself.
     
  24. Crap Daddy

    Crap Daddy

    Joined:
    Oct 29, 2010
    Messages:
    2,744 (1.88/day)
    Thanks Received:
    1,046
    Charlie seems to be very into Kepler these days. He says the ball is rolling :

    "Reports coming in from the far east say that those high up in the priority list started getting Kepler cards in various guises early this week, possibly late last. The number of sightings from sources that SemiAccurate trusts has been going up almost exponentially over the past few days, and will probably keep doing so for a bit."

    He concludes:

    "If things go as normal, it takes 4-6 weeks from AIB sampling to cards on the shelves. This would mean late March or early April, just like we have been saying for weeks."
     
  25. arnoo1

    Joined:
    Oct 15, 2009
    Messages:
    746 (0.41/day)
    Thanks Received:
    124
    Location:
    Netherlands
    Seriously, 1536 shaders? That's 3x more than Fermi.
     
