techPowerUp! Forums

Old Oct 17, 2012, 03:40 AM   #1
btarunr
Editor & Senior Moderator
Join Date: Oct 2007
Location: Hyderabad, India
Posts: 14,983 (7.29/day)
Thanks: 788
Thanked 12,911 Times in 5,655 Posts

Tesla K20 GPU Compute Processor Specifications Released

Specifications of NVIDIA's Tesla K20 GPU compute processor, which was launched way back in May, have finally been disclosed. We've known since then that the K20 is based on NVIDIA's large GK110 GPU, a chip that has not yet powered a GeForce graphics card. Apparently, NVIDIA is leaving some room on the silicon, which lets it harvest dies more effectively. According to a specifications sheet compiled by Heise.de, the Tesla K20 will feature 13 SMX units, compared to the 15 available on the GK110 silicon.

With 13 streaming multiprocessor (SMX) units, the K20 will be configured with 2,496 CUDA cores (as opposed to the 2,880 physically present on the chip). The core will be clocked at 705 MHz, yielding single-precision floating-point performance of 3.52 TFLOP/s and double-precision floating-point performance of 1.17 TFLOP/s. The card packs 5 GB of GDDR5 memory, with memory bandwidth of 200 GB/s. Dynamic Parallelism, Hyper-Q, and GPUDirect with RDMA are part of the new feature set. The TDP of the GPU is rated at 225 W and, understandably, it uses a combination of 6-pin and 8-pin PCI-Express power connectors. Built on the 28 nm process, the GK110 packs a whopping 7.1 billion transistors.
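
As a rough check, the throughput figures above follow from core count and clock. The sketch below assumes 192 CUDA cores per SMX (2,496 / 13), two floating-point operations per core per cycle (one fused multiply-add), and a double-precision rate of one third of single precision, which is simply the ratio implied by 1.17 / 3.52. The function name is illustrative, not from any NVIDIA tool.

```python
# Peak-throughput arithmetic for the quoted Tesla K20 figures.
# Assumptions (not stated in the article): 2 FLOPs per core per cycle
# (one fused multiply-add) and a 1/3 double-precision rate.

CORES_PER_SMX = 2496 // 13  # 192, derived from the article's numbers


def peak_tflops(smx_units, clock_mhz, flops_per_cycle=2):
    """Theoretical single-precision peak in TFLOP/s."""
    cores = smx_units * CORES_PER_SMX
    return cores * flops_per_cycle * clock_mhz * 1e6 / 1e12


k20_sp = peak_tflops(13, 705)         # single precision
k20_dp = k20_sp / 3                   # assumed 1/3 DP rate
full_chip_cores = 15 * CORES_PER_SMX  # all 15 SMX units enabled

print(f"K20 single precision: {k20_sp:.2f} TFLOP/s")
print(f"K20 double precision: {k20_dp:.2f} TFLOP/s")
print(f"Cores on a full GK110: {full_chip_cores}")
```

Under these assumptions the results round to the 3.52 and 1.17 TFLOP/s quoted above, and the full chip comes out at 2,880 cores.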



Source: Heise.de

Last edited by btarunr; Oct 17, 2012 at 03:56 AM.
Old Oct 17, 2012, 04:05 AM   #2
TheGuruStud
1000 Posts
Join Date: Sep 2007
Location: Police/Nanny State of America
Posts: 1,394 (0.67/day)
Thanks: 45
Thanked 142 Times in 109 Posts

So, buy 5870s. Got it :P
Old Oct 17, 2012, 04:14 AM   #3
sergionography
200 Posts
Join Date: Feb 2012
Posts: 244 (0.52/day)
Thanks: 63
Thanked 29 Times in 27 Posts

In other words, it can almost match Tahiti.
Old Oct 17, 2012, 05:00 AM   #4
HumanSmoke
500 Posts
Join Date: Sep 2011
Posts: 582 (0.93/day)
Thanks: 132
Thanked 137 Times in 105 Posts

Seems like a repeat of GF100/110. Hardly surprising if the die is 500 mm^2+.

The first Fermi Teslas (M2050/M2070) out of the gate were basically GTX 470 spec. The M2090, released more recently, is pretty much a GTX 580.

It would be interesting to know whether these Teslas are the same SKUs that ORNL is taking delivery of, or whether they are higher spec, since Oak Ridge seemed to be the high-profile launch customer.
Quote:
Originally Posted by sergionography
In other words, it can almost match Tahiti.
Any comparison probably depends on actual performance efficiency rather than hypothetical. Unless you know what K20 brings to the table, a theoretical comparison is largely useless.

BTW: The original site no longer features any specifications.

Last edited by HumanSmoke; Oct 17, 2012 at 05:20 AM.
Old Oct 17, 2012, 05:05 AM   #5
Solaris17
Creator Solaris Utility DVD
Join Date: Aug 2005
Location: Reinacting scenes from platoon with Charlie Sheen
Posts: 13,708 (4.83/day)
Thanks: 4,366
Thanked 3,295 Times in 2,311 Posts

those cores.....my god.
Old Oct 17, 2012, 05:47 AM   #6
[H]@RD5TUFF
Eligible for custom title
Join Date: Nov 2009
Location: San Diego, CA
Posts: 5,589 (4.34/day)
Thanks: 1,825
Thanked 1,710 Times in 1,431 Posts

do want
Old Oct 17, 2012, 07:54 AM   #7
bogami
25 Posts
Join Date: Jan 2012
Location: Slovenia
Posts: 67 (0.14/day)
Thanks: 0
Thanked 2 Times in 2 Posts

Estimated 20 PFLOP/s peak!!! And 3.52 TFLOP/s normal, 1.17 TFLOP/s in double precision.
Nice peak.
I wish for 20 PFLOP/s on the next GPU.
Old Oct 17, 2012, 07:56 AM   #8
The Von Matrices
200 Posts
Join Date: Dec 2010
Location: State College, PA, USA
Posts: 436 (0.49/day)
Thanks: 118
Thanked 107 Times in 78 Posts

5GB of memory? That's not evenly divisible by the 384-bit memory bus it was rumored to have. Has it been reduced to 320-bit, which could produce an even 5GB?
Old Oct 17, 2012, 08:54 AM   #9
HumanSmoke
500 Posts
Join Date: Sep 2011
Posts: 582 (0.93/day)
Thanks: 132
Thanked 137 Times in 105 Posts

According to the original info available on the site, the card they listed the specification for did have one memory controller inactive, so yes, a 320-bit memory bus. The full-specification card is 384-bit with 6, 12 and possibly 24 GB.
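
A quick way to sanity-check the 320-bit claim against the 200 GB/s quoted in the news post is to work out the per-pin data rate each bus width would need. This is only a sketch: the per-pin rate is not given anywhere in the thread, it is derived here, and the 384-bit result assumes a hypothetical full card running its memory at the same rate.

```python
# Memory bandwidth arithmetic: GB/s = bus width (bits) * Gbit/s per pin / 8.
# The per-pin rates below are derived, not quoted from any spec sheet.


def bandwidth_gb_s(bus_width_bits, gbit_per_pin):
    """Peak memory bandwidth in GB/s."""
    return bus_width_bits * gbit_per_pin / 8


def implied_pin_rate(bandwidth, bus_width_bits):
    """Per-pin data rate (Gbit/s) needed to reach a given bandwidth."""
    return bandwidth * 8 / bus_width_bits


rate_320 = implied_pin_rate(200, 320)  # rate needed on a 320-bit bus
rate_384 = implied_pin_rate(200, 384)  # rate needed on a 384-bit bus

# Hypothetical: a full 384-bit card at the same per-pin rate.
full_card = bandwidth_gb_s(384, rate_320)

print(f"320-bit needs {rate_320:.2f} Gbit/s per pin")
print(f"384-bit needs {rate_384:.2f} Gbit/s per pin")
print(f"384-bit at the 320-bit rate: {full_card:.0f} GB/s")
```

A round per-pin figure on the 320-bit bus versus an odd one on 384-bit is consistent with the inactive-controller reading, though it does not prove it.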
Old Oct 17, 2012, 09:21 AM   #10
btarunr
Editor & Senior Moderator
Join Date: Oct 2007
Location: Hyderabad, India
Posts: 14,983 (7.29/day)
Thanks: 788
Thanked 12,911 Times in 5,655 Posts

Quote:
Originally Posted by The Von Matrices View Post
5GB of memory? That's not evenly divisible by the 384-bit memory bus it was rumored to have. Has it been reduced to 320-bit, which could produce an even 5GB?
Mix matching. Just like 2 GB is made possible on 192-bit.
Old Oct 17, 2012, 09:30 AM   #11
Prima.Vera
1000 Posts
Join Date: Sep 2011
Location: Antagonia
Posts: 1,367 (2.21/day)
Thanks: 228
Thanked 179 Times in 120 Posts

LOL. 7 billion transistors! I remember that my old 3dfx Voodoo 3 had 7 million transistors and was the fastest when released. )))
Old Oct 17, 2012, 09:39 AM   #12
The Von Matrices
200 Posts
Join Date: Dec 2010
Location: State College, PA, USA
Posts: 436 (0.49/day)
Thanks: 118
Thanked 107 Times in 78 Posts

Quote:
Originally Posted by btarunr View Post
Mix matching. Just like 2 GB is made possible on 192-bit.
True, that is possible. But would it really be done on a high-end compute card where consistent and predictable performance is important? It would be a headache for developers to have to track which addresses they write and determine which data should go in the more or less interleaved parts of the memory space.
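
To make the "more or less interleaved parts" concern concrete, here is a minimal sketch of how an address space splits into regions when memory controllers carry unequal capacities. The 512/512/1024 MB split for the 2 GB on 192-bit case is a hypothetical illustration, as is the assumption of 64-bit controllers; neither is confirmed in this thread.

```python
# Sketch: split an asymmetric memory configuration into interleave regions.
# Each region is (size in MB, effective bus width in bits).
# Assumption: every controller is 64 bits wide; capacities are hypothetical.


def interleave_regions(capacities_mb, bits_per_controller=64):
    """Return regions from widest to narrowest interleave."""
    regions = []
    remaining = sorted(capacities_mb)
    filled = 0  # depth (MB per controller) already assigned to a region
    while remaining:
        smallest = remaining[0]
        depth = smallest - filled
        if depth > 0:
            active = len(remaining)  # controllers that still have free space
            regions.append((depth * active, bits_per_controller * active))
            filled = smallest
        remaining.pop(0)  # this controller is now full
    return regions


# Hypothetical 2 GB on 192-bit: two controllers with 512 MB, one with 1 GB.
mixed = interleave_regions([512, 512, 1024])

# Uniform case: five controllers with 1 GB each (5 GB on 320-bit).
uniform = interleave_regions([1024] * 5)

print(mixed)
print(uniform)
```

In the mixed case the last 512 MB sits behind a single controller, which is the slower region described above; a uniform layout yields one region at full width.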
Old Oct 17, 2012, 09:53 AM   #13
Maban
1000 Posts
Join Date: Mar 2008
Location: Minnesota
Posts: 1,925 (1.01/day)
Thanks: 805
Thanked 554 Times in 362 Posts

It's probably twenty 256MB chips on a 320-bit bus.
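
The chip count can be checked with simple arithmetic. This sketch assumes 32-bit GDDR5 channels with two chips sharing each channel (16 bits apiece, often called clamshell mode); that layout is an assumption for illustration, not something the thread confirms for the K20.

```python
# Sketch: chip count and capacity from bus width and chip density.
# Assumptions: 32-bit channels, two chips per channel (clamshell mode).


def memory_layout(bus_width_bits, chip_mb, chips_per_channel=2, channel_bits=32):
    """Return (number of chips, total capacity in MB)."""
    channels = bus_width_bits // channel_bits
    chips = channels * chips_per_channel
    return chips, chips * chip_mb


chips_320, total_320 = memory_layout(320, 256)
chips_384, total_384 = memory_layout(384, 256)

print(f"320-bit: {chips_320} chips, {total_320 / 1024:.0f} GB")
print(f"384-bit: {chips_384} chips, {total_384 / 1024:.0f} GB")
```

With the same chips, the 384-bit case lands on the 6 GB mentioned earlier for the full-specification card.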
Old Oct 17, 2012, 10:08 AM   #14
btarunr
Editor & Senior Moderator
Join Date: Oct 2007
Location: Hyderabad, India
Posts: 14,983 (7.29/day)
Thanks: 788
Thanked 12,911 Times in 5,655 Posts

Quote:
Originally Posted by The Von Matrices View Post
True, that is possible. But would it really be done on a high-end compute card where consistent and predictable performance is important? It would be a headache for developers to have to track which addresses they write and determine which data should go in the more or less interleaved parts of the memory space.
Low-level video memory management is handled by API > CUDA > driver. Apps are oblivious to that. Apps are only told that there's 5 GB of memory, and to deal with it.
Old Oct 17, 2012, 02:11 PM   #15
largon
2000 Posts
Join Date: May 2005
Location: Tre, Suomi Finland
Posts: 2,696 (0.92/day)
Thanks: 1
Thanked 437 Times in 335 Posts

That die shot definitely has 384 bits' worth of memory bus...
Old Oct 17, 2012, 03:22 PM   #16
T4C Fantasy
CPU & GPU DB Maintainer
Join Date: May 2012
Location: USA, Rhode Island
Posts: 873 (2.28/day)
Thanks: 208
Thanked 366 Times in 215 Posts

http://www.techpowerup.com/gpudb/564...Tesla_K20.html
Old Oct 17, 2012, 04:55 PM   #17
Xzibit
500 Posts
Join Date: Apr 2012
Posts: 609 (1.56/day)
Thanks: 22
Thanked 95 Times in 74 Posts

Quote:
Originally Posted by HumanSmoke View Post
Any comparison probably depends on actual performance efficiency rather than hypothetical. Unless you know what K20 brings to the table, a theoretical comparison is largely useless.
In case you didn't know, Mark Harris points out that he works for Nvidia.

So you might want to check who runs the sites you're linking to if you want to link to unbiased information.

It would be like linking to sites/blogs run by AMD employees to make a point or further a viewpoint about an AMD product.

Just silly.
Old Oct 17, 2012, 09:04 PM   #18
HumanSmoke
500 Posts
Join Date: Sep 2011
Posts: 582 (0.93/day)
Thanks: 132
Thanked 137 Times in 105 Posts

Quote:
Originally Posted by Xzibit View Post
In case you didn't know, Mark Harris points out that he works for Nvidia.
The report is a scientific paper published by the University of Aizu. It has nothing to do with Nvidia. Take your useless trolling elsewhere.
Old Oct 17, 2012, 09:06 PM   #19
cadaveca
My name is Dave
Join Date: Apr 2006
Location: The Great White North
Posts: 10,779 (4.14/day)
Thanks: 4,504
Thanked 5,240 Times in 3,213 Posts

woah, how'd i miss this. Thanks for bumping, Smoke!

Old Oct 17, 2012, 10:26 PM   #20
Xzibit
500 Posts
Join Date: Apr 2012
Posts: 609 (1.56/day)
Thanks: 22
Thanked 95 Times in 74 Posts

Quote:
Originally Posted by HumanSmoke View Post
The report is a scientific paper published by the University of Aizu. It has nothing to do with Nvidia. Take your useless trolling elsewhere
Talk about idiot fanboyism.

That site is run by Mark Harris, an Nvidia employee. Are you so naive as to think he's going to post unbiased research links on his site/blog?
Nvidia would find a way to fire him in a second if he posted links to research papers that put Nvidia in a bad light.

It only took me one mouse click to find out he was an Nvidia employee. Come on now. Who's trolling now?

At least show both sides, or attempt to, so you won't seem like an Nvidia cheerleader.

Quote:
The performance of DGEMM in Fermi using this algorithm is
shown in Figure 3, along with the DGEMM performance from CUBLAS 3.1.
Note that the theoretical peak of the Fermi, in this case a C2050, is 515 GFlop/s
in double precision (448 cores × 1.15 GHz × 1 instruction per cycle). The kernel
described achieves up to 58% of that peak.
That's from a study by Oak Ridge National Laboratory along with the University of Tennessee and the University of Manchester in the UK.

58% is lower than 90% in DGEMM. Maybe Kepler GK100/110 has a 34% jump, who knows, but the chip on the GTX 280 was only 34% in DGEMM.

What do I know, though. I would think Oak Ridge National Laboratory does, since they use the darn things.
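
The peak figure in the quoted passage can be reproduced directly from its own formula, and the 58% then translates into a sustained rate. This is only a sketch using the numbers quoted above; the function names are illustrative.

```python
# Reproduce the C2050 double-precision peak from the quoted formula,
# then apply the 58% DGEMM efficiency reported in the same passage.


def peak_gflops(cores, clock_ghz, instr_per_cycle):
    """Theoretical peak in GFLOP/s, as counted in the quoted study."""
    return cores * clock_ghz * instr_per_cycle


def efficiency(achieved_gflops, peak):
    """Fraction of theoretical peak actually reached."""
    return achieved_gflops / peak


c2050_peak = peak_gflops(448, 1.15, 1)
c2050_dgemm = 0.58 * c2050_peak

print(f"C2050 peak:  {c2050_peak:.1f} GFLOP/s")
print(f"C2050 DGEMM: {c2050_dgemm:.1f} GFLOP/s at 58% efficiency")
```

So "58% of peak" means roughly 300 GFLOP/s sustained in DGEMM against the advertised 515 GFLOP/s.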

Last edited by Xzibit; Oct 17, 2012 at 10:33 PM.
Old Oct 17, 2012, 10:57 PM   #21
HumanSmoke
500 Posts
Join Date: Sep 2011
Posts: 582 (0.93/day)
Thanks: 132
Thanked 137 Times in 105 Posts

Quote:
Originally Posted by Xzibit View Post
Talk about idiot fanboyism.
Sure - I'll use your quotes (and mine since you obviously can't RTFP) as examples
Quote:
Originally Posted by Xzibit View Post
That's from a study by Oak Ridge National Laboratory along with...
Yup. Which just goes to prove that real-world and theoretical numbers differ. Which is exactly as I noted. Likewise, I made no assumption based upon a part whose performance is unknown... or do you have access to Kepler information that everyone outside of Nvidia and HPC projects doesn't?
Quote:
Unless you know what K20 brings to the table, a theoretical comparison is largely useless.
So what is the DGEMM efficiency of Kepler?
All I see here is a brief synopsis of Fermi
And of course, at no point did I make an AMD vs Nvidia comparison- quite the opposite in fact
Quote:
Any comparison probably depends on actual performance efficiency rather than hypothetical
Get back under your bridge Xzibitroll - I'm sick of having to explain simple compound sentences to you.
Old Oct 17, 2012, 11:39 PM   #22
T4C Fantasy
CPU & GPU DB Maintainer
Join Date: May 2012
Location: USA, Rhode Island
Posts: 873 (2.28/day)
Thanks: 208
Thanked 366 Times in 215 Posts

Quote:
Originally Posted by Xzibit View Post
Talk about idiot fanboyism.

That site is run by Mark Harris, an Nvidia employee. Are you so naive as to think he's going to post unbiased research links on his site/blog?
Nvidia would find a way to fire him in a second if he posted links to research papers that put Nvidia in a bad light.

It only took me one mouse click to find out he was an Nvidia employee. Come on now. Who's trolling now?

At least show both sides, or attempt to, so you won't seem like an Nvidia cheerleader.

That's from a study by Oak Ridge National Laboratory along with the University of Tennessee and the University of Manchester in the UK.

58% is lower than 90% in DGEMM. Maybe Kepler GK100/110 has a 34% jump, who knows, but the chip on the GTX 280 was only 34% in DGEMM.

What do I know, though. I would think Oak Ridge National Laboratory does, since they use the darn things.
http://www.techpowerup.com/gpudb/923...sla_C2050.html

The previous-gen NVIDIA architecture calculates floating-point throughput from the shader clock, so the C2050 would be about 1 TFLOP/s in single precision.
Old Oct 18, 2012, 12:13 AM   #23
Xzibit
500 Posts
Join Date: Apr 2012
Posts: 609 (1.56/day)
Thanks: 22
Thanked 95 Times in 74 Posts

Quote:
Originally Posted by T4C Fantasy View Post
http://www.techpowerup.com/gpudb/923...sla_C2050.html

previous gen NVidia architecture calculates floating points by shader clock so the C2050 would be 1Tflop of single precision
Those tests are done in double precision. For single precision it would be SGEMM.
The C2050 is 515 GFlop/s in double precision, so it only reaches 58% of what is advertised.

Kepler would have to make up a lot of ground in efficiency.

The point I was trying to make was...

Pointing to 90% efficiency for Tahiti in DGEMM as if it's a bad thing, especially from the site/blog of an Nvidia employee.
As compared to what? Nvidia's Fermi at 58% efficiency in DGEMM? That Nvidia employee doesn't have a link to that on his site. Wonder why?
Even if Tahiti ran at 58%, it would still be twice as fast in DGEMM compared to Fermi.

Given that the K20 is similar in spec to the W9000 and W8000, it would have to bring its efficiency up in such a comparison.
Maybe the K20 has better efficiency, but when someone says "hey look, AMD can only do 90%" and fails to mention that Nvidia only does 58%, that's kind of cheerleading to me.

We need to see Kepler's DGEMM efficiency to see what percentage of its advertised specs it reaches.

Update:
Nvidia's marketing slides put the DGEMM efficiency of the K20 at 80% and Fermi at 60-65%. So if Oak Ridge National Laboratory puts it 2% shy of 60%, I would say the window would be 78-80% efficiency for the K20. So we are more than likely going to see a draw between the K20 and W9000 in DGEMM if the marketing slides' 80% efficiency is met.
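
The "draw" argument comes down to peak throughput times efficiency. The sketch below tabulates that using only figures that appear in this thread: the peak double-precision numbers from the news post and later replies, and the efficiency percentages as claimed by the posters or by marketing material. None of these are measured K20 results.

```python
# Sustained DGEMM estimate = peak DP GFLOP/s * claimed efficiency.
# All inputs are figures quoted in the thread, not independent measurements.

cards = {
    "Tesla K20": (1170, 0.80),      # 1.17 TFLOP/s peak, 80% marketing claim
    "FirePro W9000": (1000, 0.90),  # 1 TFLOP/s peak, 90% cited for Tahiti
    "Tesla C2050": (515, 0.58),     # 515 GFLOP/s peak, 58% from the study
}


def sustained_gflops(peak, eff):
    """Estimated sustained DGEMM rate in GFLOP/s."""
    return peak * eff


results = {name: sustained_gflops(*spec) for name, spec in cards.items()}

for name, gflops in sorted(results.items(), key=lambda kv: -kv[1]):
    print(f"{name:14s} {gflops:7.1f} GFLOP/s")

# "Even if Tahiti ran at 58%": compare peaks at equal efficiency.
same_eff_ratio = sustained_gflops(1000, 0.58) / sustained_gflops(515, 0.58)
print(f"Tahiti vs Fermi at equal efficiency: {same_eff_ratio:.2f}x")
```

On these inputs the K20 and W9000 land within a few percent of each other, and the "twice as fast" remark corresponds to the ratio of the two peak figures.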

Last edited by Xzibit; Oct 18, 2012 at 01:56 AM.
Old Oct 18, 2012, 03:27 AM   #24
HumanSmoke
500 Posts
Join Date: Sep 2011
Posts: 582 (0.93/day)
Thanks: 132
Thanked 137 Times in 105 Posts

Quote:
Originally Posted by Xzibit View Post
Update:
Nvidia's marketing slides put the DGEMM efficiency of the K20 at 80% and Fermi at 60-65%.
As per usual the troll can't even parse a sentence without altering the content to suit its needs:
Quote:
Kepler GK110 will provide over 1 TFlop of double precision throughput with greater than 80% DGEMM efficiency
Nvidia whitepaper May 2012. (pdf)
Still, coming from someone who openly admits to lying, and up until recently didn't even know the difference between a 3D rendering card and a math co-processor, it's hardly surprising.
Quote:
Originally Posted by Xzibit View Post
I lied i just wanted to
Keep up with the straw man AMD vs Nvidia bullshit and the hypothetical numbers game. I'll stand by my preference for real world testing*
Quote:
Originally Posted by HumanSmoke View Post
Any comparison probably depends on actual performance efficiency rather than hypothetical. Unless you know what K20 brings to the table, a theoretical comparison is largely useless.
*By your reasoning the AMD FirePro W9000 (3.99 TF SP, 1 TF DP) should be four times faster than a Quadro 6000 (1 TF SP, 515 GF DP)...after all, numbers don't lie right?
No...
No...
No

Last edited by HumanSmoke; Oct 18, 2012 at 03:59 AM.
Old Oct 18, 2012, 03:36 AM   #25
Xzibit
500 Posts
Join Date: Apr 2012
Posts: 609 (1.56/day)
Thanks: 22
Thanked 95 Times in 74 Posts

Quote:
Originally Posted by HumanSmoke View Post
As per usual the troll can't even parse a sentence without altering the content:

Nvidia whitepaper May 2012. (pdf)
Now we are taking marketing slides as facts. Guess that doesn't surprise me.

This coming from the idiot who didn't even know who ran GPGPU.ORG.

Mark Harris,
Chief Technologist, GPU Computing @ Nvidia

I thought we wanted hard numbers, not marketing B.S.

Are you going to link to Jen-Hsun Huang's blog next so we can get Nvidia links from there as well?