• We've upgraded our forums. Please post any issues/requests in this thread.

New NVIDIA Tesla GPUs Reduce Cost Of Supercomputing By A Factor Of 10

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
34,334 (9.23/day)
Likes
17,427
Location
Hyderabad, India
System Name Long shelf-life potato
Processor Intel Core i7-4770K
Motherboard ASUS Z97-A
Cooling Xigmatek Aegir CPU Cooler
Memory 16GB Kingston HyperX Beast DDR3-1866
Video Card(s) 2x GeForce GTX 970 SLI
Storage ADATA SU800 512GB
Display(s) Samsung U28D590D 28-inch 4K
Case Cooler Master CM690 Window
Audio Device(s) Creative Sound Blaster Recon3D PCIe
Power Supply Corsair HX850W
Mouse Razer Abyssus 2014
Keyboard Microsoft Sidewinder X4
Software Windows 10 Pro Creators Update
#1
NVIDIA Corporation today unveiled the Tesla 20-series of parallel processors for the high performance computing (HPC) market, based on its new generation CUDA processor architecture, codenamed “Fermi”.

Designed from the ground-up for parallel computing, the NVIDIA Tesla 20-series GPUs slash the cost of computing by delivering the same performance of a traditional CPU-based cluster at one-tenth the cost and one-twentieth the power.

Show full news post
 
Last edited:

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
34,334 (9.23/day)
Likes
17,427
Location
Hyderabad, India
System Name Long shelf-life potato
Processor Intel Core i7-4770K
Motherboard ASUS Z97-A
Cooling Xigmatek Aegir CPU Cooler
Memory 16GB Kingston HyperX Beast DDR3-1866
Video Card(s) 2x GeForce GTX 970 SLI
Storage ADATA SU800 512GB
Display(s) Samsung U28D590D 28-inch 4K
Case Cooler Master CM690 Window
Audio Device(s) Creative Sound Blaster Recon3D PCIe
Power Supply Corsair HX850W
Mouse Razer Abyssus 2014
Keyboard Microsoft Sidewinder X4
Software Windows 10 Pro Creators Update
#2
So it's $3,999 if you want a GTX 380 before everyone else.
 
Joined
Oct 1, 2006
Messages
4,015 (0.98/day)
Likes
477
Location
Hong Kong
Processor AMD FX-8350 @4.8Ghz
Motherboard Asrock 990FX Extreme9
Cooling CoolerMaster Seidon 120XL
Memory A-RAM DDR3 2000
Video Card(s) PowerColor HD7970 3GB V3
Storage OCZ Vertex 3 120G + Max IOPS 120G + Seagate 7200.12 250GB *2 Raid 0 + Samsung HD103UJ 1TB *2 RAID 0
Display(s) Samsung SyncMaster S24B370
Case SilverStone Raven RV01
Power Supply Corsair TX850W
Software Windows 7 Ultimate 64-bit
#3
Blah, we finally see the real Fermi.
OMG, the IO plate of this card is the exact oppsite of the HD5k series. :roll:

Up to 3GB and 6GB (respectively) on-board GDDR5 memoryi
Typo on memory.
 
Joined
May 4, 2009
Messages
1,940 (0.62/day)
Likes
409
Location
Singapore
System Name penguin
Processor i3-4160
Motherboard Asus H81 Mini-ITX
Cooling Stock
Memory 2x4GB Kingston 1600MHz
Video Card(s) Saphire Radeon 7850 2GB
Storage Plextor M5S 120GB+1TB Seagate
Display(s) 23' Dell
Case CM Elite 130
Audio Device(s) stock
Power Supply Corsair CX430m
Software W7/Lubuntu
#4
The old teslas didn't even have a display port comming out, so that's an improovement :D
 
Joined
Jun 17, 2009
Messages
92 (0.03/day)
Likes
16
Location
Tokyo Japan
System Name kumiko
Processor AMD PhenomII 720 (OC 3.32ghz)
Motherboard Asus M3A78-EM
Cooling Zalman CNPS8700 LED
Memory 4GB DDR2-800 4-4-4-12
Video Card(s) MSI R4890 cyclone OC, Sparkle GTS250
Storage Hitachi Deskstar 7K1000.B 1TB
Case Silverstone SG03 SFF
Power Supply Zumax ZU-550Z
Software Windows7 rc1, MirrorsEdge
#5
wow Q2 2010. Also the price is not a good sign ($3999). Current top Tesla (C1060) which is similar to a GTX285 sells for ~$1300. Not trying get people upset, but Geforce fermi might be really expensive (? >$600 >$800?)
 
Joined
May 4, 2009
Messages
1,940 (0.62/day)
Likes
409
Location
Singapore
System Name penguin
Processor i3-4160
Motherboard Asus H81 Mini-ITX
Cooling Stock
Memory 2x4GB Kingston 1600MHz
Video Card(s) Saphire Radeon 7850 2GB
Storage Plextor M5S 120GB+1TB Seagate
Display(s) 23' Dell
Case CM Elite 130
Audio Device(s) stock
Power Supply Corsair CX430m
Software W7/Lubuntu
#6
Q1 ! :p
 
Joined
Oct 1, 2006
Messages
4,015 (0.98/day)
Likes
477
Location
Hong Kong
Processor AMD FX-8350 @4.8Ghz
Motherboard Asrock 990FX Extreme9
Cooling CoolerMaster Seidon 120XL
Memory A-RAM DDR3 2000
Video Card(s) PowerColor HD7970 3GB V3
Storage OCZ Vertex 3 120G + Max IOPS 120G + Seagate 7200.12 250GB *2 Raid 0 + Samsung HD103UJ 1TB *2 RAID 0
Display(s) Samsung SyncMaster S24B370
Case SilverStone Raven RV01
Power Supply Corsair TX850W
Software Windows 7 Ultimate 64-bit
#7
Joined
Nov 1, 2008
Messages
420 (0.13/day)
Likes
121
System Name It does stuff
Processor AMD FX-8320
Motherboard Gigabyte GA-970A-DS3
Cooling Antec Kuhler 620 + Fans Everywhere
Memory 16GB Crucial 1600Mhz DDR3
Video Card(s) Sapphire RX 580 Nitro+ 4GB
Storage Samsung 840 Pro 256GB, Seagate Barracuda 3TB
Display(s) Viewsonic VX2757, AOC 2243W
Case Antec Three Hundred Version Two
Audio Device(s) ASUS Xonar DG PCI / Sony MDR-XB500s
Power Supply EVGA 600W
Mouse Redragon Phoenix
Keyboard Steelseries Apex Raw
Software Win10 64 Professional
#9
ATI should make a little more noise in this market. The compute potential in R800 is enormous.
 
Joined
Oct 1, 2006
Messages
4,015 (0.98/day)
Likes
477
Location
Hong Kong
Processor AMD FX-8350 @4.8Ghz
Motherboard Asrock 990FX Extreme9
Cooling CoolerMaster Seidon 120XL
Memory A-RAM DDR3 2000
Video Card(s) PowerColor HD7970 3GB V3
Storage OCZ Vertex 3 120G + Max IOPS 120G + Seagate 7200.12 250GB *2 Raid 0 + Samsung HD103UJ 1TB *2 RAID 0
Display(s) Samsung SyncMaster S24B370
Case SilverStone Raven RV01
Power Supply Corsair TX850W
Software Windows 7 Ultimate 64-bit
#10
ATI should make a little more noise in this market. The compute potential in R800 is enormous.
Its not because its own technology (Stream), and the standards OpenCL + Direct Compute are not yet ready to counter CUDA.
 
Last edited:
Joined
Jul 2, 2008
Messages
3,588 (1.04/day)
Likes
536
Location
California
#11
  • Up to 1 terabyte of memory, concurrent kernel execution, fast context switching, 10x faster atomic instructions, 64-bit virtual address space, system calls and recursive functions
The card can use up to 1TB of system memory?
  • Double precision performance in the range of 520GFlops - 630 GFlops
That doesn't sound really impressed, anyone care to explain how powerful is this card compare to current workstation cards?
 
Joined
Aug 10, 2007
Messages
4,059 (1.07/day)
Likes
1,123
Location
Geneva, FL, USA
Processor Intel i5-6600
Motherboard ASRock H170M-ITX
Cooling Cooler Master Geminii S524
Memory G.Skill DDR4-2133 16GB (8GB x 2)
Video Card(s) Gigabyte R9-380X 4GB
Storage Samsung 950 EVO 250GB (mSATA)
Display(s) LG 29UM69G-B 2560x1080 IPS
Case Lian Li PC-Q25
Audio Device(s) Realtek ALC892
Power Supply Seasonic SS-460FL2
Mouse Logitech G700s
Keyboard Logitech G110
Software Windows 10 Pro
Joined
Sep 11, 2009
Messages
2,680 (0.89/day)
Likes
693
Location
Reaching your left retina.
#14
That doesn't sound really impressed, anyone care to explain how powerful is this card compare to current workstation cards?
Products based on GT200 has 78 Gflops of double precision performance, per GPU.

EDIT: Maybe that doesn't sound impressive yet.



Finally, notice that even the GTX 285 still gets less than twice the double precision throughput of an AMD Phenom II 940 or Intel Core i7, both of which get about 50 GFlop/s for double and don’t require sophisticated latency hiding data transfer or a complex programming model.
That's from here: http://perspectives.mvdirona.com/2009/03/15/HeterogeneousComputingUsingGPGPUsNVidiaGT200.aspx

What are these cards used for? What is the main market?
Scientists, engineers, economists... anyone with high computing requirements will greatly benefit from this. Until now most of them had to allocate computing time from a supercomputer (or build their own -> $$$$$$$$$). Now they can have something as powerful as the portion of the supercomputing they'd allocate, right on their desk, for a fraction of the money and without the need to worry about their allocating time ending before they finished their studies.
 
Last edited:
Joined
Oct 1, 2006
Messages
4,015 (0.98/day)
Likes
477
Location
Hong Kong
Processor AMD FX-8350 @4.8Ghz
Motherboard Asrock 990FX Extreme9
Cooling CoolerMaster Seidon 120XL
Memory A-RAM DDR3 2000
Video Card(s) PowerColor HD7970 3GB V3
Storage OCZ Vertex 3 120G + Max IOPS 120G + Seagate 7200.12 250GB *2 Raid 0 + Samsung HD103UJ 1TB *2 RAID 0
Display(s) Samsung SyncMaster S24B370
Case SilverStone Raven RV01
Power Supply Corsair TX850W
Software Windows 7 Ultimate 64-bit
#16
Products based on GT200 has 78 Gflops of double precision performance, per GPU.

EDIT: Maybe that doesn't sound impressive yet.

http://img.techpowerup.org/091116/DP.jpg



That's from here: http://perspectives.mvdirona.com/2009/03/15/HeterogeneousComputingUsingGPGPUsNVidiaGT200.aspx



Scientists, engineers, economists... anyone with high computing requirements will greatly benefit from this. Until now most of them had to allocate computing time from a supercomputer (or build their own -> $$$$$$$$$). Now they can have something as powerful as the portion of the supercomputing they'd allocate, right on their desk, for a fraction of the money and without the need to worry about their allocating time ending before they finished their studies.
Thanks for explaining. :toast:
So do you know the typical performance?
How does that compare to lets ay a FireStream?
 
Joined
Sep 11, 2009
Messages
2,680 (0.89/day)
Likes
693
Location
Reaching your left retina.
#17
Thanks for explaining. :toast:
So do you know the typical performance?
How does that compare to lets ay a FireStream?
The real performance in applications (i.e Linpack) you say? I have no idea, but based on the white papers it shouldn't be less efficient than Cell, which was used in RoadRunner (#1 supercomputer until recently). In fact it sounds more efficient than Cell and RoadRunner was almost on par with other supercomputers when it comes to efficiency (Rpeak vs. Rmax). What I'm trying to say is that maybe you have to extract a 20% or so from the peak numbers to obtain real throughoutput, BUT I HAVE NO IDEA OF SUPERCOMPUTING. It's just my estimation after looking at TOP500 supercomputers and Cell and Fermi whitepapers...

http://www.top500.org/

EDIT: Ah, yeah. I forgot Firestream is the Ati GPGPU card, this one seems to be the fastest one: http://ati.amd.com/technology/streamcomputing/product_firestream_9270.html

It says 250 GFlops of peak double precision. It's hard to say and I'm probably going to be flamed and called fanboy, but the actual throughoutput is probably much much lower. That's the same DP Gflops as a HD4870 card would have (it seems based on RV770 anyway) and based on how the Ati cards perform compared to Nvidia cards in things like F@H, IMO it's real Gflops have to be more like 50 Gflops.
 
Last edited:
Joined
Oct 1, 2006
Messages
4,015 (0.98/day)
Likes
477
Location
Hong Kong
Processor AMD FX-8350 @4.8Ghz
Motherboard Asrock 990FX Extreme9
Cooling CoolerMaster Seidon 120XL
Memory A-RAM DDR3 2000
Video Card(s) PowerColor HD7970 3GB V3
Storage OCZ Vertex 3 120G + Max IOPS 120G + Seagate 7200.12 250GB *2 Raid 0 + Samsung HD103UJ 1TB *2 RAID 0
Display(s) Samsung SyncMaster S24B370
Case SilverStone Raven RV01
Power Supply Corsair TX850W
Software Windows 7 Ultimate 64-bit
#18
The real performance in applications (i.e Linpack) you say? I have no idea, but based on the white papers it shouldn't be less efficient than Cell, which was used in RoadRunner (#1 supercomputer until recently). In fact it sounds more efficient than Cell and RoadRunner was almost on par with other supercomputers when it comes to efficiency (Rpeak vs. Rmax). What I'm trying to say is that maybe you have to extract a 20% or so from the peak numbers to obtain real throughoutput, BUT I HAVE NO IDEA OF SUPERCOMPUTING. It's just my estimation after looking at TOP500 supercomputers and Cell and Fermi whitepapers...

http://www.top500.org/
The thing about the ATi cards is that their SMID architechure seems less flexible than nVidia's MIMD route.
That is the reason I have doubts on its performance.

I am trying to understand this: :p
http://perspectives.mvdirona.com/2009/03/18/HeterogeneousComputingUsingGPGPUsAMDATIRV770.aspx
 
Last edited:
Joined
Aug 15, 2008
Messages
5,941 (1.74/day)
Likes
1,192
Location
Watauga, Texas
System Name Univac SLI Edition
Processor Intel Xeon 1650 V3 @ 4.2GHz
Motherboard eVGA X99 FTW K
Cooling EK Supremacy EVO, Swiftech MCP50x, Alphacool NeXXos UT60 360, Black Ice GTX 360
Memory 2x16GB Corsair Vengeance LPX 3000MHz
Video Card(s) Nvidia Titan X Tri-SLI w/ EK Blocks
Storage HyperX Predator 240GB PCI-E, Samsung 850 Pro 512GB
Display(s) Dell UltraSharp 34" Ultra-Wide (U3415W) / (Samsung 48" Curved 4k)
Case Phanteks Enthoo Pro M Acrylic Edition
Audio Device(s) Sound Blaster Z
Power Supply Thermaltake 1350watt Toughpower Modular
Mouse Logitech G502
Keyboard CODE 10 keyless MX Clears
Software Windows 10 Pro
#19
This is proof there is a gt300. So where is our desktop cards huh nvidia?
 

WarEagleAU

Bird of Prey
Joined
Jul 9, 2006
Messages
10,809 (2.59/day)
Likes
529
Location
Gurley, AL
System Name Boddha Getta Boddha Getta Bah!
Processor AMD FX 6100 @ 4.432Ghz @1.382
Motherboard ASUS M5A99X EVO AMD 990X AMD SB950
Cooling Custom Water. EK 240MM Kit, Supreme HSF - Runs 35C
Memory 2 x 4GB Corsair Vengeance White LP @ 1.35V
Video Card(s) XFX Radeon HD 6870 980/1100
Storage WD Caviar Black 1.0TB, WD Caviar Green 1.0TB, WD 160GB
Display(s) Asus VH222/S 22: (21.5" Viewable) 1920x1080p HDMI LCD Monitor
Case NZXT White Switch 810
Audio Device(s) Onboard Realtek 5.1
Power Supply NZXT Hale 90 Gold Cert 750W Modular PSU
Software Windows 8.1 Profession 64 Bit
#20
Pretty impressive to lower costs that much.

@Zubasa, why isn't ATI and Stream with Open CL ready to go against Cuda?
 
Joined
Oct 1, 2006
Messages
4,015 (0.98/day)
Likes
477
Location
Hong Kong
Processor AMD FX-8350 @4.8Ghz
Motherboard Asrock 990FX Extreme9
Cooling CoolerMaster Seidon 120XL
Memory A-RAM DDR3 2000
Video Card(s) PowerColor HD7970 3GB V3
Storage OCZ Vertex 3 120G + Max IOPS 120G + Seagate 7200.12 250GB *2 Raid 0 + Samsung HD103UJ 1TB *2 RAID 0
Display(s) Samsung SyncMaster S24B370
Case SilverStone Raven RV01
Power Supply Corsair TX850W
Software Windows 7 Ultimate 64-bit
#21
This is proof there is a gt300. So where is our desktop cards huh nvidia?
It is also prove that there are simply not a significant amount of them for retail.:shadedshu
They rather sell Teslas for thousands of dollars instead of hundreds for desktop parts. :respect:

Edit: The nVidia site also states that the Geforce should be ready for Q1, hope that is not a paper launch.

Pretty impressive to lower costs that much.

@Zubasa, why isn't ATI and Stream with Open CL ready to go against Cuda?
Well there is hardly anything that uses OpenCL yet, in fact ATi haven't release drivers that enables OpenCL and DirectCompute on older cards.
"Older" includes all the HD3k and 4k series.
Stream is in a even more pityful state, I hardly knows any software that supports it apart from stuff from Adobe.

Edit: According to Bjorn3D, there are a little more...
http://www.bjorn3d.com/read.php?cID=1408&pageID=5778
* Adobe Acrobat®Reader: “Up to 20%* performance improvement when working with graphically rich, high resolution PDF files when compared to using the CPU only”
* Adobe Photoshop CS4® Extended: “Accelerated image and 3D model previewing (panning, zooming, rotation) and 3D manipulations to photos, for example mapping an image onto a 3D object”
* Adobe After Effects®CS4: “Allows for the rapid application of special effects to digital media”
* Adobe Flash®10: “Dynamic, graphically engaging Web content designed with these capabilities in mind”
* Microsoft Windows Vista®: “Harness stream processing to make image adjustments on the fly in Microsoft’s Picture Viewer application”
* Microsoft Expression®Encoder: “Accelerated encoding of content for Microsoft®Silverlight™, Windows Media video and audio”
* Microsoft Office® PowerPoint 2007: “Acceleration of slideshow playback for smooth animations, transitions and slide display”
* Microsoft Silverlight: “Unlocking the full potential for web based multi-media and robust user experience and interface”
 
Last edited:
Joined
Sep 11, 2009
Messages
2,680 (0.89/day)
Likes
693
Location
Reaching your left retina.
#22
Well there is hardly anything that uses OpenCL yet, in fact ATi haven't release drivers that enables OpenCL and DirectCompute on older cards.
"Older" includes all the HD3k and 4k series.
Stream is in a even more pityful state, I hardly knows any software that supports it apart from stuff from Adobe.
Not to mention that CUDA has Visual Studio integration and many more tools, profilers, debuggers...

It's also a high level language* and that makes easier to program for than the other ones which are low-medium level languages.

Nvidia did really put a lot of effort into GPGPU since G80 days and it's really paying off now.

*You can still access low level if you wish, you can get pretty close to silicon.
 
Joined
Sep 24, 2008
Messages
2,470 (0.73/day)
Likes
613
Location
Hillsboro, Oregon, USA
System Name Dire Wolf II
Processor Intel Core i7 7820HQ (2.9Ghz, up to 3.9Ghz)
Motherboard HP 8275
Memory 32GB DDR4 2400Mhz
Video Card(s) Sapphire R9 Fury Nitro OC 4GB (Thunderbolt3 eGPU), nVidia Quadro M1200 (GTX750Ti) 4GB GDDR5
Storage HP NVMe 256GB
Display(s) HP Z27q (5120x2880) + Dell P2715Q (3840x2160)
Case HP ZBook 15 G4
Audio Device(s) Musiland Monitor 02 US, Skullcandy SLYR
Power Supply 150W HP PSU (for Laptop) + Corsair RM550x (for eGPU)
Mouse Logitech G400
Keyboard Corsair K95 RGB
Software Windows 10 Enterprise 64-bit
#23
As someone who is currently dabbling in OpenCL code on GT200 and G9X cards, the architectural changes are quite impressive over the previous series and will make a programmer's life easier.

But now is the question: WHERE IS MY GODDAMNED GTX380 ?!?!?! :D
 
Joined
Nov 13, 2009
Messages
5,614 (1.90/day)
Likes
1,678
Location
San Diego, CA
System Name White Boy
Processor Core i7 3770k @4.6 Ghz
Motherboard ASUS P8Z77-I Deluxe
Cooling CORSAIR H100
Memory CORSAIR Vengeance 16GB @ 2177
Video Card(s) EVGA GTX 680 CLASSIEFIED @ 1250 Core
Storage 2 Samsung 830 256 GB (Raid 0) 1 Hitachi 4 TB
Display(s) 1 Dell 30U11 30"
Case BIT FENIX Prodigy
Audio Device(s) none
Power Supply SeaSonic X750 Gold 750W Modular
Software Windows Pro 7 64 bit || Ubuntu 64 Bit
Benchmark Scores 2017 Unigine Heaven :: P37239 3D Mark Vantage
#24
wow Q2 2010. Also the price is not a good sign ($3999). Current top Tesla (C1060) which is similar to a GTX285 sells for ~$1300. Not trying get people upset, but Geforce fermi might be really expensive (? >$600 >$800?)
Don't you think it's a bit early for speculation? Also, you can't compare, industrial grade hardware meant for super computing, to consumer grade products! Seriously, use your head.
 
Joined
Oct 1, 2006
Messages
4,015 (0.98/day)
Likes
477
Location
Hong Kong
Processor AMD FX-8350 @4.8Ghz
Motherboard Asrock 990FX Extreme9
Cooling CoolerMaster Seidon 120XL
Memory A-RAM DDR3 2000
Video Card(s) PowerColor HD7970 3GB V3
Storage OCZ Vertex 3 120G + Max IOPS 120G + Seagate 7200.12 250GB *2 Raid 0 + Samsung HD103UJ 1TB *2 RAID 0
Display(s) Samsung SyncMaster S24B370
Case SilverStone Raven RV01
Power Supply Corsair TX850W
Software Windows 7 Ultimate 64-bit
#25
I know this is getting off topic, but what exactly is this?
It comes with CCC suite 9.10. :confused: