1. Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

New NVIDIA Tesla GPU Smashes World Record in Scientific Computation

Discussion in 'News' started by btarunr, May 17, 2011.

  1. btarunr

    btarunr Editor & Senior Moderator Staff Member

    Joined:
    Oct 9, 2007
    Messages:
    28,198 (11.43/day)
    Thanks Received:
    13,569
    Location:
    Hyderabad, India
    NVIDIA today unveiled the Tesla M2090 GPU, the world's fastest parallel processor for high performance computing. In addition, the Tesla M2090 GPU achieved the fastest-ever performance in a key measure of scientific computation. Equipped with 512 CUDA parallel processing cores, the Tesla M2090 GPU delivers 665 gigaflops of peak double-precision performance, enabling application acceleration by up to 10x compared to using a CPU alone.

    In the latest version of AMBER 11, one of the most widely used applications for simulating behaviors of biomolecules, four Tesla M2090 GPUs coupled with four CPUs delivered record performance of 69 nanoseconds of simulation per day. The fastest AMBER performance recorded on a CPU-only supercomputer is 46 ns/day.

    [​IMG] [​IMG]

    "This is the fastest result ever reported. With Tesla M2090 GPUs, AMBER users in university departments can obtain application performance that outstrips what is possible even with extensive supercomputer access," said Ross Walker, assistant research professor at the San Diego Computer Center, and principle contributor to the AMBER code.

    The Tesla M2090 GPU will be available in servers such as the new HP ProLiant SL390 G7 4U server. As part of the SL6500 Scalable System of HP server solutions, optimized for scale-out and HPC market segments, the SL390 family was purpose built for hybrid computing environments that combine GPUs and CPUs.

    The SL390 G7 4U server incorporates up to eight Tesla M2090 GPUs in a half-width 4U chassis and, with a configuration of eight GPUs to two CPUs, offers the highest GPU-to-CPU density on the market. The system is ideally suited for applications ranging from quantum chemistry and molecular dynamics to seismic processing and data analytics.

    "Clients running intensive data center applications require systems that can process massive amounts of complex data quickly and efficiently," said Glenn Keels, director of marketing, Hyperscale business unit, HP. "The decade long collaboration between HP and NVIDIA has created one of the industry's fastest CPU to GPU configurations available, delivering clients the needed processing power and speed to handle the most complex scientific computations."

    In addition to AMBER, the Tesla M2090 GPU is ideally suited to a wide range of GPU-accelerated HPC applications. These include molecular dynamics applications, NAMD and GROMACS, computer-aided engineering applications, ANSYS Mechanical, Altair Acusolve and Simulia Abaqus, earth science applications, WRF, HOMME and ASUCA, oil and gas applications, Paradigm Voxelgeo and Schlumberger Petrel as well as other key applications such as MATLAB, GADGET2 and GPU-BLAST.

    For more information on the HP ProLiant SL390 G7 4U server, go here, and for more information on the Tesla M2090 and the whole family of M-class products, please go here.
    1c3d0g and Jack Doph say thanks.
  2. Sasqui

    Sasqui

    Joined:
    Dec 6, 2005
    Messages:
    7,430 (2.37/day)
    Thanks Received:
    1,270
    Location:
    Manchester, NH
    What is a "ns" as in "46 ns/day" ?

    Edit, Nevermind... "nanoseconds of simulation per day"
  3. AsRock

    AsRock TPU addict

    Joined:
    Jun 23, 2007
    Messages:
    10,674 (4.15/day)
    Thanks Received:
    1,615
    Location:
    US
    I am more interested in that cooler lol.
  4. WhiteLotus

    WhiteLotus

    Joined:
    Jul 30, 2007
    Messages:
    6,530 (2.57/day)
    Thanks Received:
    847
    Can somebody please tell me again why you can't use these to play games?
  5. Sasqui

    Sasqui

    Joined:
    Dec 6, 2005
    Messages:
    7,430 (2.37/day)
    Thanks Received:
    1,270
    Location:
    Manchester, NH
    Was thinking that too.

    Because there is no DVI or HDMI connector :laugh:
  6. cheesy999

    Joined:
    Jul 2, 2010
    Messages:
    3,881 (2.64/day)
    Thanks Received:
    572
    Location:
    UK
    i was gonna say because at the price of this thing you could probably go dual proccesor and quad SLI
  7. btarunr

    btarunr Editor & Senior Moderator Staff Member

    Joined:
    Oct 9, 2007
    Messages:
    28,198 (11.43/day)
    Thanks Received:
    13,569
    Location:
    Hyderabad, India
    At its price you can build an entire gaming rig that can run games faster than it, if it had a display output.
  8. cheesy999

    Joined:
    Jul 2, 2010
    Messages:
    3,881 (2.64/day)
    Thanks Received:
    572
    Location:
    UK
    that's what i mean, a dual lga1366 build with dual 590
  9. LAN_deRf_HA

    LAN_deRf_HA

    Joined:
    Apr 4, 2008
    Messages:
    4,497 (1.96/day)
    Thanks Received:
    930
    I've gotten the impression that workstations cards have gotten better at playing games compared to their consumer counterparts, at least with the right drivers. However the last time I was able to see that put to the test was back with the G80s. Nobody bothers to test these things like that.
  10. Sasqui

    Sasqui

    Joined:
    Dec 6, 2005
    Messages:
    7,430 (2.37/day)
    Thanks Received:
    1,270
    Location:
    Manchester, NH
    In most cases, the workstation card drivers are optimized for OpenGL, and specialty products, usually not DirectX. It's the reason they cost so darn much because the user base is so much smaller and the economy of scale in writing drivers just isn't there. From what I've seen the ATI and NV offerings are slight one offs from the consumer cards (in some cases, only very slightly). So, depending on the game 3D display means, the workstation cards will be about the same, or better in games.
  11. wolf

    wolf Performance Enthusiast

    Joined:
    May 7, 2007
    Messages:
    5,541 (2.11/day)
    Thanks Received:
    842
    you wouldn't even need that much, a 2600K and two GTX570's or 6950's would cream a single Tesla M2090 for cheaper.
  12. Bjorn_Of_Iceland

    Bjorn_Of_Iceland

    Joined:
    Jan 2, 2008
    Messages:
    3,172 (1.33/day)
    Thanks Received:
    375
    Doing science!
  13. damric

    damric

    Joined:
    Feb 17, 2010
    Messages:
    359 (0.22/day)
    Thanks Received:
    95
    Are these GTX 580's with a different BIOS and drivers?
  14. Easy Rhino

    Easy Rhino Linux Advocate

    Joined:
    Nov 13, 2006
    Messages:
    13,382 (4.78/day)
    Thanks Received:
    3,218
    no, they are completely different. like apples and oranges really.
  15. Damn_Smooth

    Damn_Smooth New Member

    Joined:
    May 16, 2011
    Messages:
    1,435 (1.25/day)
    Thanks Received:
    478
    Location:
    A frozen turdberg.
    So basically they're saying that they achieved better performance by adding a gpu?

    I could have told them that.
    damric says thanks.
  16. Kreij

    Kreij Senior Monkey Moderator Staff Member

    Joined:
    Feb 6, 2007
    Messages:
    13,881 (5.12/day)
    Thanks Received:
    5,615
    Location:
    Cheeseland (Wisconsin, USA)
    These are not graphics cards ... at all.
    The suite of AMBER program calculates molecular force fields and molecular trajectories of interacting molecules within a given fixed field.
    The amount of data calculation to see what happens in a fixed multi-molecular field in 1 nanosecond is astronomical. The computations for single molecule interaction in staggering.
    Doing 69ns in a 24 hour period is a real achievement, and I applaude Nvidia on their continued advancement into supercomputing.

    Don't expect to see this in Duke Nukem Forever.
    1c3d0g and wolf say thanks.
  17. hellrazor

    hellrazor

    Joined:
    Feb 18, 2010
    Messages:
    1,568 (0.98/day)
    Thanks Received:
    315
    Looks like a wider version of the one on my CPU.
  18. damric

    damric

    Joined:
    Feb 17, 2010
    Messages:
    359 (0.22/day)
    Thanks Received:
    95
    But it is technically a Fermi GF 110, is it not? Just without display outputs?

    Edit: I found this. Looks like it is indeed a GF 110 core. I wonder if someone could take a GTX 580 and manipulate it into one of these new Tesla cards.


    http://www.nvidia.com/docs/IO/105880/DS_Tesla-M2090_LR.pdf
    Last edited: May 18, 2011
    Damn_Smooth says thanks.
  19. 1c3d0g

    1c3d0g

    Joined:
    Dec 9, 2007
    Messages:
    683 (0.28/day)
    Thanks Received:
    58
    Is this the new Folding@Home King? Either way, thumbs up, NVIDIA! :cool:
  20. bear jesus

    bear jesus New Member

    Joined:
    Aug 12, 2010
    Messages:
    1,535 (1.07/day)
    Thanks Received:
    200
    Location:
    Britland
    Always good to see advancing computer tech help advance science.

    But as this has four Tesla M2090 GPU's coupled with four CPU's and pumped out 69 nanoseconds of simulation in 24 hours, assuming perfect scaling how many would be needed to simulate in real time? :laugh: it's 6am so I'm not doing the math.
  21. caleb

    Joined:
    Sep 15, 2004
    Messages:
    1,536 (0.43/day)
    Thanks Received:
    204
    Location:
    Poland,Slask
    Why not it can be tessellation 10.0 :)
  22. vanyots

    vanyots

    Joined:
    May 23, 2007
    Messages:
    35 (0.01/day)
    Thanks Received:
    7
    Location:
    Sofia, Bulgaria
    1 252 173 913 043.5 according to my PC calculator:eek::eek::eek:
    (that's 1.25 trillion systems like the one in question)
    wolf and bear jesus say thanks.
    More than 25k PPD
  23. bear jesus

    bear jesus New Member

    Joined:
    Aug 12, 2010
    Messages:
    1,535 (1.07/day)
    Thanks Received:
    200
    Location:
    Britland
    :eek: OK we are probably quite a few generations away, probably after we hit the wall with current chip tech.
  24. MikeX New Member

    Joined:
    Jun 15, 2006
    Messages:
    125 (0.04/day)
    Thanks Received:
    10
    That came with a 6GB on board. Ten times more expensive than a Gtx 580.
    My logical GPGPU solution would be get the gtx 590 and 12GB kit. You can share system's memory anyway.
  25. Steevo

    Steevo

    Joined:
    Nov 4, 2005
    Messages:
    8,107 (2.56/day)
    Thanks Received:
    1,123
    I dont know if we will ever be able to compute real time life. With the limitations of size the only possible way would be quantum processing.
    10 Million points folded for TPU

Currently Active Users Viewing This Thread: 1 (0 members and 1 guest)

Share This Page