1. Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

NVIDIA Launches World's First High-Speed GPU Interconnect

Discussion in 'News' started by Cristian_25H, Mar 25, 2014.

  1. Cristian_25H

    Cristian_25H News Poster

    Joined:
    Dec 6, 2011
    Messages:
    4,675 (3.52/day)
    Thanks Received:
    1,174
    Location:
    Still on the East Side
    NVIDIA today announced that it plans to integrate a high-speed interconnect, called NVIDIA NVLink, into its future GPUs, enabling GPUs and CPUs to share data five to 12 times faster than they can today. This will eliminate a longstanding bottleneck and help pave the way for a new generation of exascale supercomputers that are 50-100 times faster than today's most powerful systems.

    NVIDIA will add NVLink technology into its Pascal GPU architecture -- expected to be introduced in 2016 -- following this year's new NVIDIA Maxwell compute architecture. The new interconnect was co-developed with IBM, which is incorporating it in future versions of its POWER CPUs.

    [​IMG] [​IMG]

    "NVLink technology unlocks the GPU's full potential by dramatically improving data movement between the CPU and GPU, minimizing the time that the GPU has to wait for data to be processed," said Brian Kelleher, senior vice president of GPU Engineering at NVIDIA.

    "NVLink enables fast data exchange between CPU and GPU, thereby improving data throughput through the computing system and overcoming a key bottleneck for accelerated computing today," said Bradley McCredie, vice president and IBM Fellow at IBM. "NVLink makes it easier for developers to modify high-performance and data analytics applications to take advantage of accelerated CPU-GPU systems. We think this technology represents another significant contribution to our OpenPOWER ecosystem."

    With NVLink technology tightly coupling IBM POWER CPUs with NVIDIA Tesla GPUs, the POWER data center ecosystem will be able to fully leverage GPU acceleration for a diverse set of applications, such as high performance computing, data analytics and machine learning.

    Advantages Over PCI Express 3.0
    Today's GPUs are connected to x86-based CPUs through the PCI Express (PCIe) interface, which limits the GPU's ability to access the CPU memory system and is four- to five-times slower than typical CPU memory systems. PCIe is an even greater bottleneck between the GPU and IBM POWER CPUs, which have more bandwidth than x86 CPUs. As the NVLink interface will match the bandwidth of typical CPU memory systems, it will enable GPUs to access CPU memory at its full bandwidth.

    This high-bandwidth interconnect will dramatically improve accelerated software application performance. Because of memory system differences -- GPUs have fast but small memories, and CPUs have large but slow memories -- accelerated computing applications typically move data from the network or disk storage to CPU memory, and then copy the data to GPU memory before it can be crunched by the GPU. With NVLink, the data moves between the CPU memory and GPU memory at much faster speeds, making GPU-accelerated applications run much faster.

    Unified Memory Feature
    Faster data movement, coupled with another feature known as Unified Memory, will simplify GPU accelerator programming. Unified Memory allows the programmer to treat the CPU and GPU memories as one block of memory. The programmer can operate on the data without worrying about whether it resides in the CPU's or GPU's memory.

    Although future NVIDIA GPUs will continue to support PCIe, NVLink technology will be used for connecting GPUs to NVLink-enabled CPUs as well as providing high-bandwidth connections directly between multiple GPUs. Also, despite its very high bandwidth, NVLink is substantially more energy efficient per bit transferred than PCIe.

    NVIDIA has designed a module to house GPUs based on the Pascal architecture with NVLink. This new GPU module is one-third the size of the standard PCIe boards used for GPUs today. Connectors at the bottom of the Pascal module enable it to be plugged into the motherboard, improving system design and signal integrity.

    NVLink high-speed interconnect will enable the tightly coupled systems that present a path to highly energy-efficient and scalable exascale supercomputers, running at 1,000 petaflops (1 x 1018 floating point operations per second), or 50 to 100 times faster than today's fastest systems.
     
    remixedcat says thanks.
  2. Hilux SSRG

    Hilux SSRG

    Joined:
    May 1, 2012
    Messages:
    1,025 (0.87/day)
    Thanks Received:
    170
    Location:
    New Jersey, USA
    Watching the keynote and Pascal looks like it will hammer away at Intel and their CPUs.
     
  3. btarunr

    btarunr Editor & Senior Moderator Staff Member

    Joined:
    Oct 9, 2007
    Messages:
    30,323 (10.64/day)
    Thanks Received:
    14,681
    Location:
    Hyderabad, India
    Hey there HyperTransport, long time!
     
  4. cadaveca

    cadaveca My name is Dave

    Joined:
    Apr 10, 2006
    Messages:
    14,372 (4.23/day)
    Thanks Received:
    7,697
    Location:
    Edmonton, Alberta
    I believe you mean "SidePort".

    = AMD SidePort/IOMMU.

    ;)

    As is the norm, AMD creates the idea, and Nvidia brings a useable form to the masses. Tech partnerships at it's best, really.
     
    Prima.Vera says thanks.
  5. DaJMasta

    Joined:
    Nov 6, 2005
    Messages:
    479 (0.13/day)
    Thanks Received:
    38
    Location:
    Silver Spring, MD
    I'm not against it... but since when was GPU scaling limited by PCIe throughput? Maybe it's a latency thing... but 16 PCIe 3.0 lanes is quite a bit of bandwidth, and I thought that we've seen time and time again that the performance impact of halving that (running 8x) is minimal even with the highest end cards.

    They say it's because they need access to CPU memory and that GPU memory is "small", but I think again we've seen the opposite trend. Plenty of enthusiast computers have 16GB of main memory but have easily 3-4GB of VRAM per card. Is it just that a lot of main memory isn't used in gaming, and you can just hoard textures in there if you had more bandwidth?
     
  6. hhumas

    Joined:
    Jun 24, 2011
    Messages:
    554 (0.37/day)
    Thanks Received:
    22
    Location:
    Islamabad
    its crazy ...
     
  7. cadaveca

    cadaveca My name is Dave

    Joined:
    Apr 10, 2006
    Messages:
    14,372 (4.23/day)
    Thanks Received:
    7,697
    Location:
    Edmonton, Alberta
    If you do GPGPU, ever since such was possible, PCIe has been a limitation. This FACT (shown by the AOKI paper at the beginning of the STREAM, and has been something I have been personally talking about for years) has been present since PCIe came out, really.


    A limitation in gaming? Yes AND No. AMD Multi-GPU stutter problems are due to PCIe limitations.

    If you watched Nvidia's promo...they easily pointed out that in order to provide what is needed to make a real jump in graphics, requires 1000's of bits of memory interconnect...compared to the 384 we have today. Being able to feed that memory, as well as other GPUs, is not possible over PCIe...hence NV-LINK.
     
  8. Hilux SSRG

    Hilux SSRG

    Joined:
    May 1, 2012
    Messages:
    1,025 (0.87/day)
    Thanks Received:
    170
    Location:
    New Jersey, USA

    So is NVLINK a physical replacement to PCIe on the Nvidia mobos?

    I will be glad to buy a mobo that has both NVLINK and PCIe. Especially tired of seeing Intel's lack of PCIe 4, DDR4, new technology interfaces, etc on their Z97 and X99 platforms.
     
  9. cadaveca

    cadaveca My name is Dave

    Joined:
    Apr 10, 2006
    Messages:
    14,372 (4.23/day)
    Thanks Received:
    7,697
    Location:
    Edmonton, Alberta

    Christian_25H covered that well already:

     
  10. Arjai

    Arjai

    Joined:
    Apr 3, 2012
    Messages:
    2,925 (2.42/day)
    Thanks Received:
    5,788
    Location:
    St. Paul, MN
    Looks promising...How long will it take to get to gaming desktops, any guesses? 2016 seems like enough time to incorporate it to a gaming platform...hmm?

    *EDIT, just noticed I had put in 2026, instead of 2016. :oops:
     
    Last edited: Mar 25, 2014
    Crunching for Team TPU More than 25k PPD
  11. erocker

    erocker Super Moderator Staff Member

    Joined:
    Jul 19, 2006
    Messages:
    40,678 (12.34/day)
    Thanks Received:
    15,532
    Interesting, what are these going to be?
     
  12. cadaveca

    cadaveca My name is Dave

    Joined:
    Apr 10, 2006
    Messages:
    14,372 (4.23/day)
    Thanks Received:
    7,697
    Location:
    Edmonton, Alberta

    I guess we'll start to see them in 2016?

    Although, the mention of IBM POWERPC chips...kinda...well...removes my excitement. :roll:


    Looking at the physical sample NV showed today, it looks a lot like a module for the new Apple MAC PRO trashcan-PC.

    If that's Nvidia's choice to stay relevant to the marketplace...to work with Apple...well...
     
  13. erocker

    erocker Super Moderator Staff Member

    Joined:
    Jul 19, 2006
    Messages:
    40,678 (12.34/day)
    Thanks Received:
    15,532
    Heh, I didn't even know IBM still made PowerPC chips! Looking forward to seeing how it pans out.
     
  14. H2323 New Member

    Joined:
    Mar 25, 2014
    Messages:
    5 (0.01/day)
    Thanks Received:
    0
    K...seriously how is this usable....the CPU has to have this built in as well, and it is clear that the only one that will do this is PowerPC and any custom ARM SoC Nvidia wants to design. This is for enterprise and supercomputers it will not be on your PC. AMD and Intel already have there own internal solutions.
     
  15. H2323 New Member

    Joined:
    Mar 25, 2014
    Messages:
    5 (0.01/day)
    Thanks Received:
    0
    Just powerPC, AMD has there HSA solution and Intel obviously has there own solutions, sounds good but nothing for the consumer
     
  16. Jizzler

    Jizzler

    Joined:
    Aug 10, 2007
    Messages:
    3,602 (1.24/day)
    Thanks Received:
    714
    Location:
    Geneva, FL, USA
    In the short term we may not get CPUs with NV-Link in consumer form but perhaps it will allow for better performing and more efficient multi-GPU gaming cards?

    The Titan-Z is already outdated, wait for the Titan-Z2 with 12GB of unified memory ;)
     
  17. cadaveca

    cadaveca My name is Dave

    Joined:
    Apr 10, 2006
    Messages:
    14,372 (4.23/day)
    Thanks Received:
    7,697
    Location:
    Edmonton, Alberta

    Yep, push data to primary card, and then have secondary cards link together as slave devices, presenting itself as a large compute interface that the OS sees like a single compute device.



    Oh wait, that's exactly the scenario hinted at in the presentation...:p and shown in the slides.
     
  18. Jizzler

    Jizzler

    Joined:
    Aug 10, 2007
    Messages:
    3,602 (1.24/day)
    Thanks Received:
    714
    Location:
    Geneva, FL, USA
    I'll wait for a whitepaper on it to be posted. It's better for their bottom line if I don't watch these types of presentations :)
     
  19. zinfinion

    Joined:
    Jun 19, 2013
    Messages:
    39 (0.05/day)
    Thanks Received:
    18
    Uhhh, what happened to Volta? Wasn't that supposed to be Maxwell's follow up?
     
  20. Hilux SSRG

    Hilux SSRG

    Joined:
    May 1, 2012
    Messages:
    1,025 (0.87/day)
    Thanks Received:
    170
    Location:
    New Jersey, USA
    It may still slot in between Maxwell and Pascal in 2015/2016.
     
  21. Steevo

    Steevo

    Joined:
    Nov 4, 2005
    Messages:
    9,027 (2.54/day)
    Thanks Received:
    1,632
    2016..just right....around.........the.....................corner..........................................................



    Is it just me or does it seem like someone threw the PR team there out a window and they are trying to make enough PR slides to land on.
     
    10 Million points folded for TPU
  22. LeonVolcove

    LeonVolcove

    Joined:
    Jan 10, 2014
    Messages:
    154 (0.27/day)
    Thanks Received:
    12
    so its like AMD HSA but you still need CPU + Dedicated GPU?
     
  23. RejZoR

    RejZoR

    Joined:
    Oct 2, 2004
    Messages:
    5,891 (1.49/day)
    Thanks Received:
    1,560
    Location:
    Europe/Slovenia
    Unfortunately i fell asleep during the keynote. It was too much scientific computing and very little for the gaming. And if i'm honest, both graphics demos were rather boring. That Unreal Engine fight scene was nothing to talk about and that whale water simulation, it was ok and shows the muscle, but i wasn't actually impressed by it. There are so many better and more impressive ways to showcase fluid dynamics than with a transparent whale...
     

Currently Active Users Viewing This Thread: 1 (0 members and 1 guest)

Share This Page