
NVIDIA Quantum-2 Takes Supercomputing to New Heights, Into the Cloud

AleksandarK

News Editor
Staff member
Joined
Aug 19, 2017
Messages
2,225 (0.91/day)
NVIDIA today announced NVIDIA Quantum-2, the next generation of its InfiniBand networking platform, which offers the extreme performance, broad accessibility and strong security needed by cloud computing providers and supercomputing centers.

The most advanced end-to-end networking platform ever built, NVIDIA Quantum-2 is a 400 Gbps InfiniBand networking platform that consists of the NVIDIA Quantum-2 switch, the ConnectX-7 network adapter, the BlueField-3 data processing unit (DPU) and all the software that supports the new architecture.



The introduction of NVIDIA Quantum-2 comes as supercomputing centers are increasingly opening to multitudes of users, many from outside their organizations. At the same time, the world's cloud service providers are beginning to offer more supercomputing services to their millions of customers.

NVIDIA Quantum-2 includes key features required for demanding workloads running in either arena. Supercharged by cloud-native technologies, it provides high performance with 400 gigabits per second of throughput and advanced multi-tenancy to accommodate many users.

"The requirements of today's supercomputing centers and public clouds are converging," said Gilad Shainer, senior vice president of Networking at NVIDIA. "They must provide the greatest performance possible for next-generation HPC, AI and data analytics challenges, while also securely isolating workloads and responding to varying demands of user traffic. This vision of the modern data center is now real with NVIDIA Quantum-2 InfiniBand."

NVIDIA Quantum-2 Performance and Cloud-Native Capabilities
With 400 Gbps, NVIDIA Quantum-2 InfiniBand doubles the network speed and triples the number of network ports. It accelerates performance by 3x and reduces the need for data center fabric switches by 6x, while cutting data center power consumption and reducing data center space by 7 percent each.

The multi-tenant performance isolation of NVIDIA Quantum-2 keeps the activity of one tenant from disturbing others, utilizing an advanced telemetry-based congestion control system with cloud-native capabilities that ensure reliable throughput, regardless of spikes in users or workload demands.

NVIDIA Quantum-2 SHARPv3 In-Network Computing technology provides 32x more acceleration engines for AI applications compared with the previous generation. Advanced InfiniBand fabric management for data centers, including predictive maintenance, is enabled with the NVIDIA UFM Cyber-AI platform.

A nanosecond-precision timing system integrated into NVIDIA Quantum-2 can synchronize distributed applications, like database processing, helping to reduce the overhead of wait and idle times. This new capability allows cloud data centers to become part of the telecommunications network and host software-defined 5G radio services.

Quantum-2 InfiniBand Switch
At the heart of the Quantum-2 platform is the new Quantum-2 InfiniBand switch. With 57 billion transistors on 7-nanometer silicon, it is slightly bigger than the NVIDIA A100 GPU with 54 billion transistors.

It features 64 ports at 400 Gbps or 128 ports at 200 Gbps and will be offered in a variety of switch systems up to 2,048 ports at 400 Gbps or 4,096 ports at 200 Gbps—more than 5x the switching capability over the previous generation, Quantum-1.
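
As a rough sanity check on the port figures above, the sketch below multiplies port count by per-port rate to get an aggregate one-way bandwidth for each quoted configuration. The helper name `aggregate_tbps` and the notion of "aggregate bandwidth" as a simple product are assumptions made for illustration; they are not figures or terminology from NVIDIA's announcement.

```python
# Port counts and per-port rates are the ones quoted in the announcement.
# The aggregate figure is just ports x rate (one direction), which is an
# assumption of this sketch, not a number NVIDIA published here.

def aggregate_tbps(ports: int, gbps_per_port: int) -> float:
    """Return the combined one-way bandwidth in terabits per second."""
    return ports * gbps_per_port / 1000


# (label, ports, Gbps per port) for each configuration named in the article.
CONFIGS = [
    ("Quantum-2 switch, 400 Gbps ports", 64, 400),
    ("Quantum-2 switch, 200 Gbps ports", 128, 200),
    ("Largest switch system, 400 Gbps ports", 2048, 400),
    ("Largest switch system, 200 Gbps ports", 4096, 200),
]

if __name__ == "__main__":
    for label, ports, rate in CONFIGS:
        print(f"{label}: {ports} x {rate} Gbps = {aggregate_tbps(ports, rate):.1f} Tbps")
```

Both port modes of a given system come out to the same total, which suggests each 400 Gbps port can be split into two 200 Gbps ports.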

The combined networking speed, switching capability and scalability are ideal for building the next generation of giant HPC systems.

The NVIDIA Quantum-2 switch is now available from a wide range of leading infrastructure and system vendors around the world, including Atos, DataDirect Networks (DDN), Dell Technologies, Excelero, GIGABYTE, HPE, IBM, Inspur, Lenovo, NEC, Penguin Computing, QCT, Supermicro, VAST Data and WekaIO.

Quantum-2, ConnectX-7 and BlueField-3
The NVIDIA Quantum-2 platform provides two networking end-point options, the NVIDIA ConnectX-7 NIC and NVIDIA BlueField-3 DPU InfiniBand.

ConnectX-7, with 8 billion transistors in a 7-nanometer design, doubles the data rate of the world's current leading HPC networking chip, the NVIDIA ConnectX-6. It also doubles the performance of RDMA, GPUDirect Storage, GPUDirect RDMA and In-Network Computing. The ConnectX-7 will sample in January.

BlueField-3 InfiniBand, with 22 billion transistors in a 7-nanometer design, offers sixteen 64-bit Arm CPUs to offload and isolate the data center infrastructure stack. BlueField-3 samples in May.

 
Joined
Oct 22, 2014
Messages
13,210 (3.81/day)
Location
Sunshine Coast
System Name Black Box
Processor Intel Xeon E3-1260L v5
Motherboard MSI E3 KRAIT Gaming v5
Cooling Tt tower + 120mm Tt fan
Memory G.Skill 16GB 3600 C18
Video Card(s) Asus GTX 970 Mini
Storage Kingston A2000 512Gb NVME
Display(s) AOC 24" Freesync 1m.s. 75Hz
Case Corsair 450D High Air Flow.
Audio Device(s) No need.
Power Supply FSP Aurum 650W
Mouse Yes
Keyboard Of course
Software W10 Pro 64 bit
Adding Quantum to the name doubles the speed. ;)
 
Joined
Jul 9, 2015
Messages
3,413 (1.06/day)
System Name M3401 notebook
Processor 5600H
Motherboard NA
Memory 16GB
Video Card(s) 3050
Storage 500GB SSD
Display(s) 14" OLED screen of the laptop
Software Windows 10
Benchmark Scores 3050 scores good 15-20% lower than average, despite ASUS's claims that it has uber cooling.
Adding Quantum to the name doubles the speed. ;)
Yeah, clearly, they need that, although, even that is not enough:

[Attached image: 1636450539371.png]


I love how the fact that MI200 wipes the floor with NV's best somehow makes it neither into titles nor even into articles.
 
Joined
Jul 9, 2015
Messages
3,413 (1.06/day)
System Name M3401 notebook
Processor 5600H
Motherboard NA
Memory 16GB
Video Card(s) 3050
Storage 500GB SSD
Display(s) 14" OLED screen of the laptop
Software Windows 10
Benchmark Scores 3050 scores good 15-20% lower than average, despite ASUS's claims that it has uber cooling.
this is different produc
Ah.
Like, say, i5 and Ryzen 5 are "different products".
A product so different it made it into marketing slides, for comparison's sake.

Sounds reasonable... :D
 
Joined
Mar 24, 2012
Messages
528 (0.12/day)
Ah.
Like, say, i5 and Ryzen 5 are "different products".
A product so different it made it into marketing slides, for comparison's sake.

Sounds reasonable... :D
Maybe the next time TPU reviews a router, they should also include performance numbers from a Ryzen CPU or Radeon GPU.
 
Joined
Jul 17, 2007
Messages
164 (0.03/day)
Ah.
Like, say, i5 and Ryzen 5 are "different products".
A product so different it made it into marketing slides, for comparison's sake.

Sounds reasonable... :D
No, you don't get it: the article is about a networking interconnect (InfiniBand) that was developed by NVIDIA. It's not a GPU or a computing device.
Yes, the latest CDNA accelerators are better than the latest NVIDIA equivalents, but that's not related to the article.
 