• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

NVIDIA Blackwell Platform Pushes the Boundaries of Scientific Computing

GFreeman

News Editor
Staff member
Joined
Mar 6, 2023
Messages
1,181 (2.65/day)
Quantum computing. Drug discovery. Fusion energy. Scientific computing and physics-based simulations are poised to make giant steps across domains that benefit humanity as advances in accelerated computing and AI drive the world's next big breakthroughs. NVIDIA unveiled at GTC in March the NVIDIA Blackwell platform, which promises generative AI on trillion-parameter large language models (LLMs) at up to 25x less cost and energy consumption than the NVIDIA Hopper architecture.

Blackwell has powerful implications for AI workloads, and its technology capabilities can also help to deliver discoveries across all types of scientific computing applications, including traditional numerical simulation. By reducing energy costs, accelerated computing and AI drive sustainable computing. Many scientific computing applications already benefit. Weather can be simulated at 200x lower cost and with 300x less energy, while digital twin simulations have 65x lower cost and 58x less energy consumption versus traditional CPU-based systems and others.



Multiplying Scientific Computing Simulations With Blackwell
Scientific computing and physics-based simulation often rely on what's known as double-precision formats, or FP64 (floating point), to solve problems. Blackwell GPUs deliver 30% more FP64 and FP32 FMA (fused multiply-add) performance than Hopper.

Physics-based simulations are critical to product design and development. From planes and trains to bridges, silicon chips and pharmaceuticals - testing and improving products in simulation saves researchers and developers billions of dollars.

Today application-specific integrated circuits (ASICs) are designed almost exclusively on CPUs in a long and complex workflow, including analog analysis to identify voltages and currents.

But that's changing. The Cadence SpectreX simulator is one example of an analog circuit design solver. SpectreX circuit simulations are projected to run 13x quicker on a GB200 Grace Blackwell Superchip - which connects Blackwell GPUs and Grace CPUs - than on a traditional CPU.

Also, GPU-accelerated computational fluid dynamics, or CFD, has become a key tool. Engineers and equipment designers use it to predict the behavior of designs. Cadence Fidelity runs CFD simulations that are projected to run as much as 22x faster on GB200 systems than on traditional CPU-powered systems. With parallel scalability and 30 TB of memory per GB200 NVL72 rack, it's possible to capture flow details like never before.

In another application, Cadence Reality's digital twin software can be used to create a virtual replica of a physical data center, including all its components - servers, cooling systems and power supplies. Such a virtual model allows engineers to test different configurations and scenarios before implementing them in the real world, saving time and costs.

Cadence Reality's magic happens from physics-based algorithms that can simulate how heat, airflow and power usage affect data centers. This helps engineers and data center operators to more effectively manage capacity, predict potential operational problems and make informed decisions to optimize the layout and operation of the data center for improved efficiency and capacity utilization. With Blackwell GPUs, these simulations are projected to run up to 30x faster than with CPUs, offering accelerated timelines and higher energy efficiency.

AI for Scientific Computing
New Blackwell accelerators and networking will deliver leaps in performance for advanced simulation.

The NVIDIA GB200 kicks off a new era for high-performance computing (HPC). Its architecture sports a second-generation transformer engine optimized to accelerate inference workloads for LLMs.

This delivers a 30x speedup on resource-intensive applications like the 1.8-trillion-parameter GPT-MoE (generative pretrained transformer-mixture of experts) model compared to the H100 generation, unlocking new possibilities for HPC. By enabling LLMs to process and decipher vast amounts of scientific data, HPC applications can sooner reach valuable insights that can accelerate scientific discovery.

Sandia National Laboratories is building an LLM copilot for parallel programming. Traditional AI can generate basic serial computing code efficiently, but when it comes to parallel computing code for HPC applications, LLMs can falter. Sandia researchers are tackling this issue head-on with an ambitious project - automatically generating parallel code in Kokkos, a specialized programming language designed by multiple national labs for running tasks across tens of thousands of processors in the world's most powerful supercomputers.

Sandia is using an AI technique known as retrieval-augmented generation, or RAG, which combines information-retrieval capabilities with language generation models. The team is creating a Kokkos database and integrating it with AI models using RAG.

Initial results are promising. Different RAG approaches from Sandia have demonstrated autonomously generated Kokkos code for parallel computing applications. By overcoming hurdles in AI-based parallel code generation, Sandia aims to unlock new possibilities in HPC across leading supercomputing facilities worldwide. Other examples include renewables research, climate science and drug discovery.

Driving Quantum Computing Advances
Quantum computing unlocks a time machine trip for fusion energy, climate research, drug discovery and many more areas. So researchers are hard at work simulating future quantum computers on NVIDIA GPU-based systems and software to develop and test quantum algorithms faster than ever.

The NVIDIA CUDA-Q platform enables both simulation of quantum computers and hybrid application development with a unified programming model for CPUs, GPUs and QPUs (quantum processing units) working together.

CUDA-Q is speeding simulations in chemistry workflows for BASF, high-energy and nuclear physics for Stony Brook and quantum chemistry for NERSC.

NVIDIA Blackwell architecture will help drive quantum simulations to new heights. Utilizing the latest NVIDIA NVLink multi-node interconnect technology helps shuttle data faster for speedup benefits to quantum simulations.

Accelerating Data Analytics for Scientific Breakthroughs
Data processing with RAPIDS is popular for scientific computing. Blackwell introduces a hardware decompression engine to decompress compressed data and speed up analytics in RAPIDS.

The decompression engine provides performance improvements up to 800 GB/s and enables Grace Blackwell to perform 18x faster than CPUs - on Sapphire Rapids - and 6x faster than NVIDIA H100 Tensor Core GPUs for query benchmarks.

Rocketing data transfers with 8 TB/s of high-memory bandwidth and the Grace CPU high-speed NVLink Chip-to-Chip (C2C) interconnect, the engine speeds up the entire process of database queries. Yielding top-notch performance across data analytics and data science use cases, Blackwell speeds data insights and reduces costs.

Driving Extreme Performance for Scientific Computing with NVIDIA Networking
The NVIDIA Quantum-X800 InfiniBand networking platform offers the highest throughput for scientific computing infrastructure.

It includes NVIDIA Quantum Q3400 and Q3200 switches and the NVIDIA ConnectX-8 SuperNIC, together hitting twice the bandwidth of the prior generation. The Q3400 platform offers 5x higher bandwidth capacity and 14.4Tflops of in-network computing with NVIDIA's scalable hierarchical aggregation and reduction protocol (SHARPv4), providing a 9x increase compared with the prior generation.

The performance leap and power efficiency translates to significant reductions in workload completion time and energy consumption for scientific computing.

View at TechPowerUp Main Site | Source
 
Joined
Nov 27, 2023
Messages
1,196 (6.68/day)
System Name The Workhorse
Processor AMD Ryzen R9 5900X
Motherboard Gigabyte Aorus B550 Pro
Cooling CPU - Noctua NH-D15S Case - 3 Noctua NF-A14 PWM at the bottom, 2 Fractal Design 180mm at the front
Memory GSkill Trident Z 3200CL14
Video Card(s) NVidia GTX 1070 MSI QuickSilver
Storage Adata SX8200Pro
Display(s) LG 32GK850G
Case Fractal Design Torrent
Audio Device(s) FiiO E-10K DAC/Amp, Samson Meteorite USB Microphone
Power Supply Corsair RMx850 (2018)
Mouse Razer Viper (Original)
Keyboard Cooler Master QuickFire Rapid TKL keyboard (Cherry MX Black)
Software Windows 11 Pro (23H2)
“Quantum computing. Drug discovery. Fusion energy. Scientific computing and physics-based simulations are poised to make giant steps across domains that benefit humanity as advances in accelerated computing and AI drive the world's next big breakthroughs.”

Nice start already with all the best buzzwords.

“Quantum computing unlocks a time machine trip for fusion energy, climate research, drug discovery and many more areas.”

That’s a claim and a half, what? I am reading through this press release and it’s half fairly serious decent stuff and half just unhinged tech-bro nonsense. I don’t remember NV science and enterprise targeted press releases/articles being this bad before this gen, were they?
 
Joined
May 11, 2018
Messages
1,003 (0.45/day)
“I am reading through this press release and it’s half fairly serious decent stuff and half just unhinged tech-bro nonsense. I don’t remember NV science and enterprise targeted press releases/articles being this bad before this gen, were they?

There are no bounds to what AI could possibly do for you! So you can get as poetic, hyperbolic, grandiose as your imagination can take you!

Or better yet, give AI the task to write press releases about how good it is! :p
 
Joined
May 13, 2008
Messages
696 (0.12/day)
System Name HTPC whhaaaat?
Processor 2600k @ 4500mhz
Motherboard Asus Maximus IV gene-z gen3
Cooling Noctua NH-C14
Memory Gskill Ripjaw 2x4gb
Video Card(s) EVGA 1080 FTW @ 2037/11016
Storage 2x512GB MX100/1x Agility 3 128gb ssds, Seagate 3TB HDD
Display(s) Vizio P 65'' 4k tv
Case Lian Li pc-c50b
Audio Device(s) Denon 3311
Power Supply Corsair 620HX
M(s/r/rs) news person,

Are you going to post an article about this?


I mean, it *could* be a lot of things; a snapdragon X competitor...nvidia taking over the PC industry by proxy...etc, but...I think it likely also pertains to the next evolution of (something that could be rebranded/used as the next) Pentonic.

That's actually a really, really, big deal...either way you slice it. Mediatek stuff is in SOOOO many things. Even if this is $300, I would imagine they are planning for it to be a scalable platform with different uses (eventually).

If it's not that (used for Pentonic), than this, coupled with their existing partnership with nvidia for auto, likely means that WILL occur.

So yeah, your next TV might actually use DLSS (if not Mediatek-flavored scaling powered by similar units). That's fascinating to me...and probably long overdue.
 
Joined
Mar 16, 2017
Messages
1,738 (0.66/day)
Location
Tanagra
System Name Budget Box
Processor Xeon E5-2667v2
Motherboard ASUS P9X79 Pro
Cooling Some cheap tower cooler, I dunno
Memory 32GB 1866-DDR3 ECC
Video Card(s) XFX RX 5600XT
Storage WD NVME 1GB
Display(s) ASUS Pro Art 27"
Case Antec P7 Neo
They need to check that presser for AI hallucinations. It’s been watching too many Star Trek and Doctor Who episodes.
 
Joined
May 3, 2018
Messages
2,373 (1.07/day)
“Quantum computing. Drug discovery. Fusion energy. Scientific computing and physics-based simulations are poised to make giant steps across domains that benefit humanity as advances in accelerated computing and AI drive the world's next big breakthroughs.”

Nice start already with all the best buzzwords.

“Quantum computing unlocks a time machine trip for fusion energy, climate research, drug discovery and many more areas.”

That’s a claim and a half, what? I am reading through this press release and it’s half fairly serious decent stuff and half just unhinged tech-bro nonsense. I don’t remember NV science and enterprise targeted press releases/articles being this bad before this gen, were they?
Quantum computing is definitely being pursued in these spaces, it's not just hype like consumer AI (as opposed to scientific, medical etc applications of AI). This article provides a broad overview:

Quantum computing for climate

Quantum computing is particularity powerful for simulating the underlying chemical reactions in the atmosphere. Also apparently being used to improve battery technology.

There is a world of scientific literature on this stuff, it's not mere press release.
 
Joined
Sep 17, 2014
Messages
21,123 (5.97/day)
Location
The Washing Machine
Processor i7 8700k 4.6Ghz @ 1.24V
Motherboard AsRock Fatal1ty K6 Z370
Cooling beQuiet! Dark Rock Pro 3
Memory 16GB Corsair Vengeance LPX 3200/C16
Video Card(s) ASRock RX7900XT Phantom Gaming
Storage Samsung 850 EVO 1TB + Samsung 830 256GB + Crucial BX100 250GB + Toshiba 1TB HDD
Display(s) Gigabyte G34QWC (3440x1440)
Case Fractal Design Define R5
Audio Device(s) Harman Kardon AVR137 + 2.1
Power Supply EVGA Supernova G2 750W
Mouse XTRFY M42
Keyboard Lenovo Thinkpad Trackpoint II
Software W10 x64
Quantum computing is definitely being pursued in these spaces, it's not just hype like consumer AI (as opposed to scientific, medical etc applications of AI). This article provides a broad overview:

Quantum computing for climate

Quantum computing is particularity powerful for simulating the underlying chemical reactions in the atmosphere. Also apparently being used to improve battery technology.

There is a world of scientific literature on this stuff, it's not mere press release.
Sure but it requires a quantum computer, not an Nvidia GPU. We got this far without Blackwell. So its utter BS.
 
Joined
May 3, 2018
Messages
2,373 (1.07/day)
Sure but it requires a quantum computer, not an Nvidia GPU. We got this far without Blackwell. So its utter BS.
I was replying to the OP's comment which was about Quantum computing, not Blackwell.

A lot more people will have access to Blackwell hardware than quantum computing hardware, so for most it's more relevant.
 
Top