
Inflection AI Builds Supercomputer with 22,000 NVIDIA H100 GPUs

AleksandarK

News Editor
Staff member
The AI hype continues to drive hardware shipments, especially of GPU-equipped servers, which are in very high demand. The latest example comes from AI startup Inflection AI. Building foundational AI models, the Inflection AI team has secured an order of 22,000 NVIDIA H100 GPUs and built a supercomputer with them. Assuming a configuration of a single Intel Xeon CPU host with eight GPUs, and four such nodes per rack, almost 700 racks should go into the supercomputer. Scaling and interconnecting 22,000 GPUs is the easier part; acquiring them is harder, as NVIDIA's H100 GPUs are selling out everywhere due to the enormous demand for AI applications both on and off premises.

Securing 22,000 H100 GPUs is the biggest challenge here, and Inflection AI managed it by having NVIDIA itself as an investor in the startup. The supercomputer is estimated to cost around one billion USD and to consume 31 megawatts of power. At the time of writing, the Inflection AI startup is valued at 1.5 billion USD.
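Taking the article's numbers at face value, the sizing works out as a quick back-of-envelope calculation. This is only a sketch: the eight-GPUs-per-node and four-nodes-per-rack figures are the article's assumptions, not confirmed details of the build.

```python
# Back-of-envelope sizing for the Inflection AI cluster,
# using the assumptions stated in the article above.
GPUS = 22_000
GPUS_PER_NODE = 8      # one Xeon host driving eight H100s (assumed)
NODES_PER_RACK = 4     # assumed rack density

nodes = GPUS / GPUS_PER_NODE    # 2,750 nodes
racks = nodes / NODES_PER_RACK  # 687.5 racks, i.e. "almost 700"

# Power: 31 MW total for the installation.
WATTS_TOTAL = 31_000_000
watts_per_gpu = WATTS_TOTAL / GPUS  # ~1.4 kW per GPU slot

print(f"{nodes:.0f} nodes, {racks:.1f} racks, {watts_per_gpu:.0f} W per GPU")
```

The roughly 1.4 kW per GPU implied by the 31 MW figure is about double an SXM H100's ~700 W rating, which is plausible once host CPUs, networking, and cooling overhead are included.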



View at TechPowerUp Main Site | Source
 
Just when we are getting over a GPU shortage...
 
I don't understand the scaling assumption of 1:8.

If a retail board like this MSI B360-F PRO can do 18 GPUs, then a bespoke specialist Xeon board could easily do many more. After all, the Intel Xeon W-3400 series has 112 PCIe lanes and could therefore run 112 GPUs; let's call it 100.
 
I don't understand the scaling assumption of 1:8.

If a retail board like this MSI B360-F PRO can do 18 GPUs, then a bespoke specialist Xeon board could easily do many more. After all, the Intel Xeon W-3400 series has 112 PCIe lanes and could therefore run 112 GPUs; let's call it 100.
"Can do"? For mining, sure, and with an i3 CPU at that.
Here, huge amounts of data have to move to and from storage, and part of the processing also takes place on the CPUs. Monster compute nodes with eight GPU accelerators and twin Xeons or Epycs aren't uncommon. One variant of the MI300 is going to have as many as 24 CPU cores in the same package as the GPU, which will enable operation without a separate Epyc; think about how much bandwidth those CPU cores need to communicate with the GPU part.
 
lol I read that as "Infection AI." :laugh:
 
The Inflection AI startup is now valued at 1.5 billion USD at the time of writing.

Assuming $10,000 per GPU, that's $220 million on GPUs alone, to say nothing of datacenter costs, CPUs, RAM, storage...

A valuation of $1.5 billion sounds fair, because it's not much more than the cost of the underlying hardware.
 
So AI has actually achieved an Inflection point.
Maybe we're lucky to have a limited amount of sand and electricity to produce chips, and of course a limited number of TSMCs who can print them.
 
So AI has actually achieved an Inflection point.

An inflection point of venture capitalist money for sure.

For creative use, AI looks like it's here to stay with Photoshop's Generative Fill (https://www.adobe.com/products/photoshop/generative-fill.html). I'm not convinced text is quite ready yet, even with GPT-4. ChatGPT / GPT-4 is good enough to make very annoying spambots, but the hallucinations and confident lying are just awful and make practical use of GPT-4 unworkable in many cases.
 
31 megawatts of juice is an enormous amount of power, and to what end? I wonder if adding this demand drives up the cost of residential power.
 
Assuming $10,000 per GPU, that's $220 million on GPUs alone, to say nothing of datacenter costs, CPUs, RAM, storage...

A valuation of $1.5 billion sounds fair, because it's not much more than the cost of the underlying hardware.
H100s are going for $40,000 each!
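For what it's worth, rerunning the GPU-cost arithmetic at both prices mentioned in this thread (a sketch; neither figure is a confirmed price):

```python
# GPU-cost sensitivity for a 22,000-GPU order at the two per-unit
# prices floated in this thread (forum estimates, not confirmed).
GPUS = 22_000
totals = {price: GPUS * price for price in (10_000, 40_000)}
for price, total in totals.items():
    print(f"${price:,}/GPU -> ${total / 1e6:,.0f}M on GPUs alone")
```

At $40,000 per card, the GPUs alone come to $880 million, which lines up far better with the roughly one-billion-USD system cost reported in the article.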
 