In a joint briefing this week, KAIST's Memory Systems Laboratory and TERA's Interconnection and Packaging group presented a forward-looking roadmap for High Bandwidth Memory (HBM) standards and the accelerator platforms that will employ them. Shared via Wccftech and VideoCardz, the outline covers five successive generations, from HBM4 to HBM8, each promising substantial gains in capacity, bandwidth, and packaging sophistication. First up is HBM4, targeted for a 2026 rollout in AI GPUs and data center accelerators. It will deliver approximately 2 TB/s per stack at an 8 Gbps pin rate over a 2,048-bit interface. Die stacks will reach 12 to 16 layers, yielding 36-48 GB per package within a 75 W power envelope. NVIDIA's upcoming Rubin series and AMD's Instinct MI500 cards are slated to use HBM4, with Rubin Ultra doubling the number of memory stacks from eight to sixteen and AMD targeting up to 432 GB per device.
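Those per-stack throughput figures follow directly from pin rate times interface width. Here is a back-of-the-envelope sketch (the helper name is ours, not the roadmap's; the quoted numbers line up exactly if 1 TB is read as 1,024 GB):

```python
def stack_bandwidth_tbps(pin_gbps: float, bus_bits: int) -> float:
    """Per-stack bandwidth from pin rate (Gb/s) and bus width (bits)."""
    gbytes_per_s = pin_gbps * bus_bits / 8  # Gb/s across the bus -> GB/s
    return gbytes_per_s / 1024              # GB/s -> TB/s (1 TB = 1,024 GB)

print(stack_bandwidth_tbps(8, 2048))  # HBM4: 2.0 TB/s, matching the ~2 TB/s claim
```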
Looking to 2029, HBM5 keeps the 8 Gbps pin speed but doubles the I/O lanes to 4,096 bits, boosting throughput to 4 TB/s per stack. Power rises to 100 W, and capacity scales to 80 GB using 16-high stacks of 40 Gb dies. NVIDIA's tentatively named Feynman accelerator is expected to be the first HBM5 adopter, packing 400-500 GB of memory into a multi-die package and drawing more than 4,400 W of total power. By 2032, HBM6 will double pin speeds to 16 Gbps and lift bandwidth to 8 TB/s over the same 4,096 lanes. Stack heights can grow to 20 layers, supporting up to 120 GB per stack at 120 W, while immersion cooling and bumpless copper-copper bonding become the norm. The roadmap then pencils in HBM7 for 2035, with 24 Gbps pin speeds, 8,192-bit interfaces, 24 TB/s of throughput, and up to 192 GB per stack at 160 W. NVIDIA is reportedly preparing a 15,360 W accelerator to accommodate this monstrous memory.
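The same arithmetic holds for the later generations, and the capacity figures fall out of stack height times die density. A minimal check using the roadmap's own numbers:

```python
# Bandwidth: pin rate (Gb/s) x bus width (bits) -> TB/s (1 TB = 1,024 GB)
for name, pin_gbps, bus_bits in [("HBM5", 8, 4096),
                                 ("HBM6", 16, 4096),
                                 ("HBM7", 24, 8192)]:
    print(name, pin_gbps * bus_bits / 8 / 1024, "TB/s")  # 4.0, 8.0, 24.0

# Capacity: a 16-high stack of 40 Gb dies -> GB per stack
print(16 * 40 / 8, "GB")  # 80.0 GB, matching the HBM5 figure
```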
Last but not least, HBM8 arrives by 2038 with 32 Gbps pin speeds, 16,384 I/O lanes, 64 TB/s of bandwidth, 200-240 GB stacks, and embedded cooling at 180 W. Beyond pure DRAM, the briefing also teased hybrid HBM-HBF (High-Bandwidth Flash) architectures to meet the demands of large-scale AI inference, pairing NAND-based flash with DRAM stacks through ultra-fast through-silicon vias. With accelerators evolving toward petabyte-class bandwidth and multi-chiplet designs, these emerging HBM and HBF standards are the only way forward.
View at TechPowerUp Main Site | Source