MIT, Stanford Partner Towards Making CPU-Memory BUSes Obsolete

Raevenlord · Jul 10, 2017

Graphene has been hailed for some time now as the next natural successor to silicon, today's most used medium for semiconductor technology. However, even before such more exotic solutions to current semiconductor technology are employed (and we are still way off that future, at least when it comes to mass production), engineers and researchers seem to be increasing their focus in one specific part of computing: internal communication between components.

Typically, communication between a computer's Central Processing Unit (CPU) and a system's memory (usually DRAM) have occurred through a bus, which is essentially a communication highway between data stored in the DRAM, and the data that the CPU needs to process/has just finished processing. The fastest CPU and RAM is still only as fast as the bus, and recent workloads have been increasing the amount of data to be processed (and thus transferred) by orders of magnitude. As such, engineers have been trying to figure out ways of increasing communication speed between the CPU and the memory subsystem, as it is looking increasingly likely that the next bottlenecks in HPC will come not through lack of CPU speed or memory throughput, but from a bottleneck in communication between those two.

The MIT and Stanford researchers' solution? Do away with a bus entirely, by entwining the CPU and memory together so closely that there is no need for performance bottle-necking buses. According to the lead author in the research paper, Max Shulaker, "The RRAM and carbon nanotubes are built vertically over one another, making a new, dense 3-D computer architecture with interleaving layers of logic and memory. By inserting ultradense wires between these layers, this 3-D architecture promises to address the communication bottleneck." As a proof of concept (let's call it PoC for the Mass Effect: Andromeda fans out there), the team has produced a small-scale carbon nanotube (CNT) computer, and get this: it was actually capable of running programs, a basic multitasking operating system, and performing MIPS instructions. The pairing with RRAM (Resistive Random Access Memory) is a feat unto itself, as H.-S. Philip Wong, a co-author of the research, says that "RRAM can be denser, faster, and more energy-efficient compared to DRAM." In-between the logic and memory layers are "ultradense" wires that provide communication, which is "more than an order of magnitude" faster and more energy efficient than silicon.

Granted, the new system likely won't break speed records soon - it ran at 1 kHz. However, researchers behind the paper claim that achieving higher speeds is a more trivial task compared to the actual development of this unit in particular. As they put it, the speed limit "(...) is not due to the limitations of the CNT technology or our design methodology, but instead is caused by capacitive loading introduced by the measurement setup, the 1-mm minimum lithographic feature size possible in our academic fabrication facility, and CNT density and contact resistance." Compare the university's 1-mm minimum litographic feature size they can possibly achieve with their tools with those currently employed by state-of-the-art foundries by the likes of Intel and Samsung, and you're likely to even more respect the scientist's achievement. Even so, "the researchers integrated over 1 million RRAM cells and 2 million carbon nanotube field-effect transistors, making the most complex nanoelectronic system ever made with emerging nanotechnologies."

One of the key benefits of fabricating the new computer chips from graphene stems from the much lower temperatures involved in fabrication - and is what makes such a marriage between a CPU and its memory all but impossible through silicon. Silicon chip creation requires temperatures up to 1,000 degrees Celsius, so it's difficult to make multi-layers and 3D structures without damaging prior layers. However, "carbon nanotube circuits and RRAM memory can be fabricated at much lower temperatures, below 200°C", says Shulaker, which enables the stacking and interlinking of memory and CPU through fine wires.

View at TechPowerUp Main Site

R-T-B · Jul 10, 2017

That's how I feel after reading this. Thanks, Bill.

DeathtoGnomes · Jul 10, 2017

Read something like this a few months ago.

RejZoR · Jul 10, 2017

It's actually odd that on GPU's we can have 512bit bus width between GPU and VRAM and it's not really something that exotic where on CPU's, 256bit is about as maximum as it can get (quad channel is 4x64bit). And this is somewhat exotic since like 90% of people are still on dual channel systems...

Boosnie · Jul 10, 2017

RejZoR said:
It's actually odd that on GPU's we can have 512bit bus width between GPU and VRAM and it's not really something that exotic where on CPU's, 256bit is about as maximum as it can get (quad channel is 4x64bit). And this is somewhat exotic since like 90% of people are still on dual channel systems...

Is it?
Those elements are standardized for CPUs and the satellite elements they connect to.
On a GPU board you can cram in whatever you want, as long as the final I/O is PCI-e compatible and transmissible.
So I think it's not very odd that on a GPU you can have 2048 bus while on a CPU you still get 256.
That said, there are also other factors to account for, like path density on the MoBo and such.

sweet · Jul 10, 2017

1 mm in 2017 LUL.

Joke aside, it's quite impressive in this state of tech development.

seronx · Jul 10, 2017

but, can it run Pong?

FordGT90Concept · Jul 10, 2017

1mm is terrible. Assuming they fix that problem, how expensive are they to mass produce? Nothing has replaced the classic silicon wafers yet because nothing has been both an upgrade in performance and as cheap to mass produce.

Brusfantomet · Jul 10, 2017

RejZoR said:
It's actually odd that on GPU's we can have 512bit bus width between GPU and VRAM and it's not really something that exotic where on CPU's, 256bit is about as maximum as it can get (quad channel is 4x64bit). And this is somewhat exotic since like 90% of people are still on dual channel systems...

Well, if you look at a Hawaii based chip you will notice the insane PCB AMD was using.

Trace length is a limiting factor, and together with the slot with for DDR3 and DDR4 you get longer traces on the edges, and since DDR still is a parallel interface the shorter traces will need to be elongated so they are the same length. This is simpler when the interfaces are smaller chips you can place around the GPU.

The slot for DDR3 and DDR4 is also not as good as a soldered connection.

Another reason is that one does not need the super wide bus. A 290x has a wider bus then a GTX 1080, as a i7 920 has a wider bus then a i7 7700k. On modern CPUs the latency is just as important as the memory bandwidth, and where the bandwidth has had a explosive growth the last decade, the latency has not improved nearly as much, witch is why we have CPUs with L3 and some with L4 now.

So in summary, because its difficult and the gains are not that big, that is why we are on 128 to 256 bit buses for consumer CPUs.

FreedomEclipse · Jul 10, 2017

R-T-B said:
That's how I feel after reading this. Thanks, Bill.

puwoIEwbiYUWZ4AS2D51wE5BLrtb9IWG-BMC7_BCqOwizPQCJ0Z7-g9id2rbNriU5z2sAh4nYw=w136-h106-no

yogurt_21 · Jul 10, 2017

so are they talking an all in one memory/cpu solution? That's what I get from the stacking anyway.

PowerPC · Jul 10, 2017

So like HBM combined memory closer with the processing part.

System Name	The Ryzening
Processor	AMD Ryzen 9 5900X
Motherboard	MSI X570 MAG TOMAHAWK
Cooling	Lian Li Galahad 360mm AIO
Memory	32 GB G.Skill Trident Z F4-3733 (4x 8 GB)
Video Card(s)	Gigabyte RTX 3070 Ti
Storage	Boot: Transcend MTE220S 2TB, Kintson A2000 1TB, Seagate Firewolf Pro 14 TB
Display(s)	Acer Nitro VG270UP (1440p 144 Hz IPS)
Case	Lian Li O11DX Dynamic White
Audio Device(s)	iFi Audio Zen DAC
Power Supply	Seasonic Focus+ 750 W
Mouse	Cooler Master Masterkeys Lite L
Keyboard	Cooler Master Masterkeys Lite L
Software	Windows 10 x64

System Name	Pioneer
Processor	Ryzen 9 9950X
Motherboard	MSI MAG X670E Tomahawk Wifi
Cooling	Noctua NH-D15 + A whole lotta Sunon, Phanteks and Corsair Maglev blower fans...
Memory	128GB (4x 32GB) G.Skill Flare X5 @ DDR5-4200(Running 1:1:1 w/FCLK)
Video Card(s)	XFX RX 7900 XTX Speedster Merc 310
Storage	Intel 5800X Optane 800GB boot, +2x Crucial P5 Plus 2TB PCIe 4.0 NVMe SSDs, 1x 2TB Seagate Exos 3.5"
Display(s)	55" LG 55" B9 OLED 4K Display
Case	Thermaltake Core X31
Audio Device(s)	TOSLINK->Schiit Modi MB->Asgard 2 DAC Amp->AKG Pro K712 Headphones or HDMI->B9 OLED
Power Supply	FSP Hydro Ti Pro 850W
Mouse	Logitech G305 Lightspeed Wireless
Keyboard	WASD Code v3 with Cherry Green keyswitches + PBT DS keycaps
Software	Gentoo Linux x64, other office machines run Windows 11 Enterprise

System Name	Dumbass
Processor	AMD Ryzen 7800X3D
Motherboard	ASUS TUF gaming B650
Cooling	Artic Liquid Freezer 2 - 420mm
Memory	G.Skill Sniper 32gb DDR5 6000
Video Card(s)	GreenTeam 4070 ti super 16gb
Storage	Samsung EVO 500gb & 1Tb, 2tb HDD, 500gb WD Black
Display(s)	1x Nixeus NX_EDG27, 2x Dell S2440L (16:9)
Case	Phanteks Enthoo Primo w/8 140mm SP Fans
Audio Device(s)	onboard (realtek?) - SPKRS:Logitech Z623 200w 2.1
Power Supply	Corsair HX1000i
Mouse	Steeseries Esports Wireless
Keyboard	Corsair K100
Software	windows 10 H
Benchmark Scores	https://i.imgur.com/aoz3vWY.jpg?2

System Name	Dark Monolith
Processor	AMD Ryzen 7 5800X3D
Motherboard	ASUS Strix X570-E
Cooling	Arctic Cooling Freezer II 240mm + 2x SilentWings 3 120mm
Memory	64 GB G.Skill Ripjaws V Black
Video Card(s)	XFX Radeon RX 9070 XT Mercury OC Magnetic Air
Storage	Seagate Firecuda 530 4 TB SSD + Samsung 850 Pro 2 TB SSD + Seagate Barracuda 8 TB HDD
Display(s)	ASUS ROG Swift PG27AQDM 240Hz OLED
Case	Silverstone Kublai KL-07
Audio Device(s)	Sound Blaster AE-9 MUSES Edition + Altec Lansing MX5021 2.1 Nichicon Gold
Power Supply	BeQuiet DarkPower 11 Pro 750W
Mouse	Logitech G502 Proteus Spectrum
Keyboard	UVI Pride MechaOptical
Software	Windows 11 Pro

System Name	SolarwindMobile
Processor	AMD FX-9800P RADEON R7, 12 COMPUTE CORES 4C+8G
Motherboard	Acer Wasp_BR
Cooling	It's Copper.
Memory	2 x 8GB SK Hynix/HMA41GS6AFR8N-TF
Video Card(s)	ATI/AMD Radeon R7 Series (Bristol Ridge FP4) [ACER]
Storage	TOSHIBA MQ01ABD100 1TB + KINGSTON RBU-SNS8152S3128GG2 128 GB
Display(s)	ViewSonic XG2401 SERIES
Case	Acer Aspire E5-553G
Audio Device(s)	Realtek ALC255
Power Supply	PANASONIC AS16A5K
Mouse	SteelSeries Rival
Keyboard	Ducky Channel Shine 3
Software	Windows 10 Home 64-bit (Version 1607, Build 14393.969)

MIT, Stanford Partner Towards Making CPU-Memory BUSes Obsolete

Raevenlord

News Editor

R-T-B

DeathtoGnomes

RejZoR

Boosnie

sweet

seronx

FordGT90Concept

"I go fast!1!11!1!"

Brusfantomet

FreedomEclipse

~Technological Technocrat~

yogurt_21

PowerPC

System Name	BY-2021
Processor	AMD Ryzen 7 5800X (65w eco profile)
Motherboard	MSI B550 Gaming Plus
Cooling	Scythe Mugen (rev 5)
Memory	2 x Kingston HyperX DDR4-3200 32 GiB
Video Card(s)	AMD Radeon RX 7900 XT
Storage	Samsung 980 Pro, Seagate Exos X20 TB 7200 RPM
Display(s)	Nixeus NX-EDG274K (3840x2160@144 DP) + Samsung SyncMaster 906BW (1440x900@60 HDMI-DVI)
Case	Coolermaster HAF 932 w/ USB 3.0 5.25" bay + USB 3.2 (A+C) 3.5" bay
Audio Device(s)	Realtek ALC1150, Micca OriGen+
Power Supply	Enermax Platimax 850w
Mouse	Nixeus REVEL-X
Keyboard	Tesoro Excalibur
Software	Windows 10 Home 64-bit
Benchmark Scores	Faster than the tortoise; slower than the hare.

System Name	Games/internet/usage
Processor	I7 5820k 4.2 Ghz
Motherboard	ASUS X99-A2
Cooling	custom water loop for cpu and gpu
Memory	16GiB Crucial Ballistix Sport 2666 MHz
Video Card(s)	Radeon Rx 6800 XT
Storage	Samsung XP941 500 GB + 1 TB SSD
Display(s)	Dell 3008WFP
Case	Caselabs Magnum M8
Audio Device(s)	Shiit Modi 2 Uber -> Matrix m-stage -> HD650
Power Supply	beQuiet dark power pro 1200W
Mouse	Logitech MX518
Keyboard	Corsair K95 RGB
Software	Win 10 Pro

System Name	WorkInProgress
Processor	AMD 7800X3D
Motherboard	MSI X670E GAMING PLUS
Cooling	Thermalright AM5 Contact Frame + Phantom Spirit 120SE
Memory	2x32GB G.Skill Trident Z5 NEO DDR5 6000 CL32
Video Card(s)	Asus Dual Radeon™ RX 6700 XT OC Edition
Storage	WD SN770 1TB (Boot)\|1x WD SN850X 8TB (Gaming)\| 2x2TB WD SN770\| 2x2TB+2x4TB Crucial BX500
Display(s)	LG GP850-B
Case	Corsair 760T (White) {1xCorsair ML120 Pro\|5xML140 Pro}
Audio Device(s)	Yamaha RX-V573\|Speakers: JBL Control One\|Auna 300-CN\|Wharfedale Diamond SW150
Power Supply	Seasonic Focus GX-850 80+ GOLD
Mouse	Logitech G502 X
Keyboard	Cherry G80-3000N (TKL)
Software	Windows 11 Home
Benchmark Scores	ლ(ಠ益ಠ)ლ

System Name	Thought I'd be done with this by now
Processor	i7 11700k 8/16
Motherboard	MSI Z590 Pro Wifi
Cooling	Be Quiet Dark Rock Pro 4, 9x aigo AR12
Memory	32GB GSkill TridentZ Neo DDR4-4000 CL18-22-22-42
Video Card(s)	MSI Ventus 2x Geforce RTX 3070
Storage	1TB MX300 M.2 OS + Games, + cloud mostly
Display(s)	Samsung 40" 4k (TV)
Case	Lian Li PC-011 Dynamic EVO Black
Audio Device(s)	onboard HD -> Yamaha 5.1
Power Supply	EVGA 850 GQ
Mouse	Logitech wireless
Keyboard	same
VR HMD	nah
Software	Windows 10
Benchmark Scores	no one cares anymore lols