
TSMC N3 Nodes Show SRAM Scaling is Hitting the Wall

AleksandarK

News Editor
When TSMC introduced its N3 lineup of nodes, the company only talked about the logic scaling of the two new semiconductor manufacturing processes. It turns out there was a reason for that: WikiChip confirms that the SRAM bit cells of N3 nodes are almost identical to the SRAM bit cells of N5 nodes. At the TSMC 2023 Technology Symposium, TSMC presented additional details about its N3 node lineup, including logic and SRAM density. For starters, N3 is TSMC's "3 nm" node family, which has two products: a Base N3 node (N3B) and an Enhanced N3 node (N3E). The base N3B uses a (for TSMC) new self-aligned contact (SAC) scheme, which Intel introduced back in 2011 with its 22 nm node, and which improves the node's yield.

Despite N3's logic density improvements over the "last-generation" N5, its SRAM density is almost identical. Initially, TSMC claimed N3B's SRAM density was 1.2x that of the N5 process. However, recent information shows the actual density improvement is merely about 5%. With SRAM taking a large portion of a processor's transistor and area budget, N3B's soaring manufacturing costs are harder to justify when there is almost no area improvement. For some time, SRAM scaling lagged behind logic scaling; now the two have decoupled almost completely.
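The ~5% figure can be sanity-checked from the high-density SRAM bit-cell areas reported at the symposium (the exact values below are taken as assumptions for illustration; see WikiChip's coverage for the precise numbers):

```python
# Rough check of the ~5% density claim using assumed HD SRAM bit-cell areas.
# Values are illustrative, based on publicly reported figures.
n5_cell_um2 = 0.021    # N5 high-density SRAM bit cell, in square microns
n3b_cell_um2 = 0.0199  # N3B high-density SRAM bit cell, in square microns

# A smaller bit cell means more bits per unit area.
density_gain = n5_cell_um2 / n3b_cell_um2 - 1
print(f"N3B SRAM density gain over N5: {density_gain:.1%}")  # roughly 5-6%
```

Compare that with the roughly 1.6x logic density gain TSMC advertises for the same node transition, and the decoupling becomes obvious.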



View at TechPowerUp Main Site | Source
 
When you take the time to look at the graph and realize it's logarithmic on the area axis, it has been a flat line since 5 nm, and even if you include 7 nm, it's still a pretty flat line.
 
How long until the SRAM is on a separate chip entirely (think X3D style) and the logic chip is only cores and interconnect?
 
Backside power delivery, or PowerVia in Intel parlance, should help with SRAM scaling. Nanosheet transistors will also help, but these are all slated for either Intel's 20A node or TSMC's N2P node. These aren't expected to be available until 2024 and 2026 respectively.

How long until the SRAM is on a separate chip entirely (think X3D style) and the logic chip is only cores and interconnect?
That will increase latency of SRAM as off-chip communication is costly in both latency and power. It could only be done with large, last level caches like AMD's LLC for RDNA3. Smaller caches like L1 and L2 will remain on-chip.
 
It's a miracle that there's still some SRAM scaling between 7 nm and 3 nm. ASML's 3000-series (3400 & 3600) lithography scanners both use identical wavelengths.
 
It's a miracle that there's still some SRAM scaling between 7 nm and 3 nm. ASML's 3000-series (3400 & 3600) lithography scanners both use identical wavelengths.
It's not a miracle. The light source is a necessary part of the process, but it doesn't govern the minimum feature size of current processes, which are all greater than 13.5 nm. Besides, N7 doesn't use EUV; it uses light with a wavelength of 193 nm.
 
It's a miracle that there's still some SRAM scaling between 7 nm and 3 nm. ASML's 3000-series (3400 & 3600) lithography scanners both use identical wavelengths.
Any chance of any new scanners having shorter wavelengths then? If we can't go further than that we'll be stuck with the chips only getting tiny improvements.
 
How long until the SRAM is on a separate chip entirely (think X3D style) and the logic chip is only cores and interconnect?
You answered your own question: X3D already brought that.

The first cache to move off-die would be L3; they're not getting the L1/L2 caches off-die. Optical interconnects or another massive in-memory-compute evolution would be necessary to change that, I think.
 
There is no problem with the small size of caches, only a problem with unoptimized software.
For well-optimized software, a few megabytes of cache is sufficient.
In the real world, the working set of most programs isn't defined by their code. Perhaps you have heard of servers that routinely have hundreds of GB of RAM. Do you think they would do fine with CPUs with less than 10 MB of last-level cache?
Yes, N7 doesn't use EUV, but there is more than one "7 nm" variant.
True, but the most popular variant is the one that forgoes EUV.
 
That will increase latency of SRAM as off-chip communication is costly in both latency and power. It could only be done with large, last level caches like AMD's LLC for RDNA3. Smaller caches like L1 and L2 will remain on-chip.
With proper die stacking there is no large latency penalty, heck it might even be lower due to lower distance in z direction compared to x-y.

What is a problem though is heat dissipation, which is why it currently is limited to the LLC of Zen3/4, because of its lower power density compared to the core area.
Still the X3D chips run much hotter due to the structural silicon pieces, but would be even hotter if it was covered with active silicon.
 
With proper die stacking there is no large latency penalty, heck it might even be lower due to lower distance in z direction compared to x-y.

What is a problem though is heat dissipation, which is why it currently is limited to the LLC of Zen3/4, because of its lower power density compared to the core area.
Still the X3D chips run much hotter due to the structural silicon pieces, but would be even hotter if it was covered with active silicon.
I was thinking of non-stacked chips, but you're right; die stacking solves the downsides of off-chip cache, though in its current form it brings new issues too.
 
Any chance of any new scanners having shorter wavelengths then? If we can't go further than that we'll be stuck with the chips only getting tiny improvements.
Yes, the 5000 series. The very first 5000-series scanners have been delivered to Intel. The first 5200 will be delivered in 2024.
 
Any chance of any new scanners having shorter wavelengths then? If we can't go further than that we'll be stuck with the chips only getting tiny improvements.
With all of lithography, converting to a "shorter wavelength" means either an optical improvement (lenses/mirrors) or a new light source. At this point, there aren't many good candidates for a light source below 13.5 nm. Like someone else said in the thread, the ASML EXE platform is the next step on the optics side of things. The platform is also called High NA (numerical aperture), and it essentially allows resolutions down to around 8 nm. The core design of how the light source is generated, however, remains the same as in current EUV tools.

For more information on how these minimum resolutions are calculated, you can look into the Rayleigh criterion, which is basically what governs all of this in terms of minimum critical dimension.
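The Rayleigh criterion can be sketched in a few lines: CD = k1 · λ / NA, where CD is the minimum printable half-pitch, λ the wavelength, NA the numerical aperture, and k1 a process-dependent factor (the k1 value of 0.3 below is an assumption for illustration):

```python
# Rayleigh criterion sketch: minimum critical dimension of a scanner.
# k1 is a process-dependent constant; 0.3 is an assumed, illustrative value.
def critical_dimension(k1, wavelength_nm, na):
    """Minimum printable feature size (half-pitch) in nm: CD = k1 * lambda / NA."""
    return k1 * wavelength_nm / na

# EUV at 13.5 nm with today's 0.33 NA optics vs. High-NA 0.55 optics.
cd_low_na = critical_dimension(0.3, 13.5, 0.33)
cd_high_na = critical_dimension(0.3, 13.5, 0.55)
print(f"0.33 NA EUV: {cd_low_na:.1f} nm")   # around 12 nm
print(f"0.55 NA EUV: {cd_high_na:.1f} nm")  # around 7-8 nm
```

This is why raising NA from 0.33 to 0.55 gets you to roughly 8 nm features without changing the 13.5 nm light source, matching the High-NA discussion above.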
 
Can't the MOSFETs be stacked so the SRAM cell is flipped 90°?
That would describe the CFET (complementary FET), which is a stack of two transistors. Yes, just two. And I'm not sure anyone has produced even an experimental working chip with those.
 
Any chance of any new scanners having shorter wavelengths then? If we can't go further than that we'll be stuck with the chips only getting tiny improvements.
From what I've heard, 13.5 nm is the optimal wavelength for etching current materials, as anything shorter tends to pass through the material rather than reflect/etch.

So it will probably take another massive leap in materials technology to get the next "leap", versus just optimizing 13.5 nm utilization.
 
Smaller caches like L1 and L2 will remain on-chip.
AMD said the stacked L3 chip adds four clock cycles to access latency. Assuming the same were true for L2, it might actually be beneficial if a Zen core could have, for example, 1 MB plus stacked 2 MB of L2 compared to just 1 MB of faster L2.
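That trade-off can be illustrated with a simplified average-memory-access-time model (all hit rates and cycle counts below are hypothetical, chosen only to show when the larger-but-slower cache wins):

```python
# Hypothetical numbers: does a bigger but slower stacked L2 pay off?
# AMAT = hit_rate * hit_latency + (1 - hit_rate) * miss_penalty (simplified).
def amat(hit_rate, hit_latency, miss_penalty):
    """Average memory access time in cycles for a single cache level."""
    return hit_rate * hit_latency + (1 - hit_rate) * miss_penalty

# 1 MB on-die L2: fast but misses more often (assumed 80% hit rate, 14 cycles).
small_fast = amat(0.80, 14, 50)
# 3 MB partly stacked L2: +4 cycles, but higher hit rate (assumed 92%).
large_stacked = amat(0.92, 18, 50)
print(f"small fast L2:    {small_fast:.2f} cycles")
print(f"large stacked L2: {large_stacked:.2f} cycles")
```

Under these assumed numbers the stacked configuration comes out ahead; whether it does in practice depends entirely on how much the extra capacity actually raises the hit rate for real workloads.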
 


L1 and L2 are nothing compared to the vast expanse of L3.

What seems likely is a "blank area" where the L3 sits currently, with interconnects on-chip but no actual transistors. Then the L3, made on a larger node, is laid in the same area but is considerably higher capacity.
 
L1 and L2 are nothing compared to the vast expanse of L3.
What do you mean, nothing? 1 MB of L2 is about one third the size of a slice of L3 (= 4 MB next to each core).
 
What do you mean, nothing? 1 MB of L2 is about one third the size of a slice of L3 (= 4 MB next to each core).
You have 4X the L3 as L2, and that is on Zen 4. I understand that L3 sizes are going to increase again pretty soon.
 