• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Toshiba Develops New Bridge Chip Using PAM 4 to Boost SSD Speed and Capacity

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
46,283 (7.69/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
Toshiba Memory Corporation, the world leader in memory solutions, today announced the development of a bridge chip that realizes high-speed and large-capacity SSDs. Using developed bridge chips with a small occupied area and low-power consumption, the company has succeeded in connecting more flash memory chips with fewer high-speed signal lines than with the conventional method of no bridge chips. This result was announced in San Francisco on February 20, at the International Solid-State Circuits Conference 2019 (ISSCC 2019).

In SSDs, multiple flash memory chips are connected to a controller that manages their operation. As more flash memory chips are connected to a controller interface, operating speed degrades, so there are limits to the number of chips that can be connected. In order to increase capacity, it is necessary to increase the number of interfaces, but that results in an enormous number of high-speed signal lines connected to the controller, making it more difficult to implement the wiring on the SSD board.



The company has overcome this problem with the development of a bridge chip that connects the controller and flash memory chips (Fig. 1), three novel techniques: a daisy chains connection including the controller and bridge chips in a ring shape; a serial communication using PAM 4; and a jitter improvement technique for eliminating a PLL circuit in the bridge chips. By using these techniques overhead of the bridge chips is reduced, and it is possible to operate a large number of flash memory chips at high speed with only a few high-speed signal lines (Fig. 2).

The ring-shape configuration of the bridge chips and the controller reduces the number of transceivers required in the bridge chip from two pairs to one pair, it achieves chip area reduction of the bridge chip. In addition, adopting PAM 4 serial communication between the controller and the daisy-chained bridge chips lowers the operating speed in the bridge chips' circuits and relaxes their required performance. A new CDR*5 that utilizes the characteristics of PAM 4 to improve jitter characteristics eliminates the need for a PLL circuit in the bridge chip, which also contributes to a smaller chip area and lower power consumption.

The prototype bridge chips were fabricated with 28 nm CMOS process, and results were evaluated by connecting four bridge chips and a controller in ring-shape daisy chain. This confirmed satisfactory performance of PAM 4 communication by all of the bridge chips and the controller at 25.6 Gbps, and also that it is possible to obtain a BER*6 of less than 10-12.

Moving forward, the company will continue development work toward achieving high-speed, large-capacity storage at levels not yet seen by further enhancing bridge-chip performance while reducing the chip's area and power consumption.

Notes
  • 1 Daisy chain: a connecting scheme in which multiple chips are wired together in sequence
  • 2 PAM 4: 4-level Pulse Amplitude Modulation (it contains a 4-value data)
  • 3 Jitter: Fluctuation in the time domain of the clock or signal waveforms
  • 4 PLL: Phase Locked Loop (a circuit that generates an accurate reference signal)
  • 5 CDR: Clock Data Recovery (a circuit that recovers the data and clock from the received signal)
  • 6 BER: Bit Error Rate (the lower value is the better performance)

View at TechPowerUp Main Site
 
Joined
Sep 17, 2014
Messages
20,781 (5.97/day)
Location
The Washing Machine
Processor i7 8700k 4.6Ghz @ 1.24V
Motherboard AsRock Fatal1ty K6 Z370
Cooling beQuiet! Dark Rock Pro 3
Memory 16GB Corsair Vengeance LPX 3200/C16
Video Card(s) ASRock RX7900XT Phantom Gaming
Storage Samsung 850 EVO 1TB + Samsung 830 256GB + Crucial BX100 250GB + Toshiba 1TB HDD
Display(s) Gigabyte G34QWC (3440x1440)
Case Fractal Design Define R5
Audio Device(s) Harman Kardon AVR137 + 2.1
Power Supply EVGA Supernova G2 750W
Mouse XTRFY M42
Keyboard Lenovo Thinkpad Trackpoint II
Software W10 x64
...decrees operation speed? Painful :D
 

bug

Joined
May 22, 2015
Messages
13,163 (4.07/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
...decrees operation speed? Painful :D
It's not like "configuration to reducing transceiver" is any better.

That aside, what gains are we looking at here? When can we expect them?
 
Joined
Mar 21, 2016
Messages
2,195 (0.75/day)
So like over twice the bandwidth of conventional method with 1/3 the required signal wiring. Wonder if DRAM already does similar or will adopt this type of technique to be applied to it.
 
Joined
Aug 11, 2014
Messages
866 (0.25/day)
Processor ryzen 5 5600x
Motherboard AB350m Pro4
Cooling custom loop
Memory TEAMGROUP T-Force TXKD416G3600HC18ADC01 16gbs XMP
Video Card(s) HP GTX1650 super 4gb
Storage MZVLB256HBHQ-000H1 PM981a (256GB)/3TB HDD
Display(s) Nitro XF243Y Pbmiiprx
Case Rosewill CULLINAN
Audio Device(s) onboard
Power Supply Corsair 750w
Mouse Best Buy Insignia
Keyboard Best Buy Insignia
Software Win 10 pro
this seems very similar to what amd is doing with the controller they are putting on their new ryzens.
 
Joined
Dec 16, 2010
Messages
1,662 (0.34/day)
Location
State College, PA, US
System Name My Surround PC
Processor AMD Ryzen 9 7950X3D
Motherboard ASUS STRIX X670E-F
Cooling Swiftech MCP35X / EK Quantum CPU / Alphacool GPU / XSPC 480mm w/ Corsair Fans
Memory 96GB (2 x 48 GB) G.Skill DDR5-6000 CL30
Video Card(s) MSI NVIDIA GeForce RTX 4090 Suprim X 24GB
Storage WD SN850 2TB, 2 x 512GB Samsung PM981a, 4 x 4TB HGST NAS HDD for Windows Storage Spaces
Display(s) 2 x Viotek GFI27QXA 27" 4K 120Hz + LG UH850 4K 60Hz + HMD
Case NZXT Source 530
Audio Device(s) Sony MDR-7506 / Logitech Z-5500 5.1
Power Supply Corsair RM1000x 1 kW
Mouse Patriot Viper V560
Keyboard Corsair K100
VR HMD HP Reverb G2
Software Windows 11 Pro x64
Benchmark Scores Mellanox ConnectX-3 10 Gb/s Fiber Network Card
So like over twice the bandwidth of conventional method with 1/3 the required signal wiring. Wonder if DRAM already does similar or will adopt this type of technique to be applied to it.
That's basically FB-DIMM, and that failed due to high power consumption and latency. I'd be curious to see how much power this bridge chip consumes because I doubt it will be efficient.
 
Joined
Feb 18, 2005
Messages
5,239 (0.75/day)
Location
Ikenai borderline!
System Name Firelance.
Processor Threadripper 3960X
Motherboard ROG Strix TRX40-E Gaming
Cooling IceGem 360 + 6x Arctic Cooling P12
Memory 8x 16GB Patriot Viper DDR4-3200 CL16
Video Card(s) MSI GeForce RTX 4060 Ti Ventus 2X OC
Storage 2TB WD SN850X (boot), 4TB Crucial P3 (data)
Display(s) 3x AOC Q32E2N (32" 2560x1440 75Hz)
Case Enthoo Pro II Server Edition (Closed Panel) + 6 fans
Power Supply Fractal Design Ion+ 2 Platinum 760W
Mouse Logitech G602
Keyboard Logitech G613
Software Windows 10 Professional x64
That's basically FB-DIMM, and that failed due to high power consumption and latency. I'd be curious to see how much power this bridge chip consumes because I doubt it will be efficient.

If I'm correctly understanding what Toshiba is saying here, this bridge chip works for NAND controller channels in the same way that a PLX bridge works for PCIe lanes, i.e. is a multiplier. Which means that instead of having an 8- or 16-channel controller, you can get away with a 4- or even 2-channel one, with not much loss of performance. And since a controller with fewer channels is far less complex, it's therefore cheaper to manufacture and dissipates less heat, which means that (simpler controller + bridge chip) might have the same power budget as (complex controller).

However, I'm guessing that like PLX chips, this is intended for the higher-end of the market, e.g. allowing 8-channel controllers to address 32 channels of NAND for absurd parallelism and throughput. In such an environment, higher power consumption would be an acceptable tradeoff for massively increased performance - particularly if it allowed a proven 8-channel consumer NAND controller to be reused in an enterprise product.

Either way, finally some innovation in the SSD space.
 
Top