• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.
  • The forums have been upgraded with support for dark mode. By default it will follow the setting on your system/browser. You may override it by scrolling to the end of the page and clicking the gears icon.

Intel 288 E-core Xeon "Sierra Forest" Out to Eat AMD EPYC Bergamo's Lunch

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
47,854 (7.38/day)
Location
Dublin, Ireland
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard Gigabyte B550 AORUS Elite V2
Cooling DeepCool Gammax L240 V2
Memory 2x 16GB DDR4-3200
Video Card(s) Galax RTX 4070 Ti EX
Storage Samsung 990 1TB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
Intel at the 2023 InnovatiON event unveiled a 288-core extreme core-count variant of the Xeon "Sierra Forest" processor for high-density servers for scale-out, cloud-native environments. It succeeds the current 144-core model. "Sierra Forest" is a server processor based entirely on efficiency cores, or E-cores, based on the "Sierra Glen" core microarchitecture, a server-grade derivative of "Crestmont," Intel's second-generation E-core that's making a client debut with "Meteor Lake."

Xeon "Sierra Forest" is a chiplet-based processor, much like "Meteor Lake" and the upcoming "Emerald Rapids" server processor. It features a total of five tiles—two Compute tiles, two I/O tiles, and a base tile (interposer). Each of the two Compute tiles is built on the Intel 3 foundry node, a more advanced node than Intel 4, featuring higher-density libraries, and an undisclosed performance/Watt increase. Each tile has 36 "Sierra Glen" E-core clusters, 108 MB of shared L3 cache, 6-channel (12 sub-channel) DDR5 memory controllers, and Foveros tile-to-tile interfaces.



Each "Sierra Glen" E-core cluster features four CPU cores that share a 4 MB local L2 cache, and a 3 MB segment contributing to the tile's 108 MB L3 cache. Unlike the "Meteor Lake" Compute tile that uses a ringbus to connect its E-core clusters and P-cores, the Compute tile uses a Mesh topology interconnect for the large array of 36 E-core clusters. With 144 cores per tile, in its maximum configuration with three such tiles, "Sierra Forest" achieves 288 cores. "Sierra Glen" lacks SMT, just like "Crestmont," and so the OS only has 288 logical processors to address.


Besides the two Compute tiles, the processor has two I/O tiles. Unlike the similarly named "I/O tile" of the client "Meteor Lake" processor, the ones on "Sierra Forest" serve the functions of both the SoC and I/O PHY. With the memory controllers located on the Compute tiles, in its maximum 288-core variant, "Sierra Forest" features a 12-channel DDR5 memory interface.


The I/O tile is left with the UPI interconnect for 2P servers; application-specific accelerators, a 68-lane PCI-Express Gen 5 root complex that's flexible between PCIe Gen 5 and CXL 2.0, and the I/O Fabric. Despite being based on an advanced node like Intel 3, each of the two Compute tiles is an enormous 578 mm² in die-area, while each of the two I/O tiles is 241 mm².

The up to 12-channel memory interface of "Sierra Forest" comes with native support for ECC DDR5-6400 speed. The accelerators are carried over from the current "Granite Rapids" processor, and provide speed ups for popular cryptography, file-streaming, and and data-compression operations.

When it arrives in the first half of 2024, Xeon "Sierra Forest" will square off against AMD's EPYC "Bergamo" processor. "Bergamo" is based on a slightly different philosophy than "Sierra Forest." It is a 128-core/256-thread processor based on "Zen 4c" cores that don't quite qualify as E-cores, and have an identical IPC to regular "Zen 4" cores, an identical ISA, and SMT.

View at TechPowerUp Main Site | Source
 
288 E-cores yes - not 288 real P cores or half baken cores. AMD still holds crown with the amount of real cores at this point. And AMD has a 192 core 384 thread behemoth upcoming too.
 
Each tile has 36 "Sierra Glen" E-core clusters, 108 MB of shared L3 cache, 6-channel (12 sub-channel) DDR5 memory controllers, and Foveros tile-to-tile interfaces.

The diagram says that DDR controllers is located in the I/O tile


Interesting solution. It might be better at inter-core communication and have better timings overall due to EMIB, but it looks like it will have lower performance than Bergamo. And huge tiles, EMIB, it won't be cheap...
 
And huge tiles, EMIB, it won't be cheap...
That's very probably true. We know quite a few technical details about these advanced packaging technologies, at least at Intel and TSMC. Unfortunately, we get to learn almost nothing about costs, yields, capacity constraints, logistics issues, suppliers other than Intel and TSMC, and more.
 
This is good. AMD and Intel getting at each others necks. We need the GPU side to be this fierce in competition. Not saying that AMD cards are weak by any means, but...
 
Due to lack of SMT, Sierra Forest will need a huge IPC and/or clock increase to compete with Bergamo.

Sierra Forest - 288 threads, little E-cores
Bergamo - 256 threads, BIG Zen 4c cores
 
What is a real core? Please elaborate

And that answers the question how?

:laugh:

A real core is a core which is a non-Atom type, with relatively high IPC.
Those e-core are quite weak, hence the term - not real.
Normally, intel uses these not real cores for background tasks and processes which do not require super high IPC/performance.
 
288 E-cores yes - not 288 real P cores or half baken cores. AMD still holds crown with the amount of real cores at this point. And AMD has a 192 core 384 thread behemoth upcoming too.

tas8zGAlwPGavytU.jpg

Only 144 Cores/socket maximum not 288 Cores/socket.
 
:laugh:

A real core is a core which is a non-Atom type, with relatively high IPC.
Those e-core are quite weak, hence the term - not real.
Normally, intel uses these not real cores for background tasks and processes which do not require super high IPC/performance.
That's your definition of a core?? A non atom type? Okay buddy great definition. I say a real core is a non ryzen type.
 
It'll be interesting to see how the Sierra forest core competes against a Zen5c core (and the rumored 20% IPC inprovement plus whatever improvements come from being on a 3nm node, Zen5 AmD has stated, is like a whole new architecture unlike Zen4). I guess the Bergamo successor will be 192 core/384 threads. I wonder if AMD will ever have a 4 socket platform for Zen dense....it'd seem like a good application to develop one....4x Zen5c Turin 192 core chips making a 768 core/1536 thread system on a single board....imagine that'd be a huge board though with the socket size, haha
 
Last edited:
View attachment 314410
Only 144 Cores/socket maximum not 288 Cores/socket.
Dang, seems like everyone is running with the 288 cores on the chip. Even looking at a picture of the Intel presentation, the Sierra Forrest Demo says "2 processors with 288 cores". The confusion isn't even caused by an asterisk or anything. It's right there in your face.

So really, the one up here I think, iirc Bergamo is only a single socket design.
 
288 E-cores yes - not 288 real P cores or half baken cores. AMD still holds crown with the amount of real cores at this point. And AMD has a 192 core 384 thread behemoth upcoming too.
Even ARM gains more and more server marketshare.

E-cores might be more suitable for servers than fewer P-cores

Depends on workload really.

E-cores are real cores anyway. Just slower but smaller. Depending on tasks E-cores can make more sense. 4 E-cores uses same die space as 1 P-core.
 
E-cores are real cores anyway. Just slower but smaller. Depending on tasks E-cores can make more sense. 4 E-cores uses same die space as 1 P-core.

This means that the e-cores lack important components - like cache, some other units inside them which make them much weaker and slower than the real cores in Ryzen or p-cores in Core-i9.
 
This means that the e-cores lack important components - like cache, some other units inside them which make them much weaker and slower than the real cores in Ryzen or p-cores in Core-i9.
E-cores have cache and which components do they lack?

Even ARM is gaining more and more marketshare in the enterprise markets. Many enterprise workloads prefers alot of cores, speed is not crucial

E-cores and ARM does not have to be much slower, depends on software
 
E-cores have cache and which components do they lack?

Even ARM is gaining more and more marketshare in the enterprise markets. Many enterprise workloads prefers alot of cores, speed is not crucial

E-cores and ARM does not have to be much slower, depends on software
E cores are slower than Zen 4 or Intel's big cores, but they are still fairly fast: approaching Skylake in IPC. For high core count servers, this performance level is adequate. I would have concerns around 12 channels being insufficient for 288 cores, but we'll see. For integer code, the E cores can be more efficient than the P cores if clocked sufficiently low. However, Zen 4 is very power efficient and Intel, despite the new node, will have a difficult time matching its power efficiency. Even Zen 2 beats E cores at performance per watt.

1695221351305.png
 
Zen5 AmD has stated, is like a whole new architecture
I also believe in this. Before years. After reading more i believe only main change is in front end. You know, decoder, encoder, L1 I cache, branch predictor...And few other things.
 
E-cores have cache and which components do they lack?

The components which enable hyper-threading at least. Also, I think I said less cache, not without any cache.

I am not going to dig into the details of the corresponding architectures... though...
 
I also believe in this. Before years. After reading more i believe only main change is in front end. You know, decoder, encoder, L1 I cache, branch predictor...And few other things.
You might be thinking of Zen 4 which is a refined Zen 3. There isn't enough public information about Zen 5 to determine how different it is from Zen 4.
 
Well, they will be eating something, that's for sure! Sure as a load of manure!
 
Dang, seems like everyone is running with the 288 cores on the chip. Even looking at a picture of the Intel presentation, the Sierra Forrest Demo says "2 processors with 288 cores". The confusion isn't even caused by an asterisk or anything. It's right there in your face.

So really, the one up here I think, iirc Bergamo is only a single socket design.

Siena SP5 is 1p only Zen4c cores, Bergamo works in all SP5 boards that take Genoa.
Good catch on the Tom foolery

Sierra is still only 144c/144t per socket.

edit: up to 144c/die, up to 2 dies per socket.
 
Last edited:
Back
Top