Intel Meteor Lake Technical Deep Dive

W1zzard · Sep 19, 2023

Today Intel is taking the wraps off its Meteor Lake architecture. Our tech preview tells you everything you need to know about Intel's new ideas that will power the company's processors for years to come. Just like AMD, Intel is betting on chiplets, which combine multiple silicon dies into a single CPU to build faster, more energy-efficient designs that are cheaper to manufacture.

Denver · Sep 19, 2023

Wouldn't Meteor Lake be focused on laptops? I don't understand the reason for comparing the iGPU with the desktop version, which only has 256 shaders (768 shaders on the laptop). Bigger numbers to show investors, of course...
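
For reference, the two shader counts quoted above can be reproduced from execution unit (EU) counts. This is only a rough sanity check: it assumes 8 FP32 lanes per Xe-LP EU and the commonly listed 32 EU (desktop) and 96 EU (mobile) Raptor Lake iGPU configurations, none of which come from Intel's Meteor Lake slides.

```python
# Assumption: each Xe-LP execution unit (EU) exposes 8 FP32 lanes ("shaders").
LANES_PER_EU = 8


def shader_count(eus: int) -> int:
    """Return the number of FP32 shader lanes for a given EU count."""
    return eus * LANES_PER_EU


# Assumed EU counts for the top Raptor Lake iGPU configurations.
desktop_eus = 32  # desktop iGPU
mobile_eus = 96   # mobile iGPU

print(shader_count(desktop_eus))  # 256
print(shader_count(mobile_eus))   # 768
```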

AnotherReader · Sep 19, 2023

I like the low power island, which clocks the E cores more sensibly. I was expecting them to use smaller cores than the fairly large Crestmont cores for the low power island. Let's see when it starts shipping. The claims for power efficiency are lower than I expected: more than 20% compared to Raptor Lake mobile. Intel's presentation was rather light on microarchitectural details. The only hard numbers were cache sizes: the L1 instruction cache for the P core has doubled to 64 KB. Other caches stay at the same size as Raptor Lake. It's also interesting that they have opted to use TSMC's N5 rather than their own process for the GPU tile.

Squared · Sep 19, 2023

Maybe there was more buried in the slides, but I feel like very little new information was presented here. I had heard that there was a 6P+8E core configuration plus 2 E-cores in the SoC that would only be used if the CPU tile was powered down. But now I know it's 4 cores on the SoC and they're available for use even when the CPU tile is active. Maybe this was in the slides, but the article only said that the GPU tile uses a TSMC process that is better than what Arc uses. That implies N5, N4, or N3 but nothing more specific, which pretty much confirms what we already surmised. The article says the interposer has some logic, but what is that logic? Is it the rumored L4 cache? And what about the ISA: is AVX-512 supported?

AnotherReader · Sep 19, 2023

Squared said:
Maybe there was more buried in the slides, but I feel like very little new information was presented here. I had heard that there was a 6P+8E core configuration plus 2 E-cores in the SoC that would only be used if the CPU tile was powered down. But now I know it's 4 cores on the SoC and they're available for use even when the CPU tile is active. Maybe this was in the slides, but the article only said that the GPU tile uses a TSMC process that is better than what Arc uses. That implies N5, N4, or N3 but nothing more specific, which pretty much confirms what we already surmised. The article says the interposer has some logic, but what is that logic? Is it the rumored L4 cache? And what about the ISA: is AVX-512 supported?

They only claimed VNNI, which suggests 256-bit AVX.

R0H1T · Sep 19, 2023

AVX-512 is practically dead for Intel, at least from after TGL until now. Don't expect anything like that until AMD also moves to AVX-512 without the double pumping.

AnotherReader · Sep 19, 2023

R0H1T said:
AVX-512 is practically dead for Intel, at least from after TGL until now. Don't expect anything like that until AMD also moves to AVX-512 without the double pumping.

Zen 4 has AVX-512, which means that for the foreseeable future, its successors will support it as well.

Squared · Sep 19, 2023

I'm not really sure what it is for, except I've heard some AI workloads can benefit from it. My reason for wondering is because Golden/Raptor Cove includes it, but Gracemont does not. So I'm wondering if Redwood Cove and Crestmont were actually designed with the same ISA instead of non-shared instructions being disabled.

From AnandTech, it sounds like the GPU tile is made with TSMC N5 and the SoC with TSMC N6.

I wonder if Intel's new reliance on TSMC has to do with node optimization. Intel's newest nodes get used first for CPUs, so they're frequency-optimized. But TSMC favors density-optimized nodes because their early adopters make smartphone and graphics processors. So even if Intel and TSMC were keeping pace with one another, it'd still make more sense to build the CPU tile at Intel and the GPU and SoC tiles at TSMC. (I believe both companies tend to make alternate versions of their nodes that are optimized differently, but those come later and might be a little more expensive.)

AnandTech also said that the Crestmont cores in the SoC are optimized with a lower voltage-frequency curve. Perhaps that's because the TSMC N6 process they're built with is density-optimized?

Source: Intel Unveils Meteor Lake Architecture: Intel 4 Heralds the Disaggregated Future of Mobile CPUs (www.anandtech.com)

AnotherReader · Sep 19, 2023

Squared said:
I'm not really sure what it is for, except I've heard some AI workloads can benefit from it. My reason for wondering is because Golden/Raptor Cove includes it, but Gracemont does not. So I'm wondering if Redwood Cove and Crestmont were actually designed with the same ISA instead of non-shared instructions being disabled.

From AnandTech, it sounds like the GPU tile is made with TSMC N5 and the SoC with TSMC N6.

I wonder if Intel's new reliance on TSMC has to do with node optimization. Intel's newest nodes get used first for CPUs, so they're frequency-optimized. But TSMC favors density-optimized nodes because their early adopters make smartphone and graphics processors. So even if Intel and TSMC were keeping pace with one another, it'd still make more sense to build the CPU tile at Intel and the GPU and SoC tiles at TSMC. (I believe both companies tend to make alternate versions of their nodes that are optimized differently, but those come later and might be a little more expensive.)

AnandTech also said that the Crestmont cores in the SoC are optimized with a lower voltage-frequency curve. Perhaps that's because the TSMC N6 process they're built with is density-optimized?

Source: Intel Unveils Meteor Lake Architecture: Intel 4 Heralds the Disaggregated Future of Mobile CPUs (www.anandtech.com)

I suspect you're right about N5 being denser than Intel 4 for GPUs, which don't need to clock very high. As far as VNNI is concerned, it's useful for algorithms used in AI. I'm quoting the relevant part from the link in the previous sentence:

Platforms not using VNNI require the vpmaddubsw, vpmaddwd and vpaddd instructions to complete the multiply-accumulate operations in INT8 convolution operation.

Platforms using VNNI require only one instruction, "vpdpbusd", to complete the INT8 convolution operation.
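
To make the difference concrete, here is a minimal Python model of what the two paths compute for each 32-bit lane. It is a sketch written from the documented instruction semantics, not code from the quoted article; the helper names `sat16`, `wrap32` and `dot_without_vnni` are made up for illustration, and lists of plain integers stand in for vector registers.

```python
def sat16(x):
    """Clamp to the signed 16-bit range, as vpmaddubsw does."""
    return max(-32768, min(32767, x))


def wrap32(x):
    """Wrap to a signed 32-bit value (two's complement)."""
    x &= 0xFFFFFFFF
    return x - (1 << 32) if x & 0x80000000 else x


def vpmaddubsw(a_u8, b_s8):
    """Unsigned byte * signed byte, adjacent pairs summed with 16-bit saturation."""
    return [sat16(a_u8[i] * b_s8[i] + a_u8[i + 1] * b_s8[i + 1])
            for i in range(0, len(a_u8), 2)]


def vpmaddwd(a_s16, b_s16):
    """Signed 16-bit multiply, adjacent pairs summed into 32-bit lanes."""
    return [wrap32(a_s16[i] * b_s16[i] + a_s16[i + 1] * b_s16[i + 1])
            for i in range(0, len(a_s16), 2)]


def vpaddd(a_s32, b_s32):
    """Lane-wise 32-bit add."""
    return [wrap32(x + y) for x, y in zip(a_s32, b_s32)]


def dot_without_vnni(acc, a_u8, b_s8):
    """Three-instruction sequence: vpmaddubsw, vpmaddwd (by ones), vpaddd."""
    pairs16 = vpmaddubsw(a_u8, b_s8)
    quads32 = vpmaddwd(pairs16, [1] * len(pairs16))
    return vpaddd(acc, quads32)


def vpdpbusd(acc, a_u8, b_s8):
    """Single VNNI instruction: four u8*s8 products added to each 32-bit lane."""
    return [wrap32(acc[i // 4] + sum(a_u8[i + k] * b_s8[i + k] for k in range(4)))
            for i in range(0, len(a_u8), 4)]


a = [1, 2, 3, 4, 10, 20, 30, 40]   # unsigned 8-bit activations
b = [1, -1, 2, -2, 3, 3, -3, -3]   # signed 8-bit weights
acc = [100, -5]                    # two 32-bit accumulators

print(dot_without_vnni(acc, a, b))  # [97, -125]
print(vpdpbusd(acc, a, b))          # [97, -125]
```

The two paths agree here, but not always: with a = [255, 255, 0, 0] and b = [127, 127, 0, 0] the first pair sums to 64770, which vpmaddubsw saturates to 32767, while vpdpbusd keeps the full value. So VNNI saves instructions and also avoids that intermediate 16-bit saturation.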

DavidC1 · Sep 19, 2023

Denver said:
Wouldn't Meteor Lake be focused on laptops? I don't understand the reason for comparing the iGPU with the desktop version, which only has 256 shaders (768 shaders on the laptop). Bigger numbers to show investors, of course...

Not sure where you are getting that it's desktop Raptor Lake. It's a 1:1 comparison against mobile Raptor Lake. Or did you forget Raptor Lake is also in mobile?

AnotherReader said:
I like the low power island, which clocks the E cores more sensibly. I was expecting them to use smaller cores than the fairly large Crestmont cores for the low power island. Let's see when it starts shipping. The claims for power efficiency are lower than I expected: more than 20% compared to Raptor Lake mobile. Intel's presentation was rather light on microarchitectural details. The only hard numbers were cache sizes: the L1 instruction cache for the P core has doubled to 64 KB.

That's just for the process, not Meteor Lake. Also, the reason they were light on uarch details is that neither the P nor the E cores advance that much. It's basically:

P: Doubles L1i cache to 64KB
E: rename/allocate goes from 5 to 6

Squared said:
AnandTech also said that the Crestmont cores in the SoC are optimized with a lower voltage-frequency curve. Perhaps that's because the TSMC N6 process they're built with is density-optimized?

We always knew that. Intel 4 for Compute, TSMC N5 for GPU and N6 for IO and SoC.

The LP E cores in the SoC are just optimized for lower power, nothing to do with the process.

Hyderz · Sep 19, 2023

Cool! Can't wait to see what Meteor Lake can do. As always, I will be getting the second generation for the extra optimization, etc.

DavidC1 · Sep 19, 2023

It looks like Server and PC cores are diverging further.

Redwood Cove server in Granite Rapids: Enhanced branch prediction, double L1i
Sierra Glen (Crestmont counterpart) in Sierra Forest: Rename/Allocate is still at 5, same as Gracemont.

Redwood Cove client in Meteor Lake: Double L1i
Crestmont in Meteor Lake: Rename/Allocate is at 6, Enhanced branch prediction

The server E core is optimized for higher frequency, hence the slightly narrower uarch.

Denver · Sep 19, 2023

DavidC1 said:
Not sure where you are getting that it's desktop Raptor Lake. It's a 1:1 comparison against mobile Raptor Lake. Or did you forget Raptor Lake is also in mobile?

That's just for the process, not Meteor Lake. Also, the reason they were light on uarch details is that neither the P nor the E cores advance that much. It's basically:

P: Doubles L1i cache to 64KB
E: rename/allocate goes from 5 to 6

We always knew that. Intel 4 for Compute, TSMC N5 for GPU and N6 for IO and SoC.

The LP E cores in the SoC are just optimized for lower power, nothing to do with the process.

You're right, I didn't even notice that they refreshed Alder Lake mobile and called it Raptor Lake.

DavidC1 · Sep 19, 2023

Squared said:
Maybe there was more buried in the slides, but I feel like very little new information was presented here. I had heard that there was a 6P+8E core configuration plus 2 E-cores in the SoC that would only be used if the CPU tile was powered down. But now I know it's 4 cores on the SoC and they're available for use even when the CPU tile is active.

It's 2x LP E cores not four. https://www.techpowerup.com/review/intel-meteor-lake-technical-deep-dive/3.html

On the slide "Meteorlake Low Power Island".

With Crestmont you can have 2-core clusters, rather than only 4-core clusters as with Alder and Raptor Lake.

@AnotherReader Also, Intel 4 specifically only supports HP libraries and doesn't have enough to support a full IO block. Don't know why they would continue to rely heavily on TSMC in the future, though.

AnotherReader · Sep 19, 2023

DavidC1 said:
It's 2x LP E cores not four. https://www.techpowerup.com/review/intel-meteor-lake-technical-deep-dive/3.html

On the slide "Meteorlake Low Power Island".

With Crestmont you can have 2-core clusters, rather than only 4-core clusters as with Alder and Raptor Lake.

@AnotherReader Also, Intel 4 specifically only supports HP libraries and doesn't have enough to support a full IO block. Don't know why they would continue to rely heavily on TSMC in the future, though.

Yes, Intel 3 should be the full-featured version of Intel 4.

Zareek · Sep 19, 2023

I bet we'll see this in a lot of NUC-like devices. That new iGPU might finally give AMD some competition on that front.

unwind-protect · Sep 19, 2023

Is there actually any machine learning software that uses Intel's neural accelerator?

AusWolf · Sep 19, 2023

So if I get it right, the purpose of moving some e-cores onto the SoC tile is to save on power consumption by disabling the tile interconnects - which is basically what consumes a lot of power on Ryzen when idle. Interesting.

Squared · Sep 19, 2023

DavidC1 said:
It's 2x LP E cores not four. https://www.techpowerup.com/review/intel-meteor-lake-technical-deep-dive/3.html

On the slide "Meteorlake Low Power Island".

With Crestmont you can have 2-core clusters, rather than only 4-core clusters as with Alder and Raptor Lake.

Ah you're right. The article says, "Based on the same 'Crestmont' core architecture as the E-cores on the Compute tile although not being part of its ringbus or sharing its L3 cache; this E-core cluster has its own L2 cache shared among four cores." But I can see the slide you referred to from Intel which says 2 cores.

Zareek said:
I bet we'll see this in a lot of NUC-like devices. That new iGPU might finally give AMD some competition on that front.

Tiger Lake's iGPU was actually a bit faster than AMD's Vega iGPU at the time. But Alder Lake and Raptor Lake use the exact same iGPU as Tiger Lake, so yeah, this will be the first time in a while it's improved, and with a 2x improvement it should rival the current RDNA3 iGPU. Intel's lower trims are also less cut down than AMD's, so I think certain i5 and i3 models still compete very favorably against AMD.

THU31 · Sep 19, 2023

A very mobile-focused architecture, but it's pretty cool that we're finally getting rid of the chipset (even though technically in mobile CPUs the PCH was on package).

Will they do that for desktops too, or will there still be a PCH on the motherboard?

evernessince · Sep 19, 2023

AusWolf said:
So if I get it right, the purpose of moving some e-cores onto the SoC tile is to save on power consumption by disabling the tile interconnects - which is basically what consumes a lot of power on Ryzen when idle. Interesting.

From what I read, it's only the compute tile:

"it allows Intel to power down the Compute tile when not needed"

In essence the SoC tile acts as a smaller low end chip within a chip that only taps the compute / IO tiles as needed. Should reduce power consumption for idle or very light tasks.
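
A toy model of that idea, just to show the shape of the decision. This is not Intel's actual scheduling policy: the `Workload` fields, the `latency_sensitive` flag and the rule itself are hypothetical, and the only number taken from this thread is the 2 LP E cores on the SoC tile.

```python
from __future__ import annotations

from dataclasses import dataclass

# Per the "Low Power Island" slide cited earlier in this thread.
LP_E_CORES = 2


@dataclass(frozen=True)
class Workload:
    runnable_threads: int    # threads that want CPU time right now
    latency_sensitive: bool  # hypothetical hint that the work needs fast cores


def tiles_to_power(w: Workload) -> set[str]:
    """Return which tiles stay powered under this toy rule.

    The SoC tile is always on. The compute tile is woken only when the
    work does not fit on the LP E cores or is flagged latency-sensitive.
    """
    tiles = {"soc"}
    if w.runnable_threads > LP_E_CORES or w.latency_sensitive:
        tiles.add("compute")
    return tiles


print(sorted(tiles_to_power(Workload(1, False))))  # ['soc']
print(sorted(tiles_to_power(Workload(6, False))))  # ['compute', 'soc']
```

The point is only that idle and very light work can be served without waking the compute tile at all; how aggressively the real hardware and OS scheduler do this is something the slides don't spell out.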

shoskunk · Sep 19, 2023

Intel calls a "disaggregated chiplet-based processor" a SoP or System-on-Package chip architecture.

This article fails to mention, even once, the core IDM 2.0 components as Intel defined them

Dis-aggregated nonsense writing... Better check with DerBuyer.. lulz.

DavidC1 · Sep 20, 2023

THU31 said:
A very mobile-focused architecture, but it's pretty cool that we're finally getting rid of the chipset (even though technically in mobile CPUs the PCH was on package).

Will they do that for desktops too, or will there still be a PCH on the motherboard?

AMD has had an on-die chipset since the Bulldozer-derivative chip right before Zen. The I/O features are cut down, so you could call them PCH-lite. The desktops come with an additional chipset for more IO.

Intel could conceivably do the same thing.

Intel says 4-6% improvement for Crestmont E cores. Likely less for the P.

HisDivineOrder · Sep 20, 2023

Disappointing that they neutered the AV1 encoder to 4:2:0. I was expecting them to make AV1 ubiquitous but they left room for improvement for future generations in classic Apple style.

Stephen. · Sep 20, 2023

Very nice article, I enjoyed reading this.
