Monday, April 29th 2024

Intel Statement on Stability Issues: "Motherboard Makers to Blame"

A couple of weeks ago, we reported on NVIDIA directing users of Intel's 13th Generation Raptor Lake and 14th Generation Raptor Lake Refresh CPUs to consult Intel for any issues with system stability. Motherboard makers, by default, often run the CPU outside of Intel's recommended specifications, overvolting the CPU through modifying voltage curves, automatic overclocks, and removing power limits.

Today, we learned that Igor's Lab has obtained a statement from Intel that the company prepared for motherboard OEMs regarding the issues multiple users report. Intel CPUs come pre-programmed with a stock voltage curve. When motherboard makers remove power limits and automatically adjust voltage curves and frequency targets, the CPU can be pushed outside its safe operating range, possibly causing system instability. Intel has set up a dedicated website for users to report their issues and offer support. Manufacturers like GIGABYTE have already issued new BIOS updates for users to achieve maximum stability, which incidentally has recent user reports of still being outside Intel spec, setting PL2 to 188 W, loadlines to 1.7/1.7 and current limit to 249 A. While MSI provided a blog post tutorial for stability. ASUS has published updated BIOS for its motherboards to reflect on this Intel baseline spec as well. Surprisingly, not all the revised BIOS values match up with the Intel Baseline Profile spec for these various new BIOS updates from different vendors. You can read the statement from Intel in the quote below.
Intel has observed that this issue may be related to out of specification operating conditions resulting in sustained high voltage and frequency during periods of elevated heat.

Analysis of affected processors shows some parts experience shifts in minimum operating voltages which may be related to operation outside of Intel specified operating conditions.

While the root cause has not yet been identified, Intel has observed the majority of reports of this issue are from users with unlocked/overclock capable motherboards.

Intel has observed 600/700 Series chipset boards often set BIOS defaults to disable thermal and power delivery safeguards designed to limit processor exposure to sustained periods of high voltage and frequency, for example:
  • Disabling Current Excursion Protection (CEP)
  • Enabling the IccMax Unlimited bit
  • Disabling Thermal Velocity Boost (TVB) and/or Enhanced Thermal Velocity Boost (eTVB)
  • Additional settings which may increase the risk of system instability:
  • Disabling C-states
  • Using Windows Ultimate Performance mode
  • Increasing PL1 and PL2 beyond Intel recommended limits
Intel requests system and motherboard manufacturers to provide end users with a default BIOS profile that matches Intel recommended settings.

Intel strongly recommends customer's default BIOS settings should ensure operation within Intel's recommended settings.

In addition, Intel strongly recommends motherboard manufacturers to implement warnings for end users alerting them to any unlocked or overclocking feature usage.

Intel is continuing to actively investigate this issue to determine the root cause and will provide additional updates as relevant information becomes available.

Intel will be publishing a public statement regarding issue status and Intel recommended BIOS setting recommendations targeted for May 2024.
Source: Igor's Lab
Add your own comment

272 Comments on Intel Statement on Stability Issues: "Motherboard Makers to Blame"

#176
RGAFL
I spy with my little eye, something beginning with Intel Shill. How the hell is bringing a AMD Agesa problem that was fixed fairly rapidly into a discussion about Intel power limits that has been going on for a few years now justified. The only way Intel could keep up with AMD is with jacking the power up to 11. How that can be defended and justified is beyond me. This is also from a staff member going by the profile. I hope you don't do any AMD reviews.
Posted on Reply
#177
Tek-Check
Solaris17You think they would give us the user what they do on ark and NOT give the oem the values when they give they idk tell them how to use the CPUs in a new chipset? Thats a new level of denial. AMD board partners do the same shit. Did AMD not give them the specs when ASUS was burning holes in the core?
Get real.
Nice try, but the article and topic are about Intel. Focus.
Why did everyone suddently scramble to introduce BIOS update with "Intel Baseline Profile"? Isn't that something that should be there by default when buyers purchase motherboards?
Posted on Reply
#178
Solaris17
Super Dainty Moderator
RGAFLI spy with my little eye, something beginning with Intel Shill. How the hell is bringing a AMD Agesa problem that was fixed fairly rapidly into a discussion about Intel power limits that has been going on for a few years now justified. The only way Intel could keep up with AMD is with jacking the power up to 11. How that can be defended and justified is beyond me. This is also from a staff member going by the profile. I hope you don't do any AMD reviews.
I own AMD systems these two companies have both been doing this for years. “It was an agesa problem” like it’s a bios problem here? You can’t have your cake and eat it. Either both companies really love money and look the other way or by your logic both companies are so negligent they didn’t pass on the voltage values which is it?

board partners are doing this because it sells boards. How is MSI going to sell you an $1100 meg if it performs the same as a z760? Do you really want nehamic audio? No.

if you want to get on your “intel bad” conspiracy soap box for mobo makers being negligent wait until you stumble across these voltage increases over time steadily selling more coolers.

you people preach intel is big bad but the SAME mobo manufacturers that are bumping up the voltages were welding IHSs to AMD coolers months ago.

make up your mind.
Posted on Reply
#179
RGAFL
Solaris17You think they would give us the user what they do on ark and NOT give the oem the values when they give they idk tell them how to use the CPUs in a new chipset? Thats a new level of denial. AMD board partners do the same shit. Did AMD not give them the specs when ASUS was burning holes in the core?

Get real.
The common theme with that issue was ASUS. There was reports of a few ASRock and Gigabyte users it happened to but one of the users of said boards said they copied the info from a website giving information about the ASUS settings into a Gigabyte board and BOOM. The Intel problem is widespread across motherboard vendors. Wonder where they got the information from or who allowed them to do that. Do Intel not validate their vendor BIOSES. This is a thing Intel have been doing for a few generations of CPU now. Ramping up the power to get near to AMD. If you want to defend that be my guest. What I would take is AM4 which is pretty much rock solid now or AM5 which again after a few early issues and one particular motherboard vendor taking a few liberties is now pretty much there.
Posted on Reply
#180
mkppo
Funny how the people defending intel conveniently ignore the comments intel made publicly about the fact that maxed out power limits and all the shenanigans the board partners were doing are "within spec" only to backtrack now and blame the board partners.

Atleast AMD didn't stoop that low. Intel reeks of desperation now and I guess the defensive attitude stems from that. How far they've fallen..
Posted on Reply
#181
Tek-Check
Solaris17you people preach intel is big bad but the SAME mobo manufacturers that are bumping up the voltages were welding IHSs to AMD coolers months ago.
Focus on Intel. This is the thread and article about their alleged wrong-doings. Intel is supposed to validate BIOS-es from all vendors so that their CPUs work correctly. If Intel had not been satisfied with high benchmarks, they would have never allowed vendors to run their CPUs with extreme custom profiles. But, they overcooked it... and now they blame mobo vendors for what they allowed them to do. You know this, we know this, Intel know this, vendors know it. Now it's about face-saving and it's ugly so far.
Posted on Reply
#182
Dr. Dro
Vya DomusAMD also has the benefit of typically not lying
I seem to recall that "Zen 3 on X370 was impossible due to BIOS ROM size limitations" and that the official solution was literally "upgrade your garbage first-generation motherboard", yet they're running the Zen 3 X3D chips just fine nowadays... all it took was Alder Lake? :confused:
ZubasaOh it goes even further back then that. The board venders have been progressively ignoring every limit one by one and Intel turned a blind eye every time until now.
gamersnexus.net/guides/3389-intel-tdp-investigation-9900k-violating-turbo-duration-z390
First it was the Turbo Duration, then the PL and Voltage, and most recently the current extrusion protection on 14th gen.
GIGABYTE Releases CEP Disable Option in BIOS Updates to its Intel Z790 and B760 Motherboards | TechPowerUp
Apperently it comes from a microcode released months ago, and Intel is the only one that can release microcodes for their CPUs.
The list goes on, as long as CB R23 score goes up Intel did not care. The CPU just needed to live long enough for the reviews.

MSI: Nope
videocardz.com/newz/msi-z790-max-bios-feature-increases-intel-cpu-throttling-temperature-to-115c
Users have to go out of their way to set the "stock" values on DIY boards.
www.techpowerup.com/review/intel-core-14th-gen-unboxing-preview/2.html
The Asus board used was kindy provided by Intel.
Intel Core i9-14900K Review - Reaching for the Performance Crown - Test Setup | TechPowerUp

Just for the record, W1zzard deliberately disables the thermal and current protection for these tests. There is no circumstance under which a Raptor Lake processor will casually hit 115°C without the user disabling the thermal protection out of their own volition.
Posted on Reply
#183
Vya Domus
Dr. Droyet they're running the Zen 3 X3D chips just fine nowadays
Yeah but I am pretty sure those BIOS revisions removed support for some older CPUs, that's how they got around the limitation. I don't see how they were lying, it was true that it was not possible to add support for those CPUs, at least not without cutting stuff out.
Posted on Reply
#184
Dr. Dro
Vya DomusYeah but I am pretty sure those BIOS revisions removed support for some older CPUs, that's how they got around the limitation. I don't see how they were lying, it was true that it was not possible to add support for those CPUs, at least not without cutting stuff out.
Ah yes, everyone demanding support for Zen 3 was just dying to retain the ability to use those masterpieces of superior AMD engineering, those incredibly high performance, cherished Excavator chips...
Posted on Reply
#185
RGAFL
Vya DomusYeah but I am pretty sure those BIOS revisions removed support for some older CPUs, that's how they got around the limitation. I don't see how they were lying, it was true that it was not possible to add support for those CPUs, at least not without cutting stuff out.
Yes, CPU's stripped out were, believe it or not, Bulldozer/Excavator/Phenom CPU's that were released to OEM's for AM4 and some of the very early Ryzen 1 CPU's. Also some features were removed and the BIOS interface streamlined to an extent.

Anyway, enough of the digression, why do all the Intel owners want to change the subject and dodge the subject at hand.
Posted on Reply
#186
Solid State Brain
mkppoFunny how the people defending intel conveniently ignore the comments intel made publicly about the fact that maxed out power limits and all the shenanigans the board partners were doing are "within spec" only to backtrack now and blame the board partners.

Atleast AMD didn't stoop that low. Intel reeks of desperation now and I guess the defensive attitude stems from that. How far they've fallen..
Increased power limits are within specs only if thermal and electrical specs are maintained.

Anyway, motherboards having silly default settings isn't a new thing. A few years ago I had an Aorus B560 motherboard mated to an i9-11900 that wouldn't be stable under semi-default settings (with just power limits increased over the 65W default. Not much else to tweak on a B-series motherboard and non-K CPU), causing errors or bluescreens during intensive benchmarks. At first I thought it was the CPU, but eventually I found it was just a matter of voltage sagging too much under load. It turned out that the motherboard had rather weak default loadline and LLC settings, that while on one hand mitigated thermal throttling (to some extent), on another caused stability issues under load.
Posted on Reply
#187
RGAFL
Intel must be hating Epic at the moment. Unreal Engine 5 really has started to open a can of worms.
Posted on Reply
#188
Vya Domus
Dr. DroAh yes, everyone demanding support for Zen 3 was just dying to retain the ability to use those masterpieces of superior AMD engineering, those incredibly high performance, cherished Excavator chips...
Not but you understand why it wasn't straight forward to add support for those CPUs. And this was a problem only on the boards that had 32MB or whatever amount was problematic, which wasn't all of them, so it wasn't like this was some mastermind planned obsolesces conspiracy to get you to buy newer boards, I doubt AMD even considered this would become a problem.
Posted on Reply
#189
Tek-Check
Every Arrow Z890 motherboard will have to run Intel Baseline profile as out-of-the-box experience. Any enhancements will need to be enabled voluntarily and this should be clearly said to buyers. There is nothing wrong with running enhanced profiles, but this needs to be an add-on feature to be explicitly enabled by users, if they choose so.

As PC World debate on youtube says: "Mummy and daddy need to come in to clean this mess once and for all".
Posted on Reply
#190
Darmok N Jalad
Once the door was opened to virtually unlimited power settings and CPUs and GPUs that basically overclocked themselves, we’d eventually end up with issues like this. Motherboard makers were basically encouraged to run up to the edge of the cliff and rely on the chip to throttle itself. I’d be curious to see a graph of actual max power consumption over time/generation. It was a desperate grab for more performance, and now that “engineering solution” has been played out.
Posted on Reply
#191
Dr. Dro
Darmok N JaladOnce the door was opened to virtually unlimited power settings and CPUs and GPUs that basically overclocked themselves, we’d eventually end up with issues like this. Motherboard makers were basically encouraged to run up to the edge of the cliff and rely on the chip to throttle itself. I’d be curious to see a graph of actual max power consumption over time/generation. It was a desperate grab for more performance, and now that “engineering solution” has been played out.
Agreed. Call me old-fashioned, but I very much preferred the old days of static clocks instead of "dynamic boosts".
Posted on Reply
#192
AnonymousGuy767
I had a 14900K fail for the exact scenario described. I wasn't overclocking really: just maxing out the power limits, setting XMP and that's it. But I'm on full custom watercooling and my CPU idles most of the time so my theory is that there's more going on than just the power limits which I wasn't hitting 99.99% of the time anyways. I think that it's tied to the voltage because I would see *really f-ing high* voltages at idle like 1.6V in the BIOS. I wouldn't be surprised if some boards "Auto" voltage when you increase the power is like "oh sweet lemme do max loadline and +.2V baseline". Now I'm using a 14700 where it idles at 1.27V with everything on Auto and the power limit maxed....like it's following a real VF curve instead of doing whatever Asus figures to do on a K series.

So unless this Intel Baseline bios update also fixes the VF curve to not slam high voltages in at idle...the new profile wouldn't have prevented my 14900K from degrading even if it existed from day 1. What Intel seems to be fixing is the scenario of "I set everything to Auto and my CPU overheats".
Posted on Reply
#193
Crackong
Solid State BrainIncreased power limits are within specs only if thermal and electrical specs are maintained.
Regarding the 'electrical spec' part
I must say, judging from the open Intel spec document I've shown at #157

Only maximum values were specified by Intel.
While 'Capping maximum values' sounds okay.
Not all values are equal, certain things get ugly when they are ' too low', such as current / resistance / load line calibration

The problem (random crash in testing/UE5) we are facing right now seems to be caused by 'Not enough current / voltage during heavy workload'
Since the Intel document only specified the maximum current/voltage, which is 307A / 1.72V for a normal K SKU
No minimum values, nor typical values were supplied by Intel.
So the range is 0-307A / 0-1.72V

'Being too low' is actually...well.. 'Within Spec' .
Darmok N JaladOnce the door was opened to virtually unlimited power settings and CPUs and GPUs that basically overclocked themselves, we’d eventually end up with issues like this. Motherboard makers were basically encouraged to run up to the edge of the cliff and rely on the chip to throttle itself. I’d be curious to see a graph of actual max power consumption over time/generation. It was a desperate grab for more performance, and now that “engineering solution” has been played out.
True.
Things get dirty when everyone wants the last bit of juice getting squeezed out-of-box.

Really missed those days when 20-30% even 50% overclocking headroom was possible.
Posted on Reply
#194
Solid State Brain
CrackongRegarding the 'electrical spec' part
I must say, judging from the open Intel spec document I've shown at #157

Only maximum values were specified by Intel.
While 'Capping maximum values' sounds okay.
Not all values are equal, certain things get ugly when they are ' too low', such as current / resistance / load line calibration

The problem (random crash in testing/UE5) we are facing right now seems to be caused by 'Not enough current / voltage during heavy workload'
Since the Intel document only specified the maximum current/voltage, which is 307A / 1.72V for a normal K SKU
No minimum values, nor typical values were supplied by Intel.
So the range is 0-307A / 0-1.72V

'Being too low' is actually...well.. 'Within Spec' .
The CPU comes with a pre-programmed voltage curve at which it is supposed to be stable (with a certain margin for long-term drift) and intended to operate. If motherboard settings cannot supply the voltage that the CPU is requesting, then the CPU is not operating within specifications.

Posted on Reply
#195
Crackong
Solid State BrainThe CPU comes with a pre-programmed voltage curve at which it is supposed to be stable (with a certain margin for long-term drift) and intended to operate. If motherboard settings cannot supply the voltage that the CPU is requesting, then the CPU is not operating within specifications.

Yes.
But noted the word 'Individually calibrated' .
It means each CPU has their own voltage/frequency table.
That means a 'low bin' CPU could have voltage shots to the moon to achieve the said frequency.

It is quite unfair to ask the MB vendors to predict what a 'low bin' CPU would behave.
Since the lack of information supplied by Intel, and they have to try & error all those blank values, with Intel encouraging them to make LLC lower (lower voltage).
When low LLC meets 'low bin' CPU, bad things happened.
Posted on Reply
#196
Solid State Brain
The voltage/frequency table is something that the motherboard firmware is aware of and can (should) compensate for.

Of course, the existence of high/low bins implies that some CPUs will be able to boost longer before thermal throttling, some less. However, it isn't the motherboard manufacturer's job to make things "fair" for unlucky customers by lowering voltages (at the cost of instabilities).
Posted on Reply
#197
Crackong
Solid State BrainThe voltage/frequency table is something that the motherboard firmware is aware of and can (should) compensate for.
True, LLC should compensate that.

Then it all comes back to the paragraph I pointed out at #157
Intel themselves encouraged the MB vendors to use 'Superior board design with lower LLC to achieve better performance'
That's why I said it is a 50/50 blame on Intel/MB vendor.

It is quite obvious that Intel wants the MB vendor to use lower LLC and show better performance.
And the MB vendors get more sales when CPU performed better on their boards.
Technically they all act within-spec, until some low bin CPUs come in......

Maybe, MB vendors could have tested a few more hundred CPU to cook up a better LLC table.
Maybe, Intel themselves shouldn't allow such low bin CPU to be branded as i9 in a first place.
Posted on Reply
#198
Solid State Brain
You don't necessarily need an electrically less droopy LLC to compensate. On Intel CPUs there's an operating parameter called "AC loadline" which allows to adjust voltages up with current to make up for Vdroop. On some motherboards it tends to be high by default, causing CPUs to run hot (and at higher voltages than intended), on others it's too low by default, causing instabilities.

In principle it could be configured so that the voltage supplied under load is exactly the one requested by the CPU, but there's apparently barely any testing going on by motherboard manufacturers, with the same settings being used across different models having different voltage regulators and electrical characteristics.
Posted on Reply
#199
Crackong
Solid State BrainYou don't necessarily need an electrically less droopy LLC to compensate. On Intel CPUs there's an operating parameter called "AC loadline" which allows to adjust voltages up with current to make up for Vdroop. On some motherboards it tends to be high by default, causing CPUs to run hot (and at higher voltages than intended), on others it's too low by default, causing instabilities.
AC (or DC) loadline is exactly where the problem was.
In Intel spec document it is described that MB vendor should measure and set their own values. (Since no such value is provided by Intel)
And some 'Superior' / 'Improve' buzzword are also presented to encourage MB vendors to use a 'Shallower AC load line' design for more performance.
So naturally, MB vendors would favor a low AC load line default setting.

Solid State BrainIn principle it could be configured so that the voltage supplied under load is exactly the one requested by the CPU, but there's apparently barely any testing going on by motherboard manufacturers, with the same settings being used across different models having different voltage regulators and electrical characteristics.
Since there is no reference value provided by Intel.
MB vendors had to cook up their own values, by testing the ES CPUs provided by Intel.
IDK how many they've tested.
But juding from the reality, it doesn't cover the whole silicon lottery spectrum.
Posted on Reply
#200
AnonymousGuy767
CrackongBut juding from the reality, it doesn't cover the whole silicon lottery spectrum.
My 14900K wasn't stable at 6Ghz turbo boost. Didn't touch any frequencies or voltages manually, and it would crash at idle when it happened to clock up to 6Ghz because of some background process.

That was a fun one trying to troubleshoot. Eventually I locked it to 5.7 or 5.5 max regardless of core active count and called it a day, and it didn't crash after that. And then it degraded and had to get RMA'd anyways.

So I basically said "f this noise" and put a non-K in. No more TVB to go to unrealistic clocks or weird all core frequency stuff that is beyond specs. And seems to idle at a sane voltage so probably won't degrade. I have the replacement 14900K here so I'm tempted to unseal it and try the new Asus bios to see if they fixed that, but I've done so many CPU remounts and swaps and stuff fixing problems that I'm kinda over it.
Posted on Reply
Add your own comment
May 29th, 2024 04:00 EDT change timezone

New Forum Posts

Popular Reviews

Controversial News Posts