Tuesday, July 23rd 2024

Intel Statement on 13th and 14th Gen Core Instability: Faulty Microcode Causes Excessive Voltages, Fix Out Soon

Long-term reliability issues continue to plague Intel's 13th Gen and 14th Gen Core desktop processors based on the "Raptor Lake" microarchitecture, with users complaining that their processors have become unstable with heavy processing workloads, such as games. This includes the chips that have minor levels of performance tuning or overclocking. Intel had earlier isolated many of these stability issues to faulty CPU core frequency boosting algorithms, which it addressed through updates to the processor microcode that it got motherboard- and prebuilt manufacturers to distribute as UEFI firmware updates. The company has now come out with new findings of what could be causing these issues.

In a statement Intel posted on its website on Monday (22/07), the company said that it has been investigating the processors returned to it by users under warranty claims (which it has been replacing under the terms of its warranty). It has found that faulty processor microcode has been causing the processors to operate under excessive core voltages, leading to their structural degradation over time. "We have determined that elevated operating voltage is causing instability issues in some 13th/14th Gen desktop processors. Our analysis of returned processors confirms that the elevated operating voltage is stemming from a microcode algorithm resulting in incorrect voltage requests to the processor."
Modern processor power management runs on an intricate clockwork of collaboration between software, firmware, and hardware, with the software constantly telling the hardware what levels of performance it wants, and the hardware managing its power- and thermal budgets by rapidly altering the power and clock speeds of the various components, such as CPU cores, caches, fabric, and other on-die components. A faulty collaboration between any of the three key components could break this clockwork, as has happened in this case.

Intel is releasing yet another microcode update to its 13th- and 14th Gen Core processors, which will address not just the faulty boosting algorithm issue the company unearthed in June, but also the faulty voltage management the company discovered now. This new microcode should be released some time around mid-August to partners (motherboard manufacturers and PC OEMs), who will then need to validate it on their machines, before passing it along to end-users as UEFI firmware updates.
Intel is delivering a microcode patch which addresses the root cause of exposure to elevated voltages. We are continuing validation to ensure that scenarios of instability reported to Intel regarding its Core 13th/14th Gen desktop processors are addressed. Intel is currently targeting mid-August for patch release to partners following full validation. Intel is committed to making this right with our customers, and we continue asking any customers currently experiencing instability issues on their Intel Core 13th/14th Gen desktop processors reach out to Intel Customer Support for further assistance, the company stated.
It's important to note here, that the microcode update won't fix the issues on processors already experiencing instability, but prevent it on chips that aren't. The instability is caused by irreversible physical degradation of the chip. These chips will, of course, be covered under warranty.

Meanwhile, an interesting issue has come to light, which that some of Intel's processors built on the Intel 7 node are experiencing chemical oxidation of the die as they age. Intel responded to this, stating that it had discovered the oxidation manufacturing issues in 2023, and addressed it. The company also stated that die oxidation is not related to the stability issues it is embattled with.
We can confirm that the via Oxidation manufacturing issue affected some early Intel Core 13th Gen desktop processors. However, the issue was root caused and addressed with manufacturing improvements and screens in 2023. We have also looked at it from the instability reports on Intel Core 13th Gen desktop processors and the analysis to-date has determined that only a small number of instability reports can be connected to the manufacturing issue, the company stated.
If you feel your chip might be affected, you can file for an RMA.
Sources: Intel Community, Intel (Reddit)
Add your own comment

387 Comments on Intel Statement on 13th and 14th Gen Core Instability: Faulty Microcode Causes Excessive Voltages, Fix Out Soon

#351
OkieDan
TomorrowI can also offer a worse example: Sony.
I bought their Linkbuds S wireless earbuds in 2022 only for these to develop a battery discharge issue a week after my two year warranty period ended (both go from 100% to empty withing 15-30 minutes instead of usual ~8 hours).
Reading reviews and comments online there are many people facing the same issue with both the Linkbuds S and WF-1000 XM4 models produced and bought in 2022.

Yet Sony has not even acknowledged the issue nor provided any replacements for customers because in their eyes the warranty period for both products (for 2022 buyers at least) has ended and thus they feel they dont have to do anything.

A product failing a week after warranty ended feels like planned obsolescence...
I've avoided everything Sony since their DRM root kit.
Posted on Reply
#352
Sunny and 75
trparkyEven if it wasn't the intention, my confidence has been shaken.
Hope they can restore some of that with the release of ARL and BTL for a thriving CPU market.
Posted on Reply
#353
Dr. Dro
Sunny and 75Hope they can restore some of that with the release of ARL and BTL for a thriving CPU market.
I am really looking forward to Bartlett Lake. I know it's a NEX product but I hope Intel will go ahead with a full-scale release of it. I'll buy one at launch.
Posted on Reply
#354
Sunny and 75
Dr. DroI am really looking forward to Bartlett Lake.
The same.
Posted on Reply
#355
trparky
Sunny and 75Hope they can restore some of that with the release of ARL and BTL for a thriving CPU market.
I hope so as well, mainly because I don't want to see AMD become complacent in the market. When there's competition, the consumer wins.
Posted on Reply
#356
Sunny and 75
trparkyWhen there's competition, the consumer wins.
That's the definition of a thriving market.
Posted on Reply
#357
ikjadoon
www.theverge.com/2024/7/26/24206529/intel-13th-14th-gen-crashing-instability-cpu-voltage-q-a

Ugly to disgusting. This needs a proper watchdog / regulatory investigation at this point.
  • Intel confirms ANY 65W or higher 13th/14th CPU is susceptible to accelerated degradation.
  • Intel will not recall any CPUs.
  • Intel will not provide any manufacturing dates / serial number ranges for via oxidation.
  • Intel (obviously) confirms the damage is permanent. It will never get better.
Posted on Reply
#358
close
JWNoctisProbably not, or they wouldn't have done it in the first place.

"Quietly drop the voltage and performance when reviewers stop looking" would be actual fraud.
Noooo, it's fixing a bug in microcode and improving reliability. The way they're approaching this whole scandal reeks of fraud, especially since the message they passed on to the press and consumers ("just some microcode fixes") was very different to what they were already discussing internally about voltages and excessive oxidation prior to that. They withheld that information and if even one consumer's CPU ran out of warranty while Intel was yet again trying to bury the truth, that's already fraud enough for me. They now have a long tradition of trying to hide issues for just long enough for the CEO to dump some shares. But as I said elsewhere, as long as enough people and companies have more brand loyalty than sense, they will be treated by Intel accordingly.
Posted on Reply
#359
BoggledBeagle
ikjadoon
  • Intel (obviously) confirms the damage is permanent. It will never get better.
We need the silicon healer! :respect:
Posted on Reply
#360
RatusNatus
fevgatosRealistically it won't. Worst case scenario it drops what, 100mhz on the pcores? Let's say 200? Heck let's make it 300 just for the sake. That's a 3% drop on the mt performance of the chip. It's so irrelevant you won't even notice.
Some Minecraft servers (14900k) were dying in 2 months of usage.
They reduced 500mhz and that made it last 6 to 8 months before failing.

This is the source...
Posted on Reply
#361
MikeSnow
I was quite negative here about Intel this week, but my recent RMA experience has been flowless. I had stability issues with my 13900KF CPU for over an year, but I refrained from asking an RMA until this week, as I needed my computer for work and didn't want to wait a few weeks for a replacement.

Anyway, as this Saturday I'm leaving on vacation for two weeks, I decided that this week would be the best time to ask an RMA for it, especially in light of the latest revelations. I was hoping to get a replacement when I got back from vacation.

So, on Tuesday I made the RMA request, telling them that my CPU is unstable even with the latest BIOSes and the Intel defaults.

- 8 hours later, they were asking for the serial number, as I have given them an incorrect serial number initially.
- 4 hours later the RMA was approved, and I received an estimation of 5-7 business days to receive the replacement, from the time I submit the faulty unit.
- 1 hour later I received the DHL return label. The collection was scheduled for Thursday, 2 days later, but I decided not to wait for that.
- 3 hours later I handed over the faulty CPU at a local DHL service point
- 15 hours later the CPU started its travel towards Intel
- 22 hours later the CPU arrived at Intel
- 7 hours later they initiated shipping for the replacement CPU
- 2 hours later the CPU was on its way to me
- 3 hours later I received an email confirming they received the CPU and they need to validate it which would take 1 to 3 business days. Not sure what that email was about, since they already sent the replacement, probably just an automated message.
- 16 hours later I had a brand new 13900KF CPU on my desk.

Overall, the process took just a bit over 3 days, which I find almost unbelievable. I didn't get any pushback from Intel, and apparently when you send them a CPU that they know is very likely to be affected they don't actually do any validation, except that you actually sent them the CPU, and almost immediately send you a replacement.

So, while they might have been reluctant to accept RMAs in the past, it seems that right now they are actually doing the right thing regarding RMAs, and they are trying to make things as painless as possible for their customers.

While I can't totally forgive them for their past behavior, I have to give them a 10/10 for my recent experience.

The new CPU uses just 1.279 V in the BIOS, compared to the defective CPU, which used 1.447V. I left the BIOS on the Intel defaults with no XMP, and so far I don't have any stability issues and it reaches the maximum frequencies properly. But I won't do more extensive testing or stress it more until the microcode updates are released.
Posted on Reply
#362
Dr. Dro
RatusNatusSome Minecraft servers (14900k) were dying in 2 months of usage.
They reduced 500mhz and that made it last 6 to 8 months before failing.

This is the source...
I set up a Minecraft server on my personal server (C2Q Q9505) last year and basically forgot about it. I just logged in there yesterday and everything was as I left, talk about time machine effect. Not good. A 14900K must be so outrageously powerful for Minecraft hosting that you probably need to host a 500-player server before it begins to challenge the chip :eek:
MikeSnowI was quite negative here about Intel this week, but my recent RMA experience has been flowless. I had stability issues with my 13900KF CPU for over an year, but I refrained from asking an RMA until this week, as I needed my computer for work and didn't want to wait a few weeks for a replacement.

Anyway, as this Saturday I'm leaving on vacation for two weeks, I decided that this week would be the best time to ask an RMA for it, especially in light of the latest revelations. I was hoping to get a replacement when I got back from vacation.

So, on Tuesday I made the RMA request, telling them that my CPU is unstable even with the latest BIOSes and the Intel defaults.

- 8 hours later, they were asking for the serial number, as I have given them an incorrect serial number initially.
- 4 hours later the RMA was approved, and I received an estimation of 5-7 business days to receive the replacement, from the time I submit the faulty unit.
- 1 hour later I received the DHL return label. The collection was scheduled for Thursday, 2 days later, but I decided not to wait for that.
- 3 hours later I handed over the faulty CPU at a local DHL service point
- 15 hours later the CPU started its travel towards Intel
- 22 hours later the CPU arrived at Intel
- 7 hours later they initiated shipping for the replacement CPU
- 2 hours later the CPU was on its way to me
- 3 hours later I received an email confirming they received the CPU and they need to validate it which would take 1 to 3 business days. Not sure what that email was about, since they already sent the replacement, probably just an automated message.
- 16 hours later I had a brand new 13900KF CPU on my desk.

Overall, the process took just a bit over 3 days, which I find almost unbelievable. I didn't get any pushback from Intel, and apparently when you send them a CPU that they know is very likely to be affected they don't actually do any validation, except that you actually sent them the CPU, and almost immediately send you a replacement.

So, while they might have been reluctant to accept RMAs in the past, it seems that right now they are actually doing the right thing regarding RMAs, and they are trying to make things as painless as possible for their customers.

While I can't totally forgive them for their past behavior, I have to give them a 10/10 for my recent experience.

The new CPU uses just 1.279 V in the BIOS, compared to the defective CPU, which used 1.447V. I left the BIOS on the Intel defaults with no XMP, and so far I don't have any stability issues and it reaches the maximum frequencies properly. But I won't do more extensive testing or stress it more until the microcode updates are released.
Dang! That's a good improvement. Do you have an ASUS ROG motherboard? Curious to see the SP prediction for your chip
Posted on Reply
#363
MikeSnow
Yes, I have a ASUS ROG STRIX Z790-F GAMING WIFI motherboard.
Posted on Reply
#364
Dr. Dro
MikeSnowYes, I have a ASUS ROG STRIX Z790-F GAMING WIFI motherboard.
Mmm I don't know if the Strix series have SP prediction readout capability like the Maximus series, but if so you can find it on "AI tweaking" section. Make sure to update your BIOS and use the Intel "Performance" profile if you want to keep your chip safe, though. You'll lose some perf but temps will be great and volts nice and low
Posted on Reply
#365
trparky
MikeSnowThe new CPU uses just 1.279 V in the BIOS, compared to the defective CPU, which used 1.447V. I left the BIOS on the Intel defaults with no XMP, and so far I don't have any stability issues and it reaches the maximum frequencies properly. But I won't do more extensive testing or stress it more until the microcode updates are released.
According to a video by Gamer Meld on YouTube that went over an article by Igor's Lab and The Verge, all new replacement processors already have the new microcode loaded.
Will replacement / RMA’d chips ship with the microcode update preapplied beginning in August? Is Intel still shipping replacement chips ahead of that update?
Intel will be applying to microcode to 13th/14th Gen desktop processors that are not yet shipped once the production patch is released to OEM/ODM partners (targeting mid-August or sooner). For 13th /14th Gen desktop processors already in service, users will need to apply the patch via BIOS update once available.
Posted on Reply
#366
Sunny and 75
MikeSnowI was quite negative here about Intel this week, but my recent RMA experience has been flowless. I had stability issues with my 13900KF CPU for over an year, but I refrained from asking an RMA until this week, as I needed my computer for work and didn't want to wait a few weeks for a replacement.

Anyway, as this Saturday I'm leaving on vacation for two weeks, I decided that this week would be the best time to ask an RMA for it, especially in light of the latest revelations. I was hoping to get a replacement when I got back from vacation.

So, on Tuesday I made the RMA request, telling them that my CPU is unstable even with the latest BIOSes and the Intel defaults.

- 8 hours later, they were asking for the serial number, as I have given them an incorrect serial number initially.
- 4 hours later the RMA was approved, and I received an estimation of 5-7 business days to receive the replacement, from the time I submit the faulty unit.
- 1 hour later I received the DHL return label. The collection was scheduled for Thursday, 2 days later, but I decided not to wait for that.
- 3 hours later I handed over the faulty CPU at a local DHL service point
- 15 hours later the CPU started its travel towards Intel
- 22 hours later the CPU arrived at Intel
- 7 hours later they initiated shipping for the replacement CPU
- 2 hours later the CPU was on its way to me
- 3 hours later I received an email confirming they received the CPU and they need to validate it which would take 1 to 3 business days. Not sure what that email was about, since they already sent the replacement, probably just an automated message.
- 16 hours later I had a brand new 13900KF CPU on my desk.

Overall, the process took just a bit over 3 days, which I find almost unbelievable. I didn't get any pushback from Intel, and apparently when you send them a CPU that they know is very likely to be affected they don't actually do any validation, except that you actually sent them the CPU, and almost immediately send you a replacement.

So, while they might have been reluctant to accept RMAs in the past, it seems that right now they are actually doing the right thing regarding RMAs, and they are trying to make things as painless as possible for their customers.

While I can't totally forgive them for their past behavior, I have to give them a 10/10 for my recent experience.

The new CPU uses just 1.279 V in the BIOS, compared to the defective CPU, which used 1.447V. I left the BIOS on the Intel defaults with no XMP, and so far I don't have any stability issues and it reaches the maximum frequencies properly. But I won't do more extensive testing or stress it more until the microcode updates are released.
That's nice to hear.
Posted on Reply
#367
Dredi
trparkyAccording to a video by Gamer Meld on YouTube that went over an article by Igor's Lab and The Verge, all new replacement processors already have the new microcode loaded.
Yes, but only after it has been made available, in mid august. The now shipping replacements do not have it.

It even explicitly states that in the quote from verge you posted.
Posted on Reply
#368
trparky
Now there's talk about how the problem might be with the ring bus. The reasoning is that for whatever stupid reason, the ring bus is on the same voltage domain as the CPU cores are. So as the cores request higher voltages to hit higher and higher clocks, the ring bus is being fed the same amount of voltage and getting cooked.
DrediIt even explicitly states that in the quote from verge you posted.
Yeah, I just realized that. Oops.
Posted on Reply
#369
Dredi
trparkyNow there's talk about how the problem might be with the ring bus. The reasoning is that for whatever stupid reason, the ring bus is on the same voltage domain as the CPU cores are. So as the cores request higher voltages to hit higher and higher clocks, the ring bus is being fed the same amount of voltage and getting cooked.
That would be.. pretty bad. ”Fixing” it would mean that the top voltage of the cores must be lowered and the chips would no longer be able to hit the advertized clock speeds. Oof.
Posted on Reply
#370
Caring1
trparkyall new replacement processors already have the new microcode loaded.
Pre gimped from the factory.
Posted on Reply
#371
Sunny and 75
Caring1Pre gimped from the factory.
A re-review is called for, then.
Posted on Reply
#372
chrcoluk
trparkyNow there's talk about how the problem might be with the ring bus. The reasoning is that for whatever stupid reason, the ring bus is on the same voltage domain as the CPU cores are. So as the cores request higher voltages to hit higher and higher clocks, the ring bus is being fed the same amount of voltage and getting cooked.


Yeah, I just realized that. Oops.
I linked information which says this isnt the case. I know Buildzoid has said they the same voltages but what other source has said it.

In my bios I have independent controls for core and ring voltage, which is an indication they different, but also Intel documentation says it itself.

edc.intel.com/content/www/us/en/design/products/platforms/details/raptor-lake-s/13th-generation-core-processors-datasheet-volume-1-of-2/002/ring-interconnect/

I will quote it here as well
The Ring shares frequency and voltage with the Last Level Cache (LLC).
Posted on Reply
#373
trparky
chrcolukI linked information which says this isnt the case. I know Buildzoid has said they the same voltages but what other source has said it.

In my bios I have independent controls for core and ring voltage, which is an indication they different, but also Intel documentation says it itself.

edc.intel.com/content/www/us/en/design/products/platforms/details/raptor-lake-s/13th-generation-core-processors-datasheet-volume-1-of-2/002/ring-interconnect/

I will quote it here as well
Then explain how in some cases people have "fixed" their issues by either running RAM at stock (no overclocking), disabling E-Cores thus reducing the possible extra voltage hit, or a combination of the two? The only thing that I can possibly think of is that that reduces the stress on an already compromised ring bus and that the ring bus has been overtaxed by higher than normal voltage.

Then let's take into account that the ring bus as we know it today is still the same ring bus that was used back when Intel was still pumping out quad-core CPUs. As Intel began to add more cores to their processors with the HEDT line, they found that adding more cores to the ring bus introduced too much stress and latency thus leading to reduced performance. Hence it leads to why Intel introduced a mesh-style bus to their X-series of chips that were part of their HEDT lineup.

Fast-forward to today and Intel is expecting the very same ring bus to handle far more cores than it ever was designed to handle.
Posted on Reply
#374
chrcoluk
trparkyThen explain how in some cases people have "fixed" their issues by either running RAM at stock (no overclocking), disabling E-Cores thus reducing the possible extra voltage hit, or a combination of the two? The only thing that I can possibly think of is that that reduces the stress on an already compromised ring bus and that the ring bus has been overtaxed by higher than normal voltage.

Then let's take into account that the ring bus as we know it today is still the same ring bus that was used back when Intel was still pumping out quad-core CPUs. As Intel began to add more cores to their processors with the HEDT line, they found that adding more cores to the ring bus introduced too much stress and latency thus leading to reduced performance. Hence it leads to why Intel introduced a mesh-style bus to their X-series of chips that were part of their HEDT lineup.

Fast-forward to today and Intel is expecting the very same ring bus to handle far more cores than it ever was designed to handle.
I never said the ring voltage hasnt gone up, just I think its not the same as core voltage.

On RL the cache clock got a massive boost, so logically voltage will have gone up also to power that.
Posted on Reply
#375
Lewzke
My friend has no issues with 13700KF, but he runs DDR5-5600 "only", the maximum official RAM speeds. I think I will leave his BIOS as it is until this issue is fully solved. (discovered)
Posted on Reply
Add your own comment
Jul 23rd, 2025 02:53 CDT change timezone

New Forum Posts

Popular Reviews

TPU on YouTube

Controversial News Posts