BSOD whea_uncorrectable_error

kuuuuujo · Aug 5, 2021

I built PC year ago, everything worked up to this point. Started getting BSOD. Always whea_uncorrectable_error with no additional information. It will never crash when it's idle, only when I do something. The biggest issue is that the error is displayed stuck at 0% of saving memory dump and it never dumps it. I tried tips from dozens of posts but I just can't get it to dump, after restart there is never any MEMORY.dmp and the minidump folder is empty. I have the settings correct, tried all types of dumps and increased paging file to be bigger than my ram.

I noticed it always crashes somewhere during, or after SSD test in UserBenchmark, so I use it to debug the problem. I monitor temperatures, but I don't see anything out of the ordinary when it crashes.

Corsair MP600 SSD could be the cause because when I was building PC I removed the original heat sink from it, and used the one from motherboard instead. The temps aren't that high though, it's around 45-50 when it crashes. all the temperatures are in 30-65 range so the PC doesn't seem to overheat that much, but... I temporarily attached a fan to blow the heat away from the SSD and it crashes a bit later than usual, so there is a difference.

What could be the issue here?

So far I:

updated bios
updated drivers
updated windows
updated SSD firmware
installed the original heatsink with new pad
checked SSD health with multiple applications, none reported issues
removed Sonic Radar 3 as I heard this causes issues
did sfc /scannow

EDIT: System specs:
Asus ROG STRIX X570-I
Ryzen 7 3700X
RTX 2080 Ti
Corsair MP600 2TB M.2
Corsair Vengeance 2x32GB 3000MHz
Corsair SF750 750W

neatfeatguy · Aug 5, 2021

Folks are going to ask for your system specs - so you might want to post what they are.

GerKNG · Aug 5, 2021

SMART values are fine for your SSD?
Run Memtest (just for 5 minutes to see if something is really wrong)
if that's all fine try something that causes issues on a bunch of ryzen platforms.

go to the DRAM Settings and disable the Power Down Mode.
and change the PSU Idle Control from Auto to typical current idle (just for good measure. probably not the problem but it does not hurt either)

Tetras · Aug 5, 2021

Seeing as you suspect the drive, do you have a spare? Even some old 128GB SSD? I thought WHEA was more commonly related to windows updates, CPU or memory config.

RJARRRPCGP · Aug 5, 2021

Looks like a faulty processor or motherboard. Unless it's reporting an error for RAM, but often, it isn't because of RAM. But a faulty IMC, can cause RAM-related errors.
I'm golden with a Ryzen 7 3700X here with an MSI B450 Tomahawk motherboard.

Looks like you should clear the CMOS and then re-enter the boot order, boot config, date and time.

I have Corsair Vengeance LPX 2x8GB 3000 Mhz with XMP and Fclk at 1:1 and still no RAM-related error.

Fangio1951 · Aug 5, 2021

Do you have any overclocks applied = RAM, CPU, GPU, etc ??

Deleted member 211755 · Aug 5, 2021

Run Memtest for two hours and see if it spits out any errors.
My money's on the RAM here.

P.S. I had the same memory modules and after a while they couldn't hold their XMP speeds
I had to run at stock FQ to ensure system stability

MxPhenom 216 · Aug 5, 2021

Whats the ID of the event? 18, 19?

Logan7 · Aug 5, 2021

Is "CRC Error Count" (or similarly-worded) above 0 in SMART? Any WHEA logger events in Event Viewer?

DeathtoGnomes · Aug 5, 2021

Logan7 said:
Any WHEA logger events in Event Viewer?

First place to look when you get BSOD, event viewer. cant miss the big red X.

kuuuuujo · Aug 6, 2021

Thank you everyone for the suggestions, here's where I'm standing now:

SMART values are fine on SSD
I ran MemTest86 for 1 hour, up to Test 13, 0 errors
Switched DRAM Timing Control / Power Down Enable to disabled
Changed the PSU Idle Control from Auto to Typical Current Idle
Never did any OC

Those steps didn't solve the issue.

I connected second SSD and installed fresh Windows 10 on it, updated drivers and ran userBenchmark again. This time it passes every time and I guess that since Windows is no longer on the disk that is potential culprit it doesn't BSOD so I can see errors that userBenchmark throws:

Code:

ERROR: G: Drive bench execution failed
ERROR: t[0:0] error during write: A device which does not exist was specified. (433)
ERROR: There has been an error during threads execution

G is the MP600 SSD. I went to the event viewer and these are the events that trigger at the time benchmark runs:

Code:

Information: Volume G: (\Device\HarddiskVolume4) is healthy. No action is needed.
Warning: Reset to device, \Device\RaidPort2, was issued.
Error: The driver detected a controller error on \Device\RaidPort2.
Warning: An error was detected on device \Device\Harddisk1\DR1 during paging operation.
Error: A fatal hardware error has occured. A record describing the condition is contained in the data section of this event.

After those events it's just a never ending log of warnings for that device. Since there's no BSOD, there's still no memory dump, there's only XML file of event data which I don't understand at all and I am not sure if it's even useful.

I am biased, since I want it to be SSD because it's less troublesome than motherboard. Could it be motherboard? BIOS? Maybe me connecting new drive caused some issues that I associate with previous BSOD? Or does this prove it's SSD?

Logan7 · Aug 6, 2021

kuuuuujo said:
G is the MP600 SSD. I went to the event viewer and these are the events that trigger at the time benchmark runs:

Code:

Information: Volume G: (\Device\HarddiskVolume4) is healthy. No action is needed. Warning: Reset to device, \Device\RaidPort2, was issued. Error: The driver detected a controller error on \Device\RaidPort2. Warning: An error was detected on device \Device\Harddisk1\DR1 during paging operation. Error: A fatal hardware error has occured. A record describing the condition is contained in the data section of this event.

After those events it's just a never ending log of warnings for that device. Since there's no BSOD, there's still no memory dump, there's only XML file of event data which I don't understand at all and I am not sure if it's even useful.

If you go into the "Details" tab of at least this error:

Code:

Error: A fatal hardware error has occured. A record describing the condition is contained in the data section of this event.

You should be able to get the RawData output of letters and numbers. If you paste that into a hex to string converter, you may see your SSD listed in there somewhere. For example, for the one seen here (https://docs.microsoft.com/en-us/an...rdware-error-occured-whea-logger-event-i.html), if you paste the RawData into the converter, you get

Code:

CPERÿÿÿÿ�������Î��3�
<`ÁƒR§H‡ÑÙF}we����������������|!Wf^ûD€3›tÊÎß[ø3�p.ˆN™,o&ÚóÛzâuF†É§Ö�����������������������È�����������������������������������������������������������������STORPORT�¤�������� 0û§àÓ    [±ß9´s�t�o�r�a�h�c�i�����������������INTEL   �SSDSC2KW010X6����¤���¤���}àP���������� ������������������d�������������������2���ÿÿÿÿ������������ �������������������������2�������e���d���������������4���2���2�����������”��

In the middle of that you can see "INTEL SSDSC2KW010X6" is causing the fault.

Reason I know this is because I had a similar issue with occasionally getting that error with a rare blue screen (not as easily as you're getting them) and it ended up being some kind of incompatibility between the older AMD chipset (~2010) and my Samsung EVO 860 SSD - the answer was to turn off Native Command Queuing (NCQ). You're on much newer hardware so that doesn't seem as likely.

This is the reason I asked about the CRC error count in SMART - with NCQ on the error count would go up if I ran something like CrystalDiskMark, even though it shows the value as being "Good." It hasn't increased since disabling NCQ months ago.

Those other events do seem relevant for sure with the issue you're having, hopefully someone here has experience with them.

kuuuuujo · Aug 6, 2021

MxPhenom 216 said:
Whats the ID of the event? 18, 19?

The ID of the fatal hardware error is 1, if that's what you mean. I get this when reading event list in windows, I got no other numbers/codes on BSOD when it happened.

Logan7 said:
If you go into the "Details" tab of at least this error:

Code:

Error: A fatal hardware error has occured. A record describing the condition is contained in the data section of this event.

I get this:

CristalDiskInfo doesn't give me any S.M.A.R.T. data at all.

When I was using Windows on that drive, every time I did something hardware intensive (like benchmark) I would get BSOD. It seems that when I use Windows on different SSD I don't get BSOD, but after the error appears in events the MP600 disk becomes disconnected, as in it's no longer detected by any software.

campossilva · Aug 6, 2021

To solve this problem:
PRECISION BOOST OVERDRIVE [Enhanced Mode 3]

AMD CBS\
CORE PERFORMANCE BOOST [Auto]
Global C-State Control [Disabled]

AMD Overclocking\
ECO Mode [Disabled]

Precision Boost Overdrive [Advanced]
PBO Limits [Motherboard]
Precision Boost Overdrive Scalar [Auto]
Curve Optimizer [Disabled]
Max CPU Boost Clock Override [100MHz] (200 works too =slight increase of 50-80 pts in Cinebench r20 multi, but more W)
Platform Thermal Throttle Limit [Manual]
Platform Thermal Throttle Limit 255

kuuuuujo · Aug 7, 2021

campossilva said:
To solve this problem:

I have the latest BIOS and I don't have all those options you mentioned.

PRECISION BOOST OVERDRIVE [Enhanced Mode 3] I don't have Enhanced Modes anywhere in settings
CORE PERFORMANCE BOOST [Auto] was already set by default
Global C-State Control [Disabled] changed this
ECO Mode [Disabled] was already set by default
Precision Boost Overdrive [Advanced] only have disabled/enabled/manual so I set it to manual
PBO Limits [Motherboard] setting manual PBO allows to change PPT, TDC and EDC Limits, but no Motherboard option anywhere
Precision Boost Overdrive Scalar [Auto] was set by default
Curve Optimizer [Disabled] don't have that option
Max CPU Boost Clock Override [100MHz] (200 works too =slight increase of 50-80 pts in Cinebench r20 multi, but more W) changed this
Platform Thermal Throttle Limit [Manual] changed this
Platform Thermal Throttle Limit 255 changed this

Unfortunately after those changes I was able to make the problem wasn't fixed.

HD64G · Aug 7, 2021

Run the RAM at stock settings without XMP and check stability again. As some already posted, most possibly that the RAM timings aren't stable for the voltages applied (RAM, SOC, etc).

kuuuuujo · Aug 7, 2021

As I understand D.O.C.P is the ASUS equivalent of XMP and I never had it turned on. The RAM runs at whatever is default. I never overclocked anything. MemTest86 didn't show errors for the first hour, didn't try it longer.

HD64G · Aug 8, 2021

kuuuuujo said:
As I understand D.O.C.P is the ASUS equivalent of XMP and I never had it turned on. The RAM runs at whatever is default. I never overclocked anything. MemTest86 didn't show errors for the first hour, didn't try it longer.

Nice! Then your M.2 disk should be the root of the problem.

Deleted member 211755 · Aug 9, 2021

Yeah, it seems like SSD is the root after all. I thought it might be the RAM as a lot of BSODs are memory related,
but sometimes it's tricky to find the real cause.

RJARRRPCGP · Aug 9, 2021

spanjaman said:
I thought it might be the RAM as a lot of BSODs are memory related

If all the BSODs that occurred, had stop error codes of RAM corruption being the reason.

Deleted member 211755 · Aug 9, 2021

@RJARRRPCGP In fact, a lot of BSOD ARE memory related.
I didn't say all, but at least half or more of them are indeed memory related
That's my experience.

System Name	Personal / HTPC
Processor	Ryzen 5900x / Ryzen 5600X3D
Motherboard	Asrock x570 Phantom Gaming 4 /ASRock B550 Phantom Gaming
Cooling	Corsair H100i / bequiet! Pure Rock Slim 2
Memory	32GB DDR4 3200 / 16GB DDR4 3200
Video Card(s)	EVGA XC3 Ultra RTX 3080Ti / EVGA RTX 3060 XC
Storage	500GB Pro 970, 250 GB SSD, 1TB & 500GB Western Digital / lots
Display(s)	Dell - S3220DGF & S3222DGM 32"
Case	Titan Silent 2 / CM HAF XB Evo
Audio Device(s)	Logitech G35 headset
Power Supply	850W SeaSonic X Series / 750W SeaSonic X Series
Mouse	Logitech G502
Keyboard	Black Microsoft Natural Elite Keyboard
Software	Windows 10 Pro 64 / Windows 10 Pro 64

Processor	AMD Ryzen 9 9950X3D
Motherboard	ASRock B850M PRO-A
Cooling	Corsair Nautilus 360 RS
Memory	2x32GB Kingston Fury Beast 6000 CL30
Video Card(s)	PowerColor Hellhound RX 9070 XT
Storage	1TB Samsung 990 Pro, 2TB Samsung 990 Pro, 4TB Samsung 990 Pro
Display(s)	LG 27GS95QE-B, MSI G272QPF E2
Case	Lian Li DAN Case A3 Black Wood Edition
Audio Device(s)	Bose Companion Series 2 III, Sennheiser GSP600 and HD599 SE - Creative Soundblaster X4
Power Supply	Corsair RM1000X ATX 3.1
Mouse	Razer Deathadder V3
Keyboard	Razer Black Widow V3 TKL
VR HMD	Oculus Rift S

System Name	KHR-1
Processor	Ryzen 9 5900X
Motherboard	ASRock B550 PG Velocita (UEFI-BIOS P3.40)
Memory	64 GB G.Skill RipJaws V F4-3200C16D-64GVK
Video Card(s)	Sparkle Titan Arc A770 16 GB
Storage	Samsung 990 Pro 1 TB NVMe SSD
Display(s)	Alienware AW3423DWF OLED-ASRock PG27Q15R2A (backup)
Case	Corsair 275R
Audio Device(s)	Technics SA-EX140 receiver with Polk VT60 speakers
Power Supply	eVGA Supernova G3 750W
Mouse	Logitech G Pro (Hero)
Software	Windows 11 Pro x64 24H2

System Name	K9
Processor	i9 9900K @ 5.1Ghz and 32deg C - delid + Grizzly Conductonaught LM
Motherboard	Gigabyte Aorus Z390 Gaming X
Cooling	Custom water cooling loop - GPU + mobo (+VRM's) + CPU
Memory	G Skill - Trident Z RGB DDR4 - 3866Mhz x 32Gb @ 3800Mhz
Video Card(s)	Gigabyte Aorus 11Gb GTX 1080 Ti Waterforce Extreme @ 2250Mhz
Storage	Samsung 500Gb M2 970 EVO + Samsung 850 Pro SSD + ADATA 512Gb SSD + Samsung 1Tb & 3T + WD 1Tb + 3Tb
Display(s)	ASUS 27" ROG Swift 1440p @ 165Hz & BenQ 27" LED
Case	Thermaltake Core P7 - Open frame
Audio Device(s)	Logitech Z906 - 5.1ch
Power Supply	EVGA 1200W
Mouse	Roccat LeadR + Razer Nagar V2 Pro
Keyboard	Corsair K70 LUX with Cherry Red switches
Software	Win 10 Pro 64bit
Benchmark Scores	v/fast

System Name	Main Stack
Processor	AMD Ryzen 7 9800X3D
Motherboard	Asus X870 ROG Strix-A - White
Cooling	Air (temporary until 9070xt blocks are available)
Memory	G. Skill Royal 2x24GB 6000Mhz C26
Video Card(s)	Powercolor Red Devil Radeon 9070XT 16G
Storage	Samsung 9100 Gen5 1TB \| Samsung 980 Pro 1TB (Games_1) \| Lexar NM790 2TB (Games_2)
Display(s)	Asus XG27ACDNG 360Hz QD-OLED \| Gigabyte M27Q-P 165Hz 1440P IPS \| LG 24" 1440 IPS 1440p
Case	HAVN HS420 - White
Audio Device(s)	FiiO K7 \| Sennheiser HD650 + Beyerdynamic FOX Mic
Power Supply	Corsair RM1000x ATX 3.1
Mouse	Razer Viper v3 Pro
Keyboard	Corsair K65 Plus 75% Wireless - USB Mode
Software	Windows 11 Pro 64-Bit

BSOD whea_uncorrectable_error

kuuuuujo

New Member

neatfeatguy

GerKNG

Tetras

RJARRRPCGP

Fangio1951

Deleted member 211755

Guest

MxPhenom 216

ASIC Engineer

Logan7

DeathtoGnomes

kuuuuujo

New Member

Logan7

kuuuuujo

New Member

campossilva

kuuuuujo

New Member

HD64G

kuuuuujo

New Member

HD64G

Deleted member 211755

Guest

RJARRRPCGP

Deleted member 211755

Guest

Processor	Intel i7-4790K @ 4.7 GHz
Motherboard	ASUS Z97-A USB3.1
Cooling	DEEPCOOL GAMMAXX400V2
Memory	16GB DDR3-1600
Video Card(s)	Dell RX 6900 XT
Storage	500GB Samsung 860 EVO SSD
Display(s)	Gigabyte M28U (4K 144Hz 10-bit)
Case	Corsair 4000D
Audio Device(s)	Audio-Technica ATH-M50x
Power Supply	Corsair CX650M
Mouse	Logitech MK295
Keyboard	Logitech MK295

System Name	Dumbass
Processor	AMD Ryzen 7800X3D
Motherboard	ASUS TUF gaming B650
Cooling	Artic Liquid Freezer 2 - 420mm
Memory	G.Skill Sniper 32gb DDR5 6000
Video Card(s)	GreenTeam 4070 ti super 16gb
Storage	Samsung EVO 500gb & 1Tb, 2tb HDD, 500gb WD Black
Display(s)	1x Nixeus NX_EDG27, 2x Dell S2440L (16:9)
Case	Phanteks Enthoo Primo w/8 140mm SP Fans
Audio Device(s)	onboard (realtek?) - SPKRS:Logitech Z623 200w 2.1
Power Supply	Corsair HX1000i
Mouse	Steeseries Esports Wireless
Keyboard	Corsair K100
Software	windows 10 H
Benchmark Scores	https://i.imgur.com/aoz3vWY.jpg?2

Processor	AMD Ryzen 5 5600@80W
Motherboard	MSI B550 Tomahawk
Cooling	ZALMAN CNPS9X OPTIMA
Memory	2*8GB PATRIOT PVS416G400C9K@3733MT_C16
Video Card(s)	Sapphire Radeon RX 6750 XT Pulse 12GB
Storage	Sandisk SSD 128GB, Kingston A2000 NVMe 1TB, Samsung F1 1TB, WD Black 10TB
Display(s)	AOC 27G2U/BK IPS 144Hz
Case	SHARKOON M25-W 7.1 BLACK
Audio Device(s)	Realtek 7.1 onboard
Power Supply	Seasonic Core GC 500W
Mouse	Sharkoon SHARK Force Black
Keyboard	Trust GXT280
Software	Win 7 Ultimate 64bit/Win 10 pro 64bit/Manjaro Linux