Hard resets playing games

flashfrenzy · Mar 22, 2021

I just finished building a new system this last week and twice now I've had hard resets playing Shadow of the Tomb Raider. In one case it was after an hour or two and the second time it was within ~15 min. The second time I had both hwinfo and gpu-z logging running. Everything seems fine from the logs (links below) but maybe someone can see something I don't. I don't think the restart is thermal related. The CPU was at around 50C at the time of the restart. In games it seems to be around 50-60C and idle in the low 30s. Since the restarts I completely cleared the CMOS and went 100% stock (no XMP). Since then, I haven't seen any restarts but I've been running prime95 tests and always run into errors within 2 hours. The CPU does get very hot in the tests (up to 90C) but I'm not sure if this is typical or not for prime95 since my normal load temps seem fine. I've also run memtest86 with XMP enabled and FCLK at 1800 and 3 passes (3 and half hours) there were no errors so I don't suspect the RAM at this point. My last runs of P95 I had the case open with a giant fan blowing in. It didn't really seem to affect anything. My GPU temps were way down (irrelevant, I know) but CPU was unaffected. I'm kind of at a loss of what to look for or do at this point. Any suggestions/advice would be greatly appreciated.

Component rundown: 5800x, ASUS b550-f mobo, EVGA RTX3060, Arctic Freezer 280 AIO, Super Flower Leadex III 850W, 32 GB Crucial Ballistix CL16 3600

BIOS and drivers are 100% up to date.

hwinfo logs: https://docs.google.com/spreadsheets/d/1ggHi0jCAs5BACIKK_BJT7cCELAP9mVAvYHBBip8FtSk/edit?usp=sharing
gpu-z logs: https://docs.google.com/spreadsheets/d/1Fc2VbCaYCh_FgW8JDO1Q9o771ZlAXU5dhhazVmOR5_Y/edit?usp=sharing

Ruyki · Mar 22, 2021

Hard restarts may indicate a power issue.
Check Windows error logs. They may contain clues on what caused the shutdown.
Errors in prime95 mean that the CPU makes mistakes when it's processing instructions. This should be fixed or the system probably won't be able to work reliably. This could be caused by incorrectly configured CPU/RAM/mainboard or CPU/RAM/mainboard fault.

flashfrenzy · Mar 22, 2021

Thanks, yeah there's absolutely nothing in the windows event viewer, unfortunately. I guess what other options are there aside from just RMA'ing components until it stops? I'm just not sure where to start. I guess RAM is the easiest but seems like the least likely source at this point.

Ruyki · Mar 22, 2021

Nothing in event viewer means that it can be a power issuer, or something else. So this information did not really help to pinpoint the issue.
I would try other CPU stability testing software to see if you get errors in other software or just in prime95. I'm not sure which ones are good now. I guess OCCT may still be good.
You can also check if the CPU voltage and all other data points in an app like hwinfo64 look good.
I'm not at all familiar with the Ryzen platform so the above is just generic advice really. Someone who knows more about Ryzen can probably assist better.

RJARRRPCGP · Mar 23, 2021

flashfrenzy said:
I've been running prime95 tests and always run into errors within 2 hours.

That's classic of a manual CPU core OC with not enough Vcore.

flashfrenzy · Mar 23, 2021

RJARRRPCGP said:
That's classic of a manual CPU core OC with not enough Vcore.

I wish that were the case. Unfortunately I don't have any OC, it's 100% default BIOS config, even XMP disabled.

Chomiq · Mar 23, 2021

32 GB Crucial Ballistix CL16 3600

That's 2 sticks or one stick? Give us the exact part number. If they are 2x16 GB sticks they can be single or dual rank. Use Zen Timings: https://zentimings.protonrom.com/ and post screenshot.

I've had hard resets with nothing in event log when I was running an unstable ram OC on my 3700x. Memtest86 wouldn't show a thing but running https://www.techpowerup.com/download/techpowerup-memtest64/ pretty much guaranteed a hard reset.

flashfrenzy · Mar 23, 2021

Here's the timings (with DOCP enabled). I unfortunately don't know enough about Ryzen memory overclocking yet to know what is normal and what isn't. These unfortunately are single rank, not dual rank as Crucial recently changed that for the 2x16GB sticks. Thanks for the tip on memtest64, I'll give that a shot.

ZenTimings_Screenshot_26941888.5416662.png

Deleted member 205776 · Mar 23, 2021

Your MCLK, FCLK and UCLK values should all be the same. I doubt that's what's causing it, but you should rectify that. It means that the board failed to boot with unified fabric & memory clocks. That will introduce unnecessary amounts of latency during gaming. Either that or it didn't attempt to do it in the first place (which I doubt as I have the same board without WiFi).

Try setting FCLK yourself manually to 1800 in the BIOS.

I would also suggest setting:
- VDDG CCD to 1.050v
- VDDG IOD to 0.950v
- CLDO VDDP to 0.900v
- DRAM voltage to 1.35v if it isn't already set that way
(all of these are under Ai Tweaker)

... and disabling Global C-State control under Advanced --> AMD CBS (this may help with the resets, it helped with my random idle reboots but I'm not sure)

No worries, you won't break anything by doing this. It'll either fix the affected 3 clocks mentioned above or not fix anything at all, in which case you can easily reset them back.

For example, this is how mine looks like:

flashfrenzy · Mar 23, 2021

Yes, I actually just noticed that myself. It was from me messing around in the BIOS yesterday and not properly setting everything back to default. Here's what it has been in the past when experiencing resets.

ZenTimings_Screenshot_26941910.7752197.png

And thanks for the other tips, I will give those a shot as well.

Deleted member 205776 · Mar 23, 2021

flashfrenzy said:
Yes, I actually just noticed that myself. It was from me messing around in the BIOS yesterday and not properly setting everything back to default. Here's what it has been in the past when experiencing resets.View attachment 193531

We can rule that out then.

Still, looking at your SoC voltage, I would recommend you adjust the voltages as I posted above. The voltage on the VDDG CCD is too little while it's too high on the VDDG IOD. VDDP is fine.

Also consider disabling C-States.

Chomiq · Mar 23, 2021

@flashfrenzy What BIOS version are you running on that board?

flashfrenzy · Mar 23, 2021

BIOS version is 1804

Also, the system just had a reset WITHOUT DOCP enabled. Will try looking at voltages and disabling C-States.

Deeveo · Mar 23, 2021

flashfrenzy said:
Yes, I actually just noticed that myself. It was from me messing around in the BIOS yesterday and not properly setting everything back to default. Here's what it has been in the past when experiencing resets.View attachment 193531

And thanks for the other tips, I will give those a shot as well.

That VDDG IOD voltage looks way too high, could be causing issues. With RAM at 3600 you should be able to lower those voltages quite a bit. Something like:
CLDO VDDP: 0.800V
VDDG IOD: 0.850V
VDDG CCD: 0.900V (could run this higher if needed)
VSOC looks ok for those speeds.

@Alexa :s values on earlier post should work aswell. Should also try to find ProcODT etc settings for your Micron B-dies, should help with stability.

Deleted member 205776 · Mar 23, 2021

I would not go down in the 0.800v range since that caused me audio stuttering issues. The voltages I inputted should be fine for 3600 MT/s RAM.

thesmokingman · Mar 23, 2021

Alexa said:
... and disabling Global C-State control under Advanced --> AMD CBS (this may help with the resets, it helped with my random idle reboots but I'm not sure)

No worries, you won't break anything by doing this. It'll either fix the affected 3 clocks mentioned above or not fix anything at all, in which case you can easily reset them back.

You shouldn't ever have to disable c-states or any of the power saving features. If one has issues with them enabled it's not the feature at fault obviously and the real issue lies elsewhere.

GerKNG · Mar 23, 2021

flashfrenzy said:
I just finished building a new system this last week and twice now I've had hard resets playing Shadow of the Tomb Raider. In one case it was after an hour or two and the second time it was within ~15 min. The second time I had both hwinfo and gpu-z logging running. Everything seems fine from the logs (links below) but maybe someone can see something I don't. I don't think the restart is thermal related. The CPU was at around 50C at the time of the restart. In games it seems to be around 50-60C and idle in the low 30s. Since the restarts I completely cleared the CMOS and went 100% stock (no XMP). Since then, I haven't seen any restarts but I've been running prime95 tests and always run into errors within 2 hours. The CPU does get very hot in the tests (up to 90C) but I'm not sure if this is typical or not for prime95 since my normal load temps seem fine. I've also run memtest86 with XMP enabled and FCLK at 1800 and 3 passes (3 and half hours) there were no errors so I don't suspect the RAM at this point. My last runs of P95 I had the case open with a giant fan blowing in. It didn't really seem to affect anything. My GPU temps were way down (irrelevant, I know) but CPU was unaffected. I'm kind of at a loss of what to look for or do at this point. Any suggestions/advice would be greatly appreciated.

Component rundown: 5800x, ASUS b550-f mobo, EVGA RTX3060, Arctic Freezer 280 AIO, Super Flower Leadex III 850W, 32 GB Crucial Ballistix CL16 3600

BIOS and drivers are 100% up to date.

hwinfo logs: https://docs.google.com/spreadsheets/d/1ggHi0jCAs5BACIKK_BJT7cCELAP9mVAvYHBBip8FtSk/edit?usp=sharing
gpu-z logs: https://docs.google.com/spreadsheets/d/1Fc2VbCaYCh_FgW8JDO1Q9o771ZlAXU5dhhazVmOR5_Y/edit?usp=sharing

welcome to the club of broken AMD CPUs.

RMA it. i went through this a few times now with a bunch of CPUs. (my 5900X was not stable at stock speeds.)

do you have any cache hierachy related errors in the eventviewer after the crashes?

RJARRRPCGP · Mar 23, 2021

Chomiq said:
I've had hard resets with nothing in event log when I was running an unstable ram OC on my 3700x.

So you would get power-loss-like-symptoms, which usually would mean a bad VRM IC, MOSFET(s) or bad cap(s). That's spooky! Reminds me of one day when an Acer LCD monitor that I had, was power-cycling and then I realized that the power went out for a second multiple times! (power problems during that time and wasn't sure if the power utility company noticed or not)

More likely, I would suspect Windows crashing and rebooting, but unable to even keep a record of a bugcheck in the event log. If Windows failed to log a hardware-related bugcheck, then it will look like a flaky power source!

thesmokingman · Mar 23, 2021

flashfrenzy said:
Since the restarts I completely cleared the CMOS and went 100% stock (no XMP). Since then, I haven't seen any restarts but I've been running prime95 tests and always run into errors within 2 hours. The CPU does get very hot in the tests (up to 90C) but I'm not sure if this is typical or not for prime95 since my normal load temps seem fine. I've also run memtest86 with XMP enabled and FCLK at 1800 and 3 passes (3 and half hours) there were no errors so I don't suspect the RAM at this point. My last runs of P95 I had the case open with a giant fan blowing in. It didn't really seem to affect anything. My GPU temps were way down (irrelevant, I know) but CPU was unaffected. I'm kind of at a loss of what to look for or do at this point. Any suggestions/advice would be greatly appreciated.

Component rundown: 5800x, ASUS b550-f mobo, EVGA RTX3060, Arctic Freezer 280 AIO, Super Flower Leadex III 850W, 32 GB Crucial Ballistix CL16 3600

BIOS and drivers are 100% up to date.

hwinfo logs: https://docs.google.com/spreadsheets/d/1ggHi0jCAs5BACIKK_BJT7cCELAP9mVAvYHBBip8FtSk/edit?usp=sharing
gpu-z logs: https://docs.google.com/spreadsheets/d/1Fc2VbCaYCh_FgW8JDO1Q9o771ZlAXU5dhhazVmOR5_Y/edit?usp=sharing

That deep into a p95 run usually indicates memory errors.

RJARRRPCGP · Mar 23, 2021

GerKNG said:
welcome to the club of broken AMD CPUs.

RMA it. i went through this a few times now with a bunch of CPUs. (my 5900X was not stable at stock speeds.)

do you have any cache hierachy related errors in the eventviewer after the crashes?

When you posted that, it reminds me of my close buddy's FX9590! When he was just Skyping and the like, it kept going down! I think at that point, I was familiar with the dreaded "There is a problem with this call" message (or similar) from Skype and the familiar sound of losing a call! (2014, when he had to return that FX9590) I suspected that there were a lot of faulty FX 9590s, because I thought I saw multiple posts of FX 9590s being flaky at stock!

flashfrenzy · Mar 23, 2021

GerKNG said:
do you have any cache hierachy related errors in the eventviewer after the crashes?

I get nothing in the event viewer.

Btw, this is my timings that I just experienced the most recent restart. 100% default BIOS settings.

flashfrenzy · Mar 24, 2021

OK, so I got a new set of memory and have the same thing happening but now I'm seeing WHEA errors to go along with it.

Deeveo · Mar 24, 2021

flashfrenzy said:
OK, so I got a new set of memory and have the same thing happening but now I'm seeing WHEA errors to go along with it.

View attachment 193733

Was this at stock aswell, with no DOCP enabled? Starting to smell like there is an issue with the cpu itself.

Did you try manually setting the CLDO VDDG etc voltages like @Alexa suggested? Although at stock it should run with auto setting without issues.

Alexa said:
Your MCLK, FCLK and UCLK values should all be the same. I doubt that's what's causing it, but you should rectify that. It means that the board failed to boot with unified fabric & memory clocks. That will introduce unnecessary amounts of latency during gaming. Either that or it didn't attempt to do it in the first place (which I doubt as I have the same board without WiFi).

Try setting FCLK yourself manually to 1800 in the BIOS.

I would also suggest setting:
- VDDG CCD to 1.050v
- VDDG IOD to 0.950v
- CLDO VDDP to 0.900v
- DRAM voltage to 1.35v if it isn't already set that way
(all of these are under Ai Tweaker)

... and disabling Global C-State control under Advanced --> AMD CBS (this may help with the resets, it helped with my random idle reboots but I'm not sure)

No worries, you won't break anything by doing this. It'll either fix the affected 3 clocks mentioned above or not fix anything at all, in which case you can easily reset them back.

For example, this is how mine looks like:

Space Lynx · Mar 24, 2021

flashfrenzy said:
I just finished building a new system this last week and twice now I've had hard resets playing Shadow of the Tomb Raider. In one case it was after an hour or two and the second time it was within ~15 min. The second time I had both hwinfo and gpu-z logging running. Everything seems fine from the logs (links below) but maybe someone can see something I don't. I don't think the restart is thermal related. The CPU was at around 50C at the time of the restart. In games it seems to be around 50-60C and idle in the low 30s. Since the restarts I completely cleared the CMOS and went 100% stock (no XMP). Since then, I haven't seen any restarts but I've been running prime95 tests and always run into errors within 2 hours. The CPU does get very hot in the tests (up to 90C) but I'm not sure if this is typical or not for prime95 since my normal load temps seem fine. I've also run memtest86 with XMP enabled and FCLK at 1800 and 3 passes (3 and half hours) there were no errors so I don't suspect the RAM at this point. My last runs of P95 I had the case open with a giant fan blowing in. It didn't really seem to affect anything. My GPU temps were way down (irrelevant, I know) but CPU was unaffected. I'm kind of at a loss of what to look for or do at this point. Any suggestions/advice would be greatly appreciated.

Component rundown: 5800x, ASUS b550-f mobo, EVGA RTX3060, Arctic Freezer 280 AIO, Super Flower Leadex III 850W, 32 GB Crucial Ballistix CL16 3600

BIOS and drivers are 100% up to date.

hwinfo logs: https://docs.google.com/spreadsheets/d/1ggHi0jCAs5BACIKK_BJT7cCELAP9mVAvYHBBip8FtSk/edit?usp=sharing
gpu-z logs: https://docs.google.com/spreadsheets/d/1Fc2VbCaYCh_FgW8JDO1Q9o771ZlAXU5dhhazVmOR5_Y/edit?usp=sharing

it may not be the ram it all. it may be PBO. disable PBO in mobo bios. some ryzen chips simply aren't stable with PBO on. (is usually on bydefault or on auto in mobo bios) so manually make sure it says disabled under PBO.

flashfrenzy · Mar 24, 2021

Deeveo said:
Was this at stock aswell, with no DOCP enabled? Starting to smell like there is an issue with the cpu itself.

Did you try manually setting the CLDO VDDG etc voltages like @Alexa suggested? Although at stock it should run with auto setting without issues.

Yes, I forgot to update, but I tried the above voltages. Same issue. I've also had the issue occur with both sets of memory at non-DOCP. New set is GSkill.

lynx29 said:
it may not be the ram it all. it may be PBO. disable PBO in mobo bios. some ryzen chips simply aren't stable with PBO on. (is usually on bydefault or on auto in mobo bios) so manually make sure it says disabled under PBO.

Yes, I don't think it's memory related at this point. I also did try disabling PBO and it still occured.

I guess the question is what I should try next? Mobo would be easiest to try since it's still within Amazon's return window. CPU I got direct from AMD so I have no idea how long that process will take. But their shipping was SLOW so probably a while. I have an older PSU (6 year old EVGA G2 750W that has worked flawlessly) if anyone things that could make any difference whatsoever.

System Name	maipc
Processor	4790k @ 4.4Ghz / 1.16v
Motherboard	Asus vii hero
Cooling	Noctua NH-D14
Memory	16GB (2x8GB) Hyperx fury 1866 / CL10
Video Card(s)	MSI RTX 3060 Ti Gaming X 8GB
Storage	1TB 860EVO + 1TB 860EVO + 4TB WD Red + 4TB WD Red
Display(s)	Asus VG259QM (1080p IPS 240Hz)
Case	Cooler Master Centurion 6
Power Supply	Seasonic Prime GX-650
Mouse	Logitech G Pro X Superlight
Keyboard	Wooting 60HE
Software	Win 10 64bit

System Name	maipc
Processor	4790k @ 4.4Ghz / 1.16v
Motherboard	Asus vii hero
Cooling	Noctua NH-D14
Memory	16GB (2x8GB) Hyperx fury 1866 / CL10
Video Card(s)	MSI RTX 3060 Ti Gaming X 8GB
Storage	1TB 860EVO + 1TB 860EVO + 4TB WD Red + 4TB WD Red
Display(s)	Asus VG259QM (1080p IPS 240Hz)
Case	Cooler Master Centurion 6
Power Supply	Seasonic Prime GX-650
Mouse	Logitech G Pro X Superlight
Keyboard	Wooting 60HE
Software	Win 10 64bit

System Name	KHR-1
Processor	Ryzen 9 5900X
Motherboard	ASRock B550 PG Velocita (UEFI-BIOS P3.40)
Memory	64 GB G.Skill RipJaws V F4-3200C16D-64GVK
Video Card(s)	Sparkle Titan Arc A770 16 GB
Storage	Samsung 990 Pro 1 TB NVMe SSD
Display(s)	Alienware AW3423DWF OLED-ASRock PG27Q15R2A (backup)
Case	Corsair 275R
Audio Device(s)	Technics SA-EX140 receiver with Polk VT60 speakers
Power Supply	eVGA Supernova G3 750W
Mouse	Logitech G Pro (Hero)
Software	Windows 11 Pro x64 24H2

Processor	Ryzen 7 5800X3D
Motherboard	Gigabyte X570 Aorus Elite
Cooling	Thermalright Phantom Spirit 120 SE
Memory	2x16 GB Crucial Ballistix 3600 CL16 Rev E @ 3600 CL14
Video Card(s)	RTX3080 Ti FE
Storage	SX8200 Pro 1 TB, Plextor M6Pro 256 GB, WD Blue 2TB
Display(s)	LG 34GN850P-B
Case	Lancool 207
Audio Device(s)	SoundBlaster G6 \| Fidelio X2 \| Sennheiser 6XX
Power Supply	SeaSonic Focus Plus Gold 750W
Mouse	Endgame Gear XM1R
Keyboard	Wooting Two HE

Processor	Ryzen 7 5800X3D
Motherboard	Gigabyte X570 Aorus Elite
Cooling	Thermalright Phantom Spirit 120 SE
Memory	2x16 GB Crucial Ballistix 3600 CL16 Rev E @ 3600 CL14
Video Card(s)	RTX3080 Ti FE
Storage	SX8200 Pro 1 TB, Plextor M6Pro 256 GB, WD Blue 2TB
Display(s)	LG 34GN850P-B
Case	Lancool 207
Audio Device(s)	SoundBlaster G6 \| Fidelio X2 \| Sennheiser 6XX
Power Supply	SeaSonic Focus Plus Gold 750W
Mouse	Endgame Gear XM1R
Keyboard	Wooting Two HE

System Name	Gaming rig
Processor	AMD Ryzen 7 5900X
Motherboard	Asus X570-Plus TUF /w "passive" chipset mod
Cooling	Noctua NH-D15S
Memory	Crucial Ballistix Sport LT 2x16GB 3200C16 @3600C16
Video Card(s)	MSI RTX 3060 TI Gaming X Trio
Storage	Samsung 970 Pro 1TB, Crucial MX500 2TB, Samsung 860 QVO 4TB
Display(s)	Samsung C32HG7x
Case	Fractal Design Define R5
Audio Device(s)	Asus Xonar Essence STX
Power Supply	Corsair RM850i 850W
Mouse	Logitech G502 Hero
Keyboard	Logitech G710+
Software	Windows 10 Pro

Processor	AMD 5900x
Motherboard	Asus x570 Strix-E
Cooling	Hardware Labs
Memory	G.Skill 4000c17 2x16gb
Video Card(s)	RTX 3090
Storage	Sabrent
Display(s)	Samsung G9
Case	Phanteks 719
Audio Device(s)	Fiio K5 Pro
Power Supply	EVGA 1000 P2
Mouse	Logitech G600
Keyboard	Corsair K95

Processor	AMD Ryzen 9 9950X3D
Motherboard	ASRock B850M PRO-A
Cooling	Corsair Nautilus 360 RS
Memory	2x32GB Kingston Fury Beast 6000 CL30
Video Card(s)	PowerColor Hellhound RX 9070 XT
Storage	1TB Samsung 990 Pro, 2TB Samsung 990 Pro, 4TB Samsung 990 Pro
Display(s)	LG 27GS95QE-B, MSI G272QPF E2
Case	Lian Li DAN Case A3 Black Wood Edition
Audio Device(s)	Bose Companion Series 2 III, Sennheiser GSP600 and HD599 SE - Creative Soundblaster X4
Power Supply	Corsair RM1000X ATX 3.1
Mouse	Razer Deathadder V3
Keyboard	Razer Black Widow V3 TKL
VR HMD	Oculus Rift S

Hard resets playing games

New Member

New Member

New Member

New Member

Deleted member 205776

Guest

New Member

Deleted member 205776

Guest

New Member

Deleted member 205776

Guest

New Member

Attachments

New Member

Astronaut

New Member

Similar threads