- Joined
- Jun 20, 2007
- Messages
- 3,942 (0.64/day)
System Name | Widow |
---|---|
Processor | Ryzen 7600x |
Motherboard | AsRock B650 HDVM.2 |
Cooling | CPU : Corsair Hydro XC7 }{ GPU: EK FC 1080 via Magicool 360 III PRO > Photon 170 (D5) |
Memory | 32GB Gskill Flare X5 |
Video Card(s) | GTX 1080 TI |
Storage | Samsung 9series NVM 2TB and Rust |
Display(s) | Predator X34P/Tempest X270OC @ 120hz / LG W3000h |
Case | Fractal Define S [Antec Skeleton hanging in hall of fame] |
Audio Device(s) | Asus Xonar Xense with AKG K612 cans on Monacor SA-100 |
Power Supply | Seasonic X-850 |
Mouse | Razer Naga 2014 |
Software | Windows 11 Pro |
Benchmark Scores | FFXIV ARR Benchmark 12,883 on i7 2600k 15,098 on AM5 7600x |
After some great advisement by fellow members, I have recently platform JUMPED (ok that might be extreme) from Sandybridge to AM5
And while it's pretty much been good overall, I am stumped about system crashing that is occurring
I might have to update this post as I go as I will invariably think of something later that was missed out
Symptom
System freezes and stays that way indefinitely until I restart manually
Observations
Only seems to occurs when the following is true :
Other :
Windows event viewer /error tracking :
Things I have tried
System/Applications/Services
Graphics /Video card/hardware
Miscellaneous
Tests run
(no errors found)
=--=
Thoughts thus far..
Trying to think of some other tests or ways to do analysis to get at least some idea of where this is stemming from just to get a baseline
I wish this was a simple as straight forward as an extreme negative Curve Optimizer was resulting in black screens or WHEA - at least then it makes sense
And while it's pretty much been good overall, I am stumped about system crashing that is occurring
I might have to update this post as I go as I will invariably think of something later that was missed out
Symptom
System freezes and stays that way indefinitely until I restart manually
- The audio will keep playing (though it does not stutter and lead onto a BSOD
- Visuals are frozen
- Light on keyboard and other peripherals go out
- Input of any type has no effect
Observations
Only seems to occurs when the following is true :
- -3D Application such as a game is running
- - A web browser is open
- I haven't been using the system for a moment or
- I alt tabbed or moved mouse to other monitor or
- I do a scroll on the browser on other monitor
E.g. under load it doesn't occur - and by load I mean the 3D program/game is the focus window(not necessarily full load as in boosting cores/maximum output)
Other :
- When it happens I hear a slight jump in fan speed which suggests to me the CPU has been ramped up/possibly boosting and therefore outputting more heat - of note it's not excessively high, only a moderate increase in fan cycle, which to me (since I set a curve) I know is still within safe temperatures. May not be relevant though might give weight to the idea this could be something to due with sleeping or parked cores trying to spring into action then crashing
- It never occurs during boot or start up of windows (login, services start, programs start etc).
- Some occasions ( not the majority), there's activity even after it freezes - where the screen ever so slightly flickers. By flicker I mean a pulsing of brightness higher, then lower, then higher etc. This might be the physical screen refreshing (yet the supplied image is static), causing some type of anomaly our eyes can detect.
- The time span or interval(s) before it begins is dependent upon when I start a 3D program + a browser e.g. I could go for quite a while without the issue, and then when I meet above conditions it can happen five minutes, or forty-five minutes later
- Memory dumps are infrequent, there isn't one every time or even every other time that this occurs(and the ones I have seen are not very helpful)
- After each crash, my mouse settings through Razer Synapse do not work. The program/app loads, however I have to manually exit and relaunch it. Then it is OK
This happens which each crash/manual reset - Can occur when running 3D application or game from two different drives. e.g. One NVME SSD and the other a rust disk
Windows event viewer /error tracking :
- Windows event viewer has no clear errors that shed light on this or seem related - I can see the event log entries and Kernel entries about unplanned system shutdown though not what caused it
- There's only three WHEA errors and those only occurred a few weeks ago when I was doing some manual voltage testing and was too undervolted to be stable
- I do have some critical and warnings (mostly warnings) entries however :
A) The timing of them does not always align with the crashes or anywhere remotely close
B) The frequency of them is such that it happens multiple times in between each crash. One would think if it was causing the crash it wouldn't error multiple times THEN cause the crash, it would be the first time every time
Examples include
Critical
Event 131, DeviceSetupManager - "Metadata staging failed, result=0x80004005 for container '{2D4CAE54-5B69-5328-BC5D-AF10B6CDF8DF}'"
Event 10010, Distributed Com - "The server {8CFC164F-4BE5-4FDD-94E9-E2AF73ED4A19} did not register with DCOM within the required timeout."
Warning
Event 122, DeviceSetupManager - "Access to drivers on Windows Update was blocked by policy"
Event 200, DeviceSetupManager - "A connection to the Windows Update service could not be established."
Event 201, DeviceSetupManager - "A connection to the Windows Metadata and Internet Services (WMIS) could not be established."
Event 202, DeviceSetup Manager - "The Network List Manager reports no connectivity to the internet."
Event 360, User Device Recognition - "Windows Hello for Business provisioning will not be launched. Device is AAD joined ( AADJ or DJ++ ): Not Tested
User has logged on with AAD credentials: No
Windows Hello for Business policy is enabled: Not Tested
Windows Hello for Business post-logon provisioning is enabled: Not Tested
Event 10016, Distributed Com - "The application-specific permission settings do not grant Local Launch permission for the COM Server application with CLSID
Windows.SecurityCenter.WscDataProtection
and APPID Unavailable
to the user NT AUTHORITY\SYSTEM SID (S-1-5-18) from address LocalHost (Using LRPC) running in the application container Unavailable SID (Unavailable). This security permission can be modified using the Component Services administrative tool."
Things I have tried
System/Applications/Services
- All BIOS settings to default
- Latest BIOS from manufacturer
- All motherboard recommended standard drivers(Chipset, LAN, etc.)
- No devices outstanding in Device Manager
- Latest Windows update
- Starting Windows with selective - Microsoft only /non user services
- Starting Windows with virtually no programs /user programs
- Used both Edge and Brave browser (relevance here is it seems to only happen when browser is running)
- Disable/enable hardware acceleration (for example had off in Edge and on in Brave)
- Windows power plan with minimum processor state at 5% versus 90% versus 95%
- Power plan adjusted to not allow anything to auto turn off or goto sleep or similar
- Amended BIOS for testing with :
- PSU idle power behavior to "typical" instead of "auto" or "low"
- Disabled C-states and DF states
- Attempted to set positive offset voltage( no joke, I cannot - apparently according to AsRock this was disabled, yet you can still change voltages overall...)
- Set the DDR Power management to disable the idle state /lower power state of DDR5
- Set RAM to use EXPO to see whether it would bump the voltage as part of the settings to give more stability(assuming the automatically supplied voltage is not enough)
- Set RAM manual timings to ensure they match the advertised product speeds
Graphics /Video card/hardware
- Windows graphics settings (GPU acceleration, Variable Rate, Games Optimization disabled)
- Graphics drivers removed reinstalled
- Disabled onboard GPU in Device manager
- Disconnecting any additional monitors physically (or in software /both)
- Tried to recreate while having Edge/Brave on the main monitor versus a secondary or tertiary monitor - happens on both monitors
- Reseated GPU in motherboard
- Checked mount studs for case - > motherboard they seem OK
- Set Nvidia properties to allow all programs, even browser to use Performance /Maximum ; also tested with Optimal Power setting
Miscellaneous
- Loaded up a Windows install from another drive
- Checked temperatures for CPU, Cores, motherboard components/VRM, RAM, SSD/rust drives, GPU
- Power delivery seems OK - I find no power bottlenecking /delivery issues and when programs are going, they are going without issue (clock speeds boost, wattage is delivered as expected)
Tests run
(no errors found)
- Ycruncher, Prime 95 and Core cycler
- OCCT
- Memtest64and classic RAM test
- AIDA 64 benchmarks
- Cinebench
- Various game or 3Dmark benchmarks
=--=
Thoughts thus far..
- Given it happens on stock settings that is concerning
- Hardware cause instead of software seems more likely be the case as it also occurs when I load up the other Windows system install on a separate SSD drive which admittedly is also Windows 11 though a previous version and - of note doesn't included all the same drivers, as it was from the Sandybridge system prior
- It seems to be more on a low power or low clock usage, where maybe coming out of sleep state or similar is resulting in a crash. And I could understand when using a heavy negative Curve Optimizer as it is known to be unstable at low voltage draw or frequency. However Curve is off and I also disabled the C and DF states in the testing
- It's not the a-typical symptoms you would associate with unstable system due to low voltage setting or in Ryzen's case PBO and Curve Optimizer (even if there was one) i.e. no BSOD, no WHEA, no black screen and system reboot
- I don't believe the aforementioned Event Viewer entries are related. Reading around they seem to be par for the course with Windows and often benign. As well a red herring or misleading - for example it states my Network List is not working or indicates I have no network connection however I do
- Memory dumps haven't been proved to be much use though should anyone else want a copy to take a crack at looking over it, please ask
- I am in possession of a system restore that is /was made not long after Windows install, with some drivers updated and effectively a baseline before any custom/user stuff was installed. I have not tried restoring to that point yet
- Real time note : I have been web browsing, doing this post and etc on/off for forty minutes - no issues,
Trying to think of some other tests or ways to do analysis to get at least some idea of where this is stemming from just to get a baseline
I wish this was a simple as straight forward as an extreme negative Curve Optimizer was resulting in black screens or WHEA - at least then it makes sense
Last edited: