• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD GPU crashing

DjTeorio

New Member
Joined
Oct 21, 2020
Messages
14 (0.01/day)
The problem in short:
I got a mining GPU, it kept crashing in a weird way. I flashed a new BIOS and nothing happened.

The problem in more detail:
When I upgraded my PC, I got a used mining card, a Radeon RX 580 (4GB) to be exact, but it was crashing in a weird way whenever it was under a certain load. The crashes would consist of my monitor losing signal and the fans in my PC ramping up. What was interesting about the crashes was that whenever it happened I could still talk to people in my discord call and I had to force shut down my PC to be able to use it again. I recently swapped it with my old card and the crashes were gone and it was better in some cases than the newer mining card even though it only had 2GB VRAM, so I believed it could be due to the card having a mining BIOS installed, so I spent many hours yesterday trying to flash the BIOS and it kept giving me errors until today when I finally did it using the DOS version. Nothing had changed the crashing kept happening and I even reinstalled the drivers to no avail. So I have returned to this forum with the hopes that you could help me fix my scuffed mining card.

Any help would be greatly appreciated.
 
Joined
Dec 26, 2016
Messages
281 (0.10/day)
Processor Ryzen 3900x
Motherboard B550M Steel Legend
Cooling XPX (custom loop)
Memory 32GB 3200MHz cl16
Video Card(s) 3080 with Bykski block (custom loop)
Storage 980 Pro
Case Fractal 804
Power Supply Focus Plus Gold 750FX
Mouse G603
Keyboard G610 brown
Software yes, lots!
Probably a hardware error. If that card ran in a mining rig for 2 years or more, its lifetime is reached. Capacitors age. And they age much faster with every degree of warmth they are subjected to. So two or more years at full load in a hot mining rig at 60+ °C is much more heat over time than any average card endures in a gaming rig in five years.

First you could check the caps for visible damage, sometimes they kind of expand and get thicker and inflate/get a bulge, sometimes the chemical from the inside bursts out and they have a brownish coating on them. But most of the times they just dry out on the inside and you dont see anything from the outside.
If you can see damage, you can solder in new caps, which only cost a few cents, otherwise you can just switch them all and hope it was the caps. But maybe it was a completely different failure and I was totally wrong ;)
I am sure there are people out there that have better knowledge of what exactly happens with mining cards and can give you more specific advice.
 
Last edited:

DjTeorio

New Member
Joined
Oct 21, 2020
Messages
14 (0.01/day)
Well all I can really say is that when i opened it nothing stood out as being broken and my soldering skills are pretty much non-existent anyway. :/
 
Joined
Jun 3, 2010
Messages
2,540 (0.50/day)
If you can, resetting the driver has the odd chance of fixing it. It happened to me whenever temperature caused an instability with specific overclocks.
 

DjTeorio

New Member
Joined
Oct 21, 2020
Messages
14 (0.01/day)
I mean I have reinstalled the drivers multiple times if that's what you mean.
 
Joined
Oct 12, 2005
Messages
682 (0.10/day)
I had similar issue with my rx 580 8 GB. Ended up RMA the card for a new one as it was hardware.
 
Joined
Oct 26, 2016
Messages
1,740 (0.64/day)
Location
BGD
Processor Intel I9 7940X
Motherboard Asus Strix Rog Gaming E X299
Cooling Xigmatek LOKI SD963 double-Fan
Memory 64Gb DDR4 2666Mhz
Video Card(s) 1)RX 6900XT BIOSTAR 16Gb***2)MATROX M9120LP
Storage 2 x ssd-Kingston 240Gb A400 in RAID 0+ HDD 500Gb +Samsung 128gbSSD +SSD Kinston 480Gb
Display(s) BenQ 28"EL2870U(4K-HDR) / Acer 24"(1080P) / Eizo 2336W(1080p) / 2x Eizo 19"(1280x1024)
Case Lian Li
Audio Device(s) Realtek/Creative T20 Speakers
Power Supply F S P Hyper S 700W
Mouse Asus TUF-GAMING M3
Keyboard Func FUNC-KB-460/Mechanical Keyboard
VR HMD Oculus Rift DK2
Software Win 11
Benchmark Scores Fire Strike=23905,Cinebench R15=3189,Cinebench R20=3791.Passmark=30689,Geekbench4=32885
There is also a possibility that the miners change&adjust the card bios to minimize power consumption and gain much profit as they can and that causing the card instability.....If that is the case you can find the original card bios and flash it with Atiflash utility......GL
 
Joined
Dec 26, 2016
Messages
281 (0.10/day)
Processor Ryzen 3900x
Motherboard B550M Steel Legend
Cooling XPX (custom loop)
Memory 32GB 3200MHz cl16
Video Card(s) 3080 with Bykski block (custom loop)
Storage 980 Pro
Case Fractal 804
Power Supply Focus Plus Gold 750FX
Mouse G603
Keyboard G610 brown
Software yes, lots!
There is also a possibility that the miners change&adjust the card bios to minimize power consumption and gain much profit as they can and that causing the card instability.....If that is the case you can find the original card bios and flash it with Atiflash utility......GL
He already did that
 
Joined
Jun 3, 2010
Messages
2,540 (0.50/day)
I mean I have reinstalled the drivers multiple times if that's what you mean.
If you had a shortcut to reset the driver while in os, it could have helped. For some reason, the card can soft reset gpu errors, but not memory controller errors which is the cause you should suspect, imo. It could also be due to memory although the card wouldn't let you off without a hard reset.
 
Joined
Jul 16, 2014
Messages
8,120 (2.27/day)
Location
SE Michigan
System Name Dumbass
Processor AMD Ryzen 7800X3D
Motherboard ASUS TUF gaming B650
Cooling Artic Liquid Freezer 2 - 420mm
Memory G.Skill Sniper 32gb DDR5 6000
Video Card(s) GreenTeam 4070 ti super 16gb
Storage Samsung EVO 500gb & 1Tb, 2tb HDD, 500gb WD Black
Display(s) 1x Nixeus NX_EDG27, 2x Dell S2440L (16:9)
Case Phanteks Enthoo Primo w/8 140mm SP Fans
Audio Device(s) onboard (realtek?) - SPKRS:Logitech Z623 200w 2.1
Power Supply Corsair HX1000i
Mouse Steeseries Esports Wireless
Keyboard Corsair K100
Software windows 10 H
Benchmark Scores https://i.imgur.com/aoz3vWY.jpg?2
what kind of temps were you seeing? cuz it sounds like something was overheating.
 
Joined
Sep 3, 2019
Messages
2,981 (1.76/day)
Location
Thessaloniki, Greece
System Name PC on since Aug 2019, 1st CPU R5 3600 + ASUS ROG RX580 8GB >> MSI Gaming X RX5700XT (Jan 2020)
Processor Ryzen 9 5900X (July 2022), 150W PPT limit, 79C temp limit, CO -9~14
Motherboard Gigabyte X570 Aorus Pro (Rev1.0), BIOS F37h, AGESA V2 1.2.0.B
Cooling Arctic Liquid Freezer II 420mm Rev7 with off center mount for Ryzen, TIM: Kryonaut
Memory 2x16GB G.Skill Trident Z Neo GTZN (July 2022) 3600MHz 1.42V CL16-16-16-16-32-48 1T, tRFC:288, B-die
Video Card(s) Sapphire Nitro+ RX 7900XTX (Dec 2023) 314~465W (387W current) PowerLimit, 1060mV, Adrenalin v24.3.1
Storage Samsung NVMe: 980Pro 1TB(OS 2022), 970Pro 512GB(2019) / SATA-III: 850Pro 1TB(2015) 860Evo 1TB(2020)
Display(s) Dell Alienware AW3423DW 34" QD-OLED curved (1800R), 3440x1440 144Hz (max 175Hz) HDR1000, VRR on
Case None... naked on desk
Audio Device(s) Astro A50 headset
Power Supply Corsair HX750i, 80+ Platinum, 93% (250~700W), modular, single/dual rail (switch)
Mouse Logitech MX Master (Gen1)
Keyboard Logitech G15 (Gen2) w/ LCDSirReal applet
Software Windows 11 Home 64bit (v23H2, OSB 22631.3155)
Any monitoring done for speeds and voltages during gaming? For GPU and VRAM.

You can use GPU-Z sensors tab, or HWiNFO "sensors only" window. HWiNFO can be more detailed (more sensors).
Just load a game for a while and get out before crash happens, and you can see max values of each.
 
Joined
Mar 20, 2008
Messages
1,369 (0.23/day)
System Name Ryzen5900X
Processor AMD Ryzen 5900X
Motherboard Gigabyte B550 AORUS PRO AC
Cooling NZXT Kraken X62
Memory 4x G.Skill F4-3600C17D-8GTZ
Video Card(s) AMD Radeon RX 6800XT Midnight Black
The crashes would consist of my monitor losing signal and the fans in my PC ramping up. What was interesting about the crashes was that whenever it happened I could still talk to people in my discord call and I had to force shut down my PC to be able to use it again.
I recognize these symptoms. The VRM's seems to be the issue. You can try to put the fan on 100% to see if it takes more time before it crashes.
 
Joined
Jun 3, 2010
Messages
2,540 (0.50/day)
I recognize these symptoms. The VRM's seems to be the issue. You can try to put the fan on 100% to see if it takes more time before it crashes.
It happened to me. Afterwards I got the shifted color dots. I don't know what broke, but there are polka dots which is funny for a computer error. The card is working and the graphics aren't altered, so I think it busted 1 module, or its bga solders.
I suggest you use the card with vsync and chill turned on. You're soon to have the same joke of a card, otherwise.
 
Joined
Sep 2, 2020
Messages
1,478 (1.11/day)
System Name Chip
Processor Amd 5600X
Motherboard MSI B450M Mortar Max
Cooling Hyper 212
Memory 2x 16g ddr4 3200mz
Video Card(s) RX 6700
Storage 5.5 tb hd 220 g ssd
Display(s) Normal moniter
Case something cheap
VR HMD Vive
what do you mean joke of a card 580 is fine at 1080p
 
Top