
2 Artifacting 2080 Tis: Determining Faulty GDDR6 Modules?

inspirashun

New Member
Joined
Feb 4, 2020
Messages
6 (0.10/day)
I have two faulty 2080 Tis on a known-good bench.

The Dell card boots into artifact mode 100% of the time. Windows 10 reports Error 43. Drivers install okay. GPU-Z shows 0 MB of GDDR6, but the GPU-Z sensors show ~2500 MB in use. Nothing is recognized in MSI Afterburner, and HWiNFO shows no significant sensor data.

The ROG Strix OC shows no artifacting on a cold start, is recognized with 11 GB in GPU-Z, and HWiNFO picks up sensor data. Once up to temperature, the card locks up the PC, and upon restart it artifacts from the BIOS screen onward and is recognized only to the same extent as the Dell card: as a 2080 Ti in limp mode.

This screams memory failure to me. Any thoughts?
 
Joined
Jun 3, 2010
Messages
796 (0.22/day)
You could try rewriting the memory bootstrap. Generally, that is what I would look into, apart from a resolder.
 
Joined
Jan 31, 2010
Messages
3,460 (0.93/day)
Location
Gougeland (NZ)
System Name Cumquat 2000
Processor AMD RyZen R7 3700X
Motherboard Asus Strix x470-F Gaming
Cooling Cryorig R1 Universal
Memory 16GB GSkill TridentZ DR4-3200 16-18-18-38 1.35V
Video Card(s) Gigabyte RX580 Gaming 8GB
Storage 1X Samsung 860 EVO 500GB SATA AHCI 1x Sandisk 128GB PCIe, 12TB of HDD Storage
Display(s) ViewSonic VX2457 MHD 75Hz FreeSync 1920x1080p
Case SilverStone Raven RV02B-W + 3x SS AP181 + USB3.0 upgrade
Audio Device(s) Creative SB X-Fi Titanium Fatal1ty Pro PCIe + Pioneer VSX-531 amp Technics 100W Speaker set
Power Supply Enermax RevolutionXT II 750W 80+ Gold
Mouse Logitech G502 Proteus Core
Keyboard Logitech G105
Software Windows 10 X64 PRO (build 1909)
Benchmark Scores it sucks even more less now ;)
Why bother? Shouldn't these cards still be covered by the manufacturer's warranty? Why not just RMA them both for good cards in return, unless you really want to faff around with them?
 

inspirashun

New Member
Joined
Feb 4, 2020
Messages
6 (0.10/day)
They were both purchased as parts for educational purposes. The Dell has bent fins on the blower heatsink, so I don't see an RMA possibility. The ASUS will be going in for RMA.

I will try a BIOS flash on the Dell to see what comes of it. If that doesn't solve it, how would one go about determining which individual modules are bad? I would hate to heat, remove, and replace all 11 in one go.
 
Joined
Jan 31, 2010
Messages
3,460 (0.93/day)
With some million-dollar test equipment, yes. Without that, you'll unfortunately have to do it to each one, and there's still the risk that it's just a dead die, in which case no amount of resoldering will fix it unless you have a dead donor card to rifle for parts.
 
Joined
Nov 18, 2010
Messages
4,973 (1.45/day)
Location
Rīga, Latvia
System Name HELLSTAR
Processor Intel 5960X @ 4.4GHz
Motherboard Gigabyte GA-X99-UD3
Cooling Custom Loop. 360+240 rads. 5x Nidec Servo Gentle Typhoons. AC CUPLEX KRYOS NEXT.
Memory 4x8GB Corsair Vengeance LPX 3000MHz 15-17-17-36 CR1
Video Card(s) ASUS 1080 Ti FE + water block
Storage Optane 900P + Samsung PM981 NVMe 1TB + 750 EVO 500GB
Display(s) Philips PHL BDM3270
Case Phanteks Enthoo Evolv ATX Tempered Glass
Audio Device(s) Sound Blaster ZxR
Power Supply Fractal Design Newton R3 1000W
Mouse Razer Basilisk
Keyboard Razer Deathstalker
Software Windows 10 insider
Freeze or heat them up. See whether the problem repeats.

Rinse and repeat till you find the weak spot.

I would like to add: if one part is faulty, there is a great chance another one is too, and you have two faulty cards already. A professional would insist on changing them all.
 

inspirashun

New Member
Joined
Feb 4, 2020
Messages
6 (0.10/day)
Freezing them would be an interesting approach. I have a Kraken G12 bracket that would allow access to the GDDR6 modules while the card is live on the bench. I could use that method to determine which module(s) heat up more rapidly; if the difference is significant, perhaps those are the failed modules. From what I see in the FLIR images posted of these cards, this RAM gets HOT.

Is there a better way to generate the freeze than an upside-down air duster?
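To make "a significant difference" concrete, here is a minimal sketch of how per-module temperature readings could be compared. It assumes you can log one temperature-rise figure per module (for example from a thermal camera); the labels M1-M11, the sample numbers, and the 3.5 cutoff are illustrative assumptions, not measurements from either card. It uses a modified z-score based on the median, so one or two hot modules do not skew the baseline.

```python
from statistics import median


def flag_thermal_outliers(readings, threshold=3.5):
    """Return labels of modules whose temperature rise stands out from the rest.

    readings: dict mapping a module label to its temperature rise in deg C.
    Uses the modified z-score (based on the median absolute deviation),
    which stays stable even when a few readings are far off.
    """
    values = list(readings.values())
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:
        # Most readings are identical: anything different from the median stands out.
        return sorted(name for name, v in readings.items() if v != med)
    return sorted(
        name
        for name, v in readings.items()
        if abs(0.6745 * (v - med) / mad) > threshold
    )


# Hypothetical readings for 11 modules; M7 is made up to run much hotter.
sample = {f"M{i}": 20.0 + 0.3 * (i % 3) for i in range(1, 12)}
sample["M7"] = 31.5
print(flag_thermal_outliers(sample))
```

A module flagged this way is only a suspect: a hot chip can also be the victim of a problem elsewhere on the rail.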
 

eidairaman1

The Exiled Airman
Joined
Jul 2, 2007
Messages
30,377 (6.52/day)
Location
Republic of Texas (True Patriot)
System Name PCGOD
Processor AMD FX 8350@ 5.0GHz
Motherboard Asus TUF 990FX Sabertooth R2 2901 Bios
Cooling Scythe Ashura, 2×BitFenix 230mm Spectre Pro LED (Blue,Green), 2x BitFenix 140mm Spectre Pro LED
Memory 16 GB Gskill Ripjaws X 2133 (2400 OC, 10-10-12-20-20, 1T, 1.65V)
Video Card(s) AMD Radeon 290 Sapphire Vapor-X
Storage Samsung 840 Pro 256GB, WD Velociraptor 1TB
Display(s) NEC Multisync LCD 1700V (Display Port Adapter)
Case AeroCool Xpredator Evil Blue Edition
Audio Device(s) Creative Labs Sound Blaster ZxR
Power Supply Seasonic 1250 XM2 Series (XP3)
Mouse Roccat Kone XTD
Keyboard Roccat Ryos MK Pro
Software Windows 7 Pro 64
Please post the white sticker from the back of each card or the side of the heatsink, plus GPU-Z screenshots.

No GPU BIOS since the GeForce 10 series and AMD Vega can be modified; they are digitally signed, so a modified BIOS breaks every time. (Thanks China/miners :mad:)

Cross-flashing is the only method left to resolve issues now, or detuning with RivaTuner, Afterburner, TriXX, etc.

Artifacting can come from timings/voltages, or from a defect in the RAM chips or the die itself. Always check the video cable too.
 

inspirashun

New Member
Joined
Feb 4, 2020
Messages
6 (0.10/day)
I will get the pictures shortly.


*********
I was trying to PM you on this exact topic of BIOS editing. I have my answer now, but here were my thoughts.

"To create a scenario: scam cards sold on various websites are flashed with a BIOS that reports an incorrect amount of GDDR, typically a higher one. I am wondering if the opposite is possible: can you set the GDDR to a lower amount, and would the graphics card then use only the lower allotted amount?

For the purpose of testing GDDR6 modules, could I create a BIOS for a card that has, for instance, eight 1 GB GDDR6 modules, M1-M8? For diagnostic purposes, one BIOS would be configured to use only M1-M4, another to test M5-M8, another to test M1 individually, etc. Does that seem like something that could be done, or would the BIOS not be able to switch memory on and off at the chip level? If an 8 GB card is set to a 4 GB capacity, can it be instructed via BIOS to use 100% of only 4 chips instead of 50% of 8 chips?"
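The M1-M4 / M5-M8 / single-module idea is essentially a bisection search. The sketch below only illustrates that test plan. It assumes a hypothetical `subset_passes` check that can run the card on a chosen subset of modules, which signed BIOSes may well make impossible in practice, and it assumes exactly one faulty module.

```python
def isolate_faulty_module(modules, subset_passes):
    """Bisect a list of module labels down to a single suspect.

    subset_passes(subset) must return True when a test run restricted to
    `subset` shows no errors. Assumes exactly one module is faulty.
    Returns (suspect, list_of_subsets_tested).
    """
    candidates = list(modules)
    tested = []
    while len(candidates) > 1:
        half = candidates[: len(candidates) // 2]
        tested.append(tuple(half))
        if subset_passes(half):
            # The tested half is clean, so the fault is in the other half.
            candidates = candidates[len(half):]
        else:
            candidates = half
    return candidates[0], tested


# Hypothetical 8-module card where M6 is the bad chip.
modules = [f"M{i}" for i in range(1, 9)]
faulty = "M6"
suspect, tested = isolate_faulty_module(modules, lambda s: faulty not in s)
print(suspect, tested)
```

With eight modules this needs three test runs instead of eight, which is the appeal of the half-and-half BIOS configurations described above.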

DELL 2080TI

ASUS ROG STRIX OC 2080TI


The ASUS posted today cold and without artifacts; however, it went black while loading into Win10. I did take note that the performance BIOS started all 3 fans, while the quiet BIOS started only the 2 outer fans.
 
Joined
Dec 31, 2009
Messages
17,049 (4.55/day)
With these cards, nothing has been pinpointed about their issues, so we don't know whether it is the memory at all. It could be the memory controller or another IC. There is no real way to isolate things.
 

inspirashun

New Member
Joined
Feb 4, 2020
Messages
6 (0.10/day)
I got as far as the power stages on the Dell card. The MOSFET pins were almost under the chip, so I had to order some thin angled probes to get readings. They should be in this week, and then I can resume taking readings across the PCB. You're correct though, there is far too much on the table to isolate anything yet. This dead time has me brainstorming a pass/fail test for GDDR6 on a 100% known-good card, with no variable other than the GDDR6 modules.
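As a rough idea of what a pass/fail memory test does, here is a minimal sketch of a write/read-back pattern test. It runs against a simulated memory with an optional stuck bit, not against real GDDR6; `SimulatedMemory`, the pattern list, and the 32-bit word size are assumptions made for illustration. A real test would need to run on the GPU itself.

```python
class SimulatedMemory:
    """Word-addressable stand-in for memory, with an optional stuck-at-zero bit."""

    def __init__(self, size_words, stuck_addr=None, stuck_bit=0):
        self.words = [0] * size_words
        self.stuck_addr = stuck_addr
        self.stuck_mask = ~(1 << stuck_bit) & 0xFFFFFFFF

    def __len__(self):
        return len(self.words)

    def write(self, addr, value):
        value &= 0xFFFFFFFF
        if addr == self.stuck_addr:
            value &= self.stuck_mask  # the faulty bit can never be set
        self.words[addr] = value

    def read(self, addr):
        return self.words[addr]


# All zeros, all ones, and two alternating-bit patterns.
PATTERNS = (0x00000000, 0xFFFFFFFF, 0xAAAAAAAA, 0x55555555)


def pattern_test(mem):
    """Write each pattern to every word, read it back, and collect mismatches.

    Returns a list of (address, expected, actual); an empty list means pass.
    """
    failures = []
    for pattern in PATTERNS:
        for addr in range(len(mem)):
            mem.write(addr, pattern)
        for addr in range(len(mem)):
            actual = mem.read(addr)
            if actual != pattern:
                failures.append((addr, pattern, actual))
    return failures


good = SimulatedMemory(64)
bad = SimulatedMemory(64, stuck_addr=10, stuck_bit=3)
print("good card passes:", pattern_test(good) == [])
print("bad card failures:", [(a, hex(e), hex(g)) for a, e, g in pattern_test(bad)])
```

The failing addresses only narrow things down to a module if you know how the card maps addresses to chips, which is not covered here.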
 

eidairaman1

The Exiled Airman
Joined
Jul 2, 2007
Messages
30,377 (6.52/day)

inspirashun

New Member
Joined
Feb 4, 2020
Messages
6 (0.10/day)
Thanks for the leads. I think Eli has moved on to other projects, so I will reach out to buildzoid and zettabit.

I also came across this gent here:

At around 10:30 he starts taking measurements with a multimeter. His method appears to diagnose a bad module by pulling off an inductor and using a power supply to feed 1.3 V to the GDDR modules; one heats up more than the others, and he pulls it.

Unfortunately I don't know Polish, and there are no subtitles on the videos. I'm not sure what the card's initial problems were, and I don't know if he ends up completing the repair after the part 3 video.
 