• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

@Devs What is "GPU Temperature (Hot Spot)" on RX Vega?

Status
Not open for further replies.
Joined
Sep 7, 2017
Messages
5 (0.00/day)
Location
Austria
Processor i5-4670K @ 4,5Ghz
Motherboard Asus Z87-Pro
Cooling Custom Watercooling
Memory 16GB DDR3-1600 CL8
Video Card(s) Sapphire RX Vega 56@EKWB
Storage Samsung SSD 840 240GB + Samsung SSD 840EVO 500GB + 2x Crucial MX300 525GB Raid0
Display(s) AOC G2460PF
Case Corsair Obsidian 750D
Audio Device(s) Soundblaster Zx
Power Supply Corsair HX850i
Mouse Logitech G502
Keyboard Victsing Mechanical
Software Windows 10 Pro
I guess the title already says it. I would like to know from which sensor exactly that data is being pulled from.
 
Joined
Sep 7, 2017
Messages
5 (0.00/day)
Location
Austria
Processor i5-4670K @ 4,5Ghz
Motherboard Asus Z87-Pro
Cooling Custom Watercooling
Memory 16GB DDR3-1600 CL8
Video Card(s) Sapphire RX Vega 56@EKWB
Storage Samsung SSD 840 240GB + Samsung SSD 840EVO 500GB + 2x Crucial MX300 525GB Raid0
Display(s) AOC G2460PF
Case Corsair Obsidian 750D
Audio Device(s) Soundblaster Zx
Power Supply Corsair HX850i
Mouse Logitech G502
Keyboard Victsing Mechanical
Software Windows 10 Pro
I directed my question directly to the devs in order to avoid getting distracting answers. I know you meant nothing but good but to be honest pointing me to Guru3D didn't help at all, as this is a question only the programmer of GPU-Z can answer. He is the one who picked the sensor with his own code after all..
 

W1zzard

Administrator
Staff member
Joined
May 14, 2004
Messages
26,956 (3.71/day)
Processor Ryzen 7 5700X
Memory 48 GB
Video Card(s) RTX 4080
Storage 2x HDD RAID 1, 3x M.2 NVMe
Display(s) 30" 2560x1600 + 19" 1280x1024
Software Windows 10 64-bit
It's a sensor inside the GPU silicon. Probably (going by the name9 at a location where it gets hottest. That's all I know.

You also asked about HBM temperature location in a post that's now deleted due to cleanup. No idea again. The card gives me a sensor "HBM temperature" that's all I know
 
Joined
Feb 2, 2015
Messages
2,707 (0.81/day)
Location
On The Highway To Hell \m/
He's not the only one who knows what it means. It was implemented by AMD. All he did was make it so his program shows the data reported by it. Frankly, as he's not a GPU designer/engineer, I'd be pretty surprised if he knew anything at all about it. It was rather stupid in my view for the "who-the-hell-ever-he-is" guy over at AMD to suggest you ask the dev of GPU-Z to explain how/what/where/why AMD designs the temperature sensors on their GPUs. And rather smart for you to reply "But isn't this temperature value something that is provided by the card's firmware?". Since it's pretty obvious that's the case.

He just replied as I was typing this. Good to know I wasn't wrong about that.
 
Joined
Sep 7, 2017
Messages
5 (0.00/day)
Location
Austria
Processor i5-4670K @ 4,5Ghz
Motherboard Asus Z87-Pro
Cooling Custom Watercooling
Memory 16GB DDR3-1600 CL8
Video Card(s) Sapphire RX Vega 56@EKWB
Storage Samsung SSD 840 240GB + Samsung SSD 840EVO 500GB + 2x Crucial MX300 525GB Raid0
Display(s) AOC G2460PF
Case Corsair Obsidian 750D
Audio Device(s) Soundblaster Zx
Power Supply Corsair HX850i
Mouse Logitech G502
Keyboard Victsing Mechanical
Software Windows 10 Pro
Thanks for your replies then. I was asking that particular person on the AMD forums as hes the only one from the staff i know who at least answers when mentioned in a post. I guess I'll just nag them until they provide an answer lol.
 
Joined
Feb 19, 2006
Messages
6,270 (0.95/day)
Location
New York
Processor INTEL CORE I9-9900K @ 5Ghz all core 4.7Ghz Cache @1.305 volts
Motherboard ASUS PRIME Z390-P ATX
Cooling CORSAIR HYDRO H150I PRO RGB 360MM 6x120mm fans push pull
Memory CRUCIAL BALLISTIX 3000Mhz 4x8 32gb @ 4000Mhz
Video Card(s) EVGA GEFORECE RTX 2080 SUPER XC HYBRID GAMING
Storage ADATA XPG SX8200 Pro 1TB 3D NAND NVMe,Intel 660p 1TB m.2 ,1TB WD Blue 3D NAND,500GB WD Blue 3D NAND,
Display(s) 50" Sharp Roku TV 8ms responce time and Philips 75Hz 328E9QJAB 32" curved
Case BLACK LIAN LI O11 DYNAMIC XL FULL-TOWER GAMING CASE,
Power Supply 1600 Watt
Software Windows 10
He's not the only one who knows what it means. It was implemented by AMD. All he did was make it so his program shows the data reported by it. Frankly, as he's not a GPU designer/engineer, I'd be pretty surprised if he knew anything at all about it. It was rather stupid in my view for the "who-the-hell-ever-he-is" guy over at AMD to suggest you ask the dev of GPU-Z to explain how/what/where/why AMD designs the temperature sensors on their GPUs. And rather smart for you to reply "But isn't this temperature value something that is provided by the card's firmware?". Since it's pretty obvious that's the case.

He just replied as I was typing this. Good to know I wasn't wrong about that.
I believe W1zzard knows more than you think, I believe he use to work for ATI back in the day.;)
 
Joined
Feb 2, 2015
Messages
2,707 (0.81/day)
Location
On The Highway To Hell \m/
I believe W1zzard knows more than you think, I believe he use to work for ATI back in the day.;)
Oh I know he knows his stuff. And most certainly a lot more than I know. I was just guessing he wouldn't know about this particular feature on Vega. Which, to the best of my knowledge, is entirely new and only found on Vega. And hasn't been mentioned in any documentation(that I've seen). So unless he actually worked on it(which I figured I'd have heard about if he did)...it just didn't seem likely he'd know any more than the rest of us. Which is nothing...yet.
 
Joined
Sep 7, 2017
Messages
5 (0.00/day)
Location
Austria
Processor i5-4670K @ 4,5Ghz
Motherboard Asus Z87-Pro
Cooling Custom Watercooling
Memory 16GB DDR3-1600 CL8
Video Card(s) Sapphire RX Vega 56@EKWB
Storage Samsung SSD 840 240GB + Samsung SSD 840EVO 500GB + 2x Crucial MX300 525GB Raid0
Display(s) AOC G2460PF
Case Corsair Obsidian 750D
Audio Device(s) Soundblaster Zx
Power Supply Corsair HX850i
Mouse Logitech G502
Keyboard Victsing Mechanical
Software Windows 10 Pro

W1zzard

Administrator
Staff member
Joined
May 14, 2004
Messages
26,956 (3.71/day)
Processor Ryzen 7 5700X
Memory 48 GB
Video Card(s) RTX 4080
Storage 2x HDD RAID 1, 3x M.2 NVMe
Display(s) 30" 2560x1600 + 19" 1280x1024
Software Windows 10 64-bit

W1zzard

Administrator
Staff member
Joined
May 14, 2004
Messages
26,956 (3.71/day)
Processor Ryzen 7 5700X
Memory 48 GB
Video Card(s) RTX 4080
Storage 2x HDD RAID 1, 3x M.2 NVMe
Display(s) 30" 2560x1600 + 19" 1280x1024
Software Windows 10 64-bit
he wouldn't know about this particular feature on Vega. Which, to the best of my knowledge, is entirely new and only found on Vega.
which feature?
 
Joined
Nov 5, 2015
Messages
501 (0.16/day)
Location
Skopje, Macedonia
System Name The Tesseract Cube
Processor AMD Ryzen 5 3600
Motherboard MSI X570A-PRO
Cooling DeepCool Maelstrom 240mm, 2 X DeepCool TF120S (radiator fans), 4 X DeepCool RF120 (case fans)
Memory 2 x 16gb Kingston HyperX 3200mhz
Video Card(s) Sapphire Radeon RX 6800 Nitro + 16GB
Storage Corsair MP400 G3 1TB, Western Digital Caviar Blue 1TB
Display(s) MSI MAG241C Full HD, 144hz FreeSync
Case DeepCool Matrexx 55
Audio Device(s) MB Integrated, Sound Blaster Play 3 (Headset)
Power Supply Corsair CX650M Modular 80+ Bronze
Mouse Corsair Dark Core Pro Wirless RGB
Keyboard MSI GK30 Mecha-Membrane
Software Windows 10 Pro
Benchmark Scores CPUZ: Single Thread - 510 Multi Thread - 4.050 Cinebench R20: CPU - 3 500 score
AMD GCN silicon gets hot very quick, and by my experience in most generations of MAD GPUs starting from 5xxx all the way up to RX 4xx and 5xx, max temps go all the way up to 90C if not in a proper ventilated case and room temp.
Most good cooled AMD cards rely on ambient temp, good case and added fans to work in good temps.
Mine doesn't go above 70C with 4 extra fans in case, and air conditioning in room on a cool 23C in summer, in winter no need for air conditioning, just open a window.
TJ max would be 100C by my exp, but optimal temps would be around 65 - 75C.
 
Joined
Mar 10, 2010
Messages
11,878 (2.31/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Corsair Vengeance Rgb pro 3600cas14 16Gb in four sticks./16Gb/16GB
Video Card(s) Powercolour RX7900XT Reference/Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores 8726 vega 3dmark timespy/ laptop Timespy 6506
It's a sensor inside the GPU silicon. Probably (going by the name9 at a location where it gets hottest. That's all I know.

You also asked about HBM temperature location in a post that's now deleted due to cleanup. No idea again. The card gives me a sensor "HBM temperature" that's all I know
Sounds a lot like something I said Bossman Ty.
 
Joined
Feb 2, 2015
Messages
2,707 (0.81/day)
Location
On The Highway To Hell \m/
which feature?
The GPU hot spot temp sensor...I guess? Which may or may not qualify as a "feature". I might have worded that poorly. As well as everything else I've said in this thread. I probably should have just kept my mouth shut. :oops:

I am sort of curious about it though. My question at this point is, is it only found on Vega? I noticed yesterday while using Polaris Bios Editor that there's a "Hotspot Temp (C)" value under POWERTUNE for Polaris 20, Ellesmere, Baffin, and Lexa. Which makes me think there's got to be a sensor for it on those too.
 

W1zzard

Administrator
Staff member
Joined
May 14, 2004
Messages
26,956 (3.71/day)
Processor Ryzen 7 5700X
Memory 48 GB
Video Card(s) RTX 4080
Storage 2x HDD RAID 1, 3x M.2 NVMe
Display(s) 30" 2560x1600 + 19" 1280x1024
Software Windows 10 64-bit
Has been there for a while, exposed just now.
 
Joined
Sep 7, 2017
Messages
5 (0.00/day)
Location
Austria
Processor i5-4670K @ 4,5Ghz
Motherboard Asus Z87-Pro
Cooling Custom Watercooling
Memory 16GB DDR3-1600 CL8
Video Card(s) Sapphire RX Vega 56@EKWB
Storage Samsung SSD 840 240GB + Samsung SSD 840EVO 500GB + 2x Crucial MX300 525GB Raid0
Display(s) AOC G2460PF
Case Corsair Obsidian 750D
Audio Device(s) Soundblaster Zx
Power Supply Corsair HX850i
Mouse Logitech G502
Keyboard Victsing Mechanical
Software Windows 10 Pro
Would it theoretically be possible to add support for reading VR_SOC and VR_MEM temps on Vega with GPU-Z?
 
Joined
May 12, 2017
Messages
2,178 (0.87/day)
HBM has built-in thermal sensor from what I understand from the datasheet. So should there not be two thermal readings one for each HBM die?
 
Joined
Dec 6, 2005
Messages
10,881 (1.63/day)
Location
Manchester, NH
System Name Senile
Processor I7-4790K@4.8 GHz 24/7
Motherboard MSI Z97-G45 Gaming
Cooling Be Quiet Pure Rock Air
Memory 16GB 4x4 G.Skill CAS9 2133 Sniper
Video Card(s) GIGABYTE Vega 64
Storage Samsung EVO 500GB / 8 Different WDs / QNAP TS-253 8GB NAS with 2x10Tb WD Blue
Display(s) 34" LG 34CB88-P 21:9 Curved UltraWide QHD (3440*1440) *FREE_SYNC*
Case Rosewill
Audio Device(s) Onboard + HD HDMI
Power Supply Corsair HX750
Mouse Logitech G5
Keyboard Corsair Strafe RGB & G610 Orion Red
Software Win 10
HBM has built-in thermal sensor from what I understand from the datasheet. So should there not be two thermal readings one for each HBM die?

Sounds like that would be the same as the memory temperature, no?
 
Joined
Feb 18, 2005
Messages
5,239 (0.75/day)
Location
Ikenai borderline!
System Name Firelance.
Processor Threadripper 3960X
Motherboard ROG Strix TRX40-E Gaming
Cooling IceGem 360 + 6x Arctic Cooling P12
Memory 8x 16GB Patriot Viper DDR4-3200 CL16
Video Card(s) MSI GeForce RTX 4060 Ti Ventus 2X OC
Storage 2TB WD SN850X (boot), 4TB Crucial P3 (data)
Display(s) 3x AOC Q32E2N (32" 2560x1440 75Hz)
Case Enthoo Pro II Server Edition (Closed Panel) + 6 fans
Power Supply Fractal Design Ion+ 2 Platinum 760W
Mouse Logitech G602
Keyboard Logitech G613
Software Windows 10 Professional x64
... how do you expect a random software developer to know how a hardware manufacturer, that they have no relationship with, exposes its hardware's sensor data? We can't smell these things y'know, we're just as dependent as anyone on the hardware company providing documentation on where that sensor data lives in memory, how to access it, and how to interpret it into a number that actually makes sense to an end-user.

I mean, yeah, you could spend hours peeking and poking through various memory locations to guess at this stuff... or you could save yourself a ton of time and effort and just use what the manufacturer provides... I know which one I go with.
 
Joined
Mar 10, 2010
Messages
11,878 (2.31/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Corsair Vengeance Rgb pro 3600cas14 16Gb in four sticks./16Gb/16GB
Video Card(s) Powercolour RX7900XT Reference/Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores 8726 vega 3dmark timespy/ laptop Timespy 6506
Sorry it's off topic but I replied to this thread like originally post two ish early on and its gone , no insult just info , please find a better path ,editing out my help will stop me helping.......
As it's out and out offensive , i told the Op what it was exactly and even before w1zzard , i have a vega and I know that stuff.
Required a dev tut .

Seen the thread here today i thought it new since its been cutting edited
 
Joined
Dec 6, 2005
Messages
10,881 (1.63/day)
Location
Manchester, NH
System Name Senile
Processor I7-4790K@4.8 GHz 24/7
Motherboard MSI Z97-G45 Gaming
Cooling Be Quiet Pure Rock Air
Memory 16GB 4x4 G.Skill CAS9 2133 Sniper
Video Card(s) GIGABYTE Vega 64
Storage Samsung EVO 500GB / 8 Different WDs / QNAP TS-253 8GB NAS with 2x10Tb WD Blue
Display(s) 34" LG 34CB88-P 21:9 Curved UltraWide QHD (3440*1440) *FREE_SYNC*
Case Rosewill
Audio Device(s) Onboard + HD HDMI
Power Supply Corsair HX750
Mouse Logitech G5
Keyboard Corsair Strafe RGB & G610 Orion Red
Software Win 10
or you could save yourself a ton of time and effort and just use what the manufacturer provides

Unless you've seen something different, the only thing I've seen them (AMD) provide is the GPU Core temp. It's up to 3rd parties (i.e. GPU-Z) to read and display other sensor info that's been exposed.

The OP's question was about the meaning of the "hot spot" temp sensor... and it sounds like AMD hasn't given much info on the significance of that value, or where the sensor is located.
 
Joined
Mar 10, 2010
Messages
11,878 (2.31/day)
Location
Manchester uk
System Name RyzenGtEvo/ Asus strix scar II
Processor Amd R5 5900X/ Intel 8750H
Motherboard Crosshair hero8 impact/Asus
Cooling 360EK extreme rad+ 360$EK slim all push, cpu ek suprim Gpu full cover all EK
Memory Corsair Vengeance Rgb pro 3600cas14 16Gb in four sticks./16Gb/16GB
Video Card(s) Powercolour RX7900XT Reference/Rtx 2060
Storage Silicon power 2TB nvme/8Tb external/1Tb samsung Evo nvme 2Tb sata ssd/1Tb nvme
Display(s) Samsung UAE28"850R 4k freesync.dell shiter
Case Lianli 011 dynamic/strix scar2
Audio Device(s) Xfi creative 7.1 on board ,Yamaha dts av setup, corsair void pro headset
Power Supply corsair 1200Hxi/Asus stock
Mouse Roccat Kova/ Logitech G wireless
Keyboard Roccat Aimo 120
VR HMD Oculus rift
Software Win 10 Pro
Benchmark Scores 8726 vega 3dmark timespy/ laptop Timespy 6506
Unless you've seen something different, the only thing I've seen them (AMD) provide is the GPU Core temp. It's up to 3rd parties (i.e. GPU-Z) to read and display other sensor info that's been exposed.

The OP's question was about the meaning of the "hot spot" temp sensor... and it sounds like AMD hasn't given much info on the significance of that value, or where the sensor is located.
The sensor? Is all the temp sensors , it's the hottest spot, vega is built on infinity fabric which is a bus and control network including sensor's and each chip has its own additional sensors but the hot spot in such chip terms is the hottest spot.
And is king of the thermal throttle hill so to speak, as i previously said ,ish.
 
Joined
Dec 6, 2005
Messages
10,881 (1.63/day)
Location
Manchester, NH
System Name Senile
Processor I7-4790K@4.8 GHz 24/7
Motherboard MSI Z97-G45 Gaming
Cooling Be Quiet Pure Rock Air
Memory 16GB 4x4 G.Skill CAS9 2133 Sniper
Video Card(s) GIGABYTE Vega 64
Storage Samsung EVO 500GB / 8 Different WDs / QNAP TS-253 8GB NAS with 2x10Tb WD Blue
Display(s) 34" LG 34CB88-P 21:9 Curved UltraWide QHD (3440*1440) *FREE_SYNC*
Case Rosewill
Audio Device(s) Onboard + HD HDMI
Power Supply Corsair HX750
Mouse Logitech G5
Keyboard Corsair Strafe RGB & G610 Orion Red
Software Win 10
And is king of the thermal throttle hill so to speak

Does the hot spot factor into thermal throttling? It doesn't seem to, if I'm hitting 97c with mine... the GPU throttle is set at 85c, and I think the Mem throttle is at 85c also. Overclocked, my hotspot was 97c. Core was 75c and Mem at 85c ... that was at a core speed topping out at 1733 and mem at 1050 (according to GPU-Z). Core was undervolted to 1050 Mv for both P6 and P7
 
Joined
May 12, 2017
Messages
2,178 (0.87/day)
Sounds like that would be the same as the memory temperature, no?

Have you read HBM Memory PDF? or have I misunderstood something. It does not look right if each HBM die has it's own built-in thermal sensor.

I would expect something like HBM thermal 0 & HBM thermal 1 (example), but just one HBM thermal temperature reading to cover both die.

What if one of HBM memory die was making poor contact, how would you know which one?

HBM1 also has built-in thermal sensor if my memory serves me well.

If I am missing or misunderstood something, can someone please post more technical details.
 
Last edited:
Joined
Dec 6, 2005
Messages
10,881 (1.63/day)
Location
Manchester, NH
System Name Senile
Processor I7-4790K@4.8 GHz 24/7
Motherboard MSI Z97-G45 Gaming
Cooling Be Quiet Pure Rock Air
Memory 16GB 4x4 G.Skill CAS9 2133 Sniper
Video Card(s) GIGABYTE Vega 64
Storage Samsung EVO 500GB / 8 Different WDs / QNAP TS-253 8GB NAS with 2x10Tb WD Blue
Display(s) 34" LG 34CB88-P 21:9 Curved UltraWide QHD (3440*1440) *FREE_SYNC*
Case Rosewill
Audio Device(s) Onboard + HD HDMI
Power Supply Corsair HX750
Mouse Logitech G5
Keyboard Corsair Strafe RGB & G610 Orion Red
Software Win 10
Have you read HBM Memory PDF? or have I misunderstood something. It does not look right if each HBM die has it's own built-in thermal sensor.

No, I haven't read the data sheets. Yes, as far as I know, there are two HBM chips on the GPU chip, I assume they have a sensor only on one, or only expose information for one.

On another note, my Vega 64 started throwing out some weird readings:

1522189368981.png
 
Joined
May 12, 2017
Messages
2,178 (0.87/day)
No, I haven't read the data sheets. Yes, as far as I know, there are two HBM chips on the GPU chip, I assume they have a sensor only on one, or only expose information for one.

On another note, my Vega 64 started throwing out some weird readings:

View attachment 98874

You can't have a sensor on just one HBM, both should be connected. You have separate dies & for safety/monitoring, each HBM die has it's own Thermal features. " take a glance over at the JEDEC PDF Docs", that's what I did.

If one HBM is overheating how would you know this is happening if it taking a reading from the other. Your CPU has thermal reading for each core, HBM is no different. The ability to monitor each die is important. Fuji chip has four HBM stack, so you should be seeing Thermal 0 to Thermal 3.

You can't just have one thermal read-out for all HBM die when connected to the main Vega/Fuji die, that's not how things are done.
 
Last edited:
Status
Not open for further replies.
Top