• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Nerfed FP64 performance in consumer GPU cards

Sword7

New Member
Joined
Sep 3, 2018
Messages
1 (0.00/day)
Folks,

I googled some FP64 (double precision) performance about any consumer GPU cards but learned that NVIDIA and AMD nerfed FP64 performance in consumer GPU cards. Also I learned that many people asked for FP64 hardware support in several years.

I am concerning about that because of jittery issues when rendering large worlds with FP32 operations like space flight simulators, etc. There are good FP32 performance but inaccurate (limited 7 digits accuracy).

Games need physics and 64-bit vertices for large world to be calculated through GPU process for better performance.

I researched deeper and deeper and learned that AMD have little better FP64 performance than NVIDIA (1/32 FP32 performance).

Does any GPU hardware vendors plan to improve FP64 performance through consumer GPU cards?

Thanks,
Tim
 
Joined
Nov 22, 2014
Messages
91 (0.03/day)
System Name I could say remaining parts or something like that...
Processor i5 2500k @ 4,8 ghz/Xeon x5650
Motherboard Asus Z68 gene-z/G1 Assassin
Cooling 240mm radiator/212+
Memory 8gb ddr3-1333/16gb ddr3-1866
Video Card(s) Nitro RX460 4gb/2x RX570 Red Devil
Storage Many
Display(s) Samsung TV/Topsync 2560*1440
Audio Device(s) onboard
Power Supply Seasonic 620w/Corsair 860i
Software 10 64 bit ultimate
This is why the professionnal cards exists. Market segmentation.
 
Joined
Jan 8, 2017
Messages
8,942 (3.36/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
It's not nerfed, the units don't exist physically on the chip and they're just not really required for rendering or game physics, in fact FP16 is more useful probably. Remember that you could have an immense high precision polygon count but at the end it will get rasterized into a 2D image.

Vertex data simply does not require precision that big, in fact some engines employed 16 bit storage for vertex data in order to save space.

Even though it seems like FP64 performance is very low it's still more than what most CPUs can do. I wrote several GPU accelerated algorithms and they all ran faster using FP64 on my 1080 which has a 1/32 FP64 ratio.
 

newtekie1

Semi-Retired Folder
Joined
Nov 22, 2005
Messages
28,472 (4.23/day)
Location
Indiana, USA
Processor Intel Core i7 10850K@5.2GHz
Motherboard AsRock Z470 Taichi
Cooling Corsair H115i Pro w/ Noctua NF-A14 Fans
Memory 32GB DDR4-3600
Video Card(s) RTX 2070 Super
Storage 500GB SX8200 Pro + 8TB with 1TB SSD Cache
Display(s) Acer Nitro VG280K 4K 28"
Case Fractal Design Define S
Audio Device(s) Onboard is good enough for me
Power Supply eVGA SuperNOVA 1000w G3
Software Windows 10 Pro x64
It's not nerfed, the units don't exist physically on the chip and they're just not really required for rendering or game physics, in fact FP16 is more useful probably.

That's not always true. Yes, in the Ampere GPUs, that is the case I believe. The GA100 used in Ampere high end compute cards does have more FP64 units than the GA102 used in the RTX3090/3080.

However, in other cards the FP64 performance is just nerfed. The RTX3090 and Quadro A6000 both use the same GA102 GPU core. So the FP64 units in the cards is the same. However, the Quadro A6000's FP performance is 1:32 of the FP32 but the RTX3090 is 1:64. So there are just far more FP64 units disabled(but still physically there) in the RTX3090 or the performance is nerfed. And that's pretty much how it all goes in the consumer space vs the professional space. Cards that use the same GPU cores have far lower FP64 performance in consumer cards despite the FP64 units being present in the GPU.

Now it also could be entirely a yield thing too. Just like sharders are disabled because they are damaged and the GPU is used in a lower tier card the same might be done with FP64 units. If there are enough damaged FP64 units, then the GPU is downgraded for use in a consumer card.
 
  • Like
Reactions: Rei
Joined
May 8, 2016
Messages
1,741 (0.60/day)
System Name BOX
Processor Core i7 6950X @ 4,26GHz (1,28V)
Motherboard X99 SOC Champion (BIOS F23c + bifurcation mod)
Cooling Thermalright Venomous-X + 2x Delta 38mm PWM (Push-Pull)
Memory Patriot Viper Steel 4000MHz CL16 4x8GB (@3240MHz CL12.12.12.24 CR2T @ 1,48V)
Video Card(s) Titan V (~1650MHz @ 0.77V, HBM2 1GHz, Forced P2 state [OFF])
Storage WD SN850X 2TB + Samsung EVO 2TB (SATA) + Seagate Exos X20 20TB (4Kn mode)
Display(s) LG 27GP950-B
Case Fractal Design Meshify 2 XL
Audio Device(s) Motu M4 (audio interface) + ATH-A900Z + Behringer C-1
Power Supply Seasonic X-760 (760W)
Mouse Logitech RX-250
Keyboard HP KB-9970
Software Windows 10 Pro x64
Just buy old Titan if you need FP64 that badly, and be happy (after some OC).
gpgpu.png
 
Joined
Apr 6, 2015
Messages
246 (0.07/day)
Location
Japan
System Name ChronicleScienceWorkStation
Processor AMD Threadripper 1950X
Motherboard Asrock X399 Taichi
Cooling Noctua U14S-TR4
Memory G.Skill DDR4 3200 C14 16GB*4
Video Card(s) AMD Radeon VII
Storage Samsung 970 Pro*1, Kingston A2000 1TB*2 RAID 0, HGST 8TB*5 RAID 6
Case Lian Li PC-A75X
Power Supply Corsair AX1600i
Software Proxmox 6.2
Joined
Nov 4, 2020
Messages
5 (0.00/day)
Processor TR 1920x
Motherboard X399 Asrock Taichi
Cooling be quiet! Dark Rock Pro TR4
Memory 128GB (8x16GB) 3000MHz CL16 Corsair Vengeance LPX Black
Video Card(s) GTX 1070 Zotac AMP! Extreme Edition
Storage 970 EVO 500GB
Display(s) 3x Eizo EV2450
Case SilentiumPC Regnum RG4T Pure Black
Power Supply SilentiumPC Vero M2 Bronze 600W
Sorry if my post is misplaced, probably should start a new topic but this one is kinda exactly my problem.

Do you happen to know, is it possibile that fp64 perf was nerfed in some of the latest drivers? Or maybe some weird Aida64 quirks? Because I've recently bought an used GTX Titan (not black, which I know it should perform just slightly worse), thinking mostly about this fp64 performance, which is beneficial in my university work. I've ran the test, but the score for double-precision flops for my card was terrible. I'm not sure if that's right and I missed something about nerfing fp64, or maybe bugged drivers, or maybe I was scammed? :( Any of your help, I would really appreciate it

Aida-gpgpu-flop.png
gtx-titan.png
 
Joined
Feb 17, 2010
Messages
1,488 (0.29/day)
Location
Azalea City
System Name Main
Processor Ryzen 5950x
Motherboard B550 PG Velocita
Cooling Water
Memory Ballistix
Video Card(s) RX 6900XT
Storage T-FORCE CARDEA A440 PRO
Display(s) Samsung UE590
Case QUBE 500
Audio Device(s) Logitech Z623
Power Supply LEADEX V 1KW
Mouse Cooler Master MM710
Keyboard Huntsman Elite
Software 11 Pro
Benchmark Scores https://hwbot.org/user/damric/
Sorry if my post is misplaced, probably should start a new topic but this one is kinda exactly my problem.

Do you happen to know, is it possibile that fp64 perf was nerfed in some of the latest drivers? Or maybe some weird Aida64 quirks? Because I've recently bought an used GTX Titan (not black, which I know it should perform just slightly worse), thinking mostly about this fp64 performance, which is beneficial in my university work. I've ran the test, but the score for double-precision flops for my card was terrible. I'm not sure if that's right and I missed something about nerfing fp64, or maybe bugged drivers, or maybe I was scammed? :( Any of your help, I would really appreciate it

View attachment 174387View attachment 174388

You would be way better off with a used Vega.

vega56.png
 
Joined
Nov 4, 2020
Messages
5 (0.00/day)
Processor TR 1920x
Motherboard X399 Asrock Taichi
Cooling be quiet! Dark Rock Pro TR4
Memory 128GB (8x16GB) 3000MHz CL16 Corsair Vengeance LPX Black
Video Card(s) GTX 1070 Zotac AMP! Extreme Edition
Storage 970 EVO 500GB
Display(s) 3x Eizo EV2450
Case SilentiumPC Regnum RG4T Pure Black
Power Supply SilentiumPC Vero M2 Bronze 600W
That's the point, Titan should easily smoke your Vega in case of FP64 but somehow it does not. Apart from that, CUDA support is crucial for me, which unfortunately is not available on Vega.
 
Joined
Jul 19, 2015
Messages
928 (0.29/day)
System Name The Banshee
Processor Ryzen 5 5600 @ 4.45GHz
Motherboard AsRock X370 Taichi
Cooling CM Hyper 212 Plus
Memory 16GB 2x8 G.Skill Trident Z 3600 CL16 1.35V
Video Card(s) RTX 3060M 6GB @ 1750-2000MHz Core / 1875MHz Mem
Storage 1TB Kingston NV2 & 1TB Mass Storage HDD
Display(s) LG 25UM57-P @ 75Hz OC
Case Fractal Design Arc XL
Audio Device(s) ATH-M20x
Power Supply Evga SuperNova 1300 G2
Mouse Evga Torq X3
Keyboard Thermaltake Challenger
Software Win 10 Pro 64-Bit
That's the point, Titan should easily smoke your Vega in case of FP64 but somehow it does not. Apart from that, CUDA support is crucial for me, which unfortunately is not available on Vega.
Yeah, somethings off there, a Titan should be around 1.3TFLOPS. Try an older driver and see if that changes anything.
 

Rei

Joined
Aug 1, 2020
Messages
656 (0.48/day)
Location
Guam
System Name 1 Desktop/2 Laptops/1 Netbook
Processor AMD Athon X2 64/Intel Pentium 997/Intel Pentium 4/Intel Atom
Motherboard EpoX ATX motherboard/Samsung/Toshiba/Lenovo
Cooling Stock
Memory 4 GB/4 GB/2 GB/2 GB
Video Card(s) Asus GeForce GTX 780 Ti/Intel HD Graphics/GeForce 4MX/Intel GMA
Storage 6+ TB Total
Display(s) HP Pavilion 14 Inch 1024x768@60Hz 4:3 Aspect Ratio CRT Monitor
Case None
Audio Device(s) Various
Power Supply Seasonic 500 Watt & VenomRX 500 Watt
Mouse Wayes Iron Man Wireless Mouse
Keyboard Rexus VR2 Wireless Keyboard
Software Win10 & WinXP SP3
Benchmark Scores It sucks...
Because I've recently bought an used GTX Titan (not black, which I know it should perform just slightly worse)
How would you know this? That is very unlikely & should be the opposite.
 
Joined
Nov 4, 2020
Messages
5 (0.00/day)
Processor TR 1920x
Motherboard X399 Asrock Taichi
Cooling be quiet! Dark Rock Pro TR4
Memory 128GB (8x16GB) 3000MHz CL16 Corsair Vengeance LPX Black
Video Card(s) GTX 1070 Zotac AMP! Extreme Edition
Storage 970 EVO 500GB
Display(s) 3x Eizo EV2450
Case SilentiumPC Regnum RG4T Pure Black
Power Supply SilentiumPC Vero M2 Bronze 600W
How would you know this? That is very unlikely & should be the opposite

Well, I wrote it wrongly. What I meant, is that I know my card should perform worse than Titan Black, but not like I've experienced. Sorry, english isn't my native language.

Anyway, I tried couple different driver versions, also different versions of Aida64 - still the same result :( Quickly swapped for my previous card, 750Ti - and the results were normal just like on the website. Soo I'm getting more and more concerned :( Would love to see some comparison from person who also have Titan/Titan Black, before going for Windows reinstall
 
  • Like
Reactions: Rei

Rei

Joined
Aug 1, 2020
Messages
656 (0.48/day)
Location
Guam
System Name 1 Desktop/2 Laptops/1 Netbook
Processor AMD Athon X2 64/Intel Pentium 997/Intel Pentium 4/Intel Atom
Motherboard EpoX ATX motherboard/Samsung/Toshiba/Lenovo
Cooling Stock
Memory 4 GB/4 GB/2 GB/2 GB
Video Card(s) Asus GeForce GTX 780 Ti/Intel HD Graphics/GeForce 4MX/Intel GMA
Storage 6+ TB Total
Display(s) HP Pavilion 14 Inch 1024x768@60Hz 4:3 Aspect Ratio CRT Monitor
Case None
Audio Device(s) Various
Power Supply Seasonic 500 Watt & VenomRX 500 Watt
Mouse Wayes Iron Man Wireless Mouse
Keyboard Rexus VR2 Wireless Keyboard
Software Win10 & WinXP SP3
Benchmark Scores It sucks...
Well, I wrote it wrongly. What I meant, is that I know my card should perform worse than Titan Black, but not like I've experienced. Sorry, english isn't my native language.
Ok, don't worry about it. I was just curious.
Anyway, I tried couple different driver versions, also different versions of Aida64 - still the same result :( Quickly swapped for my previous card, 750Ti - and the results were normal just like on the website. Soo I'm getting more and more concerned :(
a) Did you use the "generic" driver from Nvidia's website?
b) What brand is your Titan from?
c) Have you tried to use the driver from your Titan's brand's website?
Would love to see some comparison from person who also have Titan/Titan Black, before going for Windows reinstall
Sorry, I only have GTX 780 Ti which has nearly the same spec as Titan Black but far inferior FP64 of only 0.21 TFLOPS compared to Titan Black's 1.7 TFLOPS.
 

silentbogo

Moderator
Staff member
Joined
Nov 20, 2013
Messages
5,474 (1.44/day)
Location
Kyiv, Ukraine
System Name WS#1337
Processor Ryzen 7 3800X
Motherboard ASUS X570-PLUS TUF Gaming
Cooling Xigmatek Scylla 240mm AIO
Memory 4x8GB Samsung DDR4 ECC UDIMM
Video Card(s) Inno3D RTX 3070 Ti iChill
Storage ADATA Legend 2TB + ADATA SX8200 Pro 1TB
Display(s) Samsung U24E590D (4K/UHD)
Case ghetto CM Cosmos RC-1000
Audio Device(s) ALC1220
Power Supply SeaSonic SSR-550FX (80+ GOLD)
Mouse Logitech G603
Keyboard Modecom Volcano Blade (Kailh choc LP)
VR HMD Google dreamview headset(aka fancy cardboard)
Software Windows 11, Ubuntu 20.04 LTS
Sorry if my post is misplaced, probably should start a new topic but this one is kinda exactly my problem.

Do you happen to know, is it possibile that fp64 perf was nerfed in some of the latest drivers? Or maybe some weird Aida64 quirks? Because I've recently bought an used GTX Titan (not black, which I know it should perform just slightly worse), thinking mostly about this fp64 performance, which is beneficial in my university work. I've ran the test, but the score for double-precision flops for my card was terrible. I'm not sure if that's right and I missed something about nerfing fp64, or maybe bugged drivers, or maybe I was scammed? :( Any of your help, I would really appreciate it
Just a quick google-foo found a solution for your problem. FP64 acceleration by default is disabled, because it gimps FP32 performance. Basically what you are getting is a 1/24 ratio, like on GTX780. If you enable FP64, you'll get your 1:3 ratio, but expect a slight performance drop in FP32.
 
Joined
Nov 4, 2020
Messages
5 (0.00/day)
Processor TR 1920x
Motherboard X399 Asrock Taichi
Cooling be quiet! Dark Rock Pro TR4
Memory 128GB (8x16GB) 3000MHz CL16 Corsair Vengeance LPX Black
Video Card(s) GTX 1070 Zotac AMP! Extreme Edition
Storage 970 EVO 500GB
Display(s) 3x Eizo EV2450
Case SilentiumPC Regnum RG4T Pure Black
Power Supply SilentiumPC Vero M2 Bronze 600W
c) Have you tried to use the driver from your Titan's brand's website?
No I haven't, I had no idea that such website even exists

Just a quick google-foo found a solution for your problem. FP64 acceleration by default is disabled, because it gimps FP32 performance. Basically what you are getting is a 1/24 ratio, like on GTX780. If you enable FP64, you'll get your 1:3 ratio, but expect a slight performance drop in FP32.

That's great! Do you know how to switch it on? As far as I'm concerned, I need as much FP64 as it gets me, I don't care gaming performance. Also, if it's as simple as turning it on/off, I could test if it's beneficial for me
 

silentbogo

Moderator
Staff member
Joined
Nov 20, 2013
Messages
5,474 (1.44/day)
Location
Kyiv, Ukraine
System Name WS#1337
Processor Ryzen 7 3800X
Motherboard ASUS X570-PLUS TUF Gaming
Cooling Xigmatek Scylla 240mm AIO
Memory 4x8GB Samsung DDR4 ECC UDIMM
Video Card(s) Inno3D RTX 3070 Ti iChill
Storage ADATA Legend 2TB + ADATA SX8200 Pro 1TB
Display(s) Samsung U24E590D (4K/UHD)
Case ghetto CM Cosmos RC-1000
Audio Device(s) ALC1220
Power Supply SeaSonic SSR-550FX (80+ GOLD)
Mouse Logitech G603
Keyboard Modecom Volcano Blade (Kailh choc LP)
VR HMD Google dreamview headset(aka fancy cardboard)
Software Windows 11, Ubuntu 20.04 LTS
Open NVidia Control Panel, go to "Manage 3D Settings", scroll down a bit and you'll see it as "Double Precision" option.
By default it's set to None. You need to switch it to your Titan.
 
Joined
Nov 4, 2020
Messages
5 (0.00/day)
Processor TR 1920x
Motherboard X399 Asrock Taichi
Cooling be quiet! Dark Rock Pro TR4
Memory 128GB (8x16GB) 3000MHz CL16 Corsair Vengeance LPX Black
Video Card(s) GTX 1070 Zotac AMP! Extreme Edition
Storage 970 EVO 500GB
Display(s) 3x Eizo EV2450
Case SilentiumPC Regnum RG4T Pure Black
Power Supply SilentiumPC Vero M2 Bronze 600W
Open NVidia Control Panel, go to "Manage 3D Settings", scroll down a bit and you'll see it as "Double Precision" option.
By default it's set to None. You need to switch it to your Titan.
1604519613952.png

Yep! That's it. Huge thanks to you man, I was getting so worried.. Thank you so much! :roll:
 
  • Like
Reactions: Rei
Joined
May 8, 2016
Messages
1,741 (0.60/day)
System Name BOX
Processor Core i7 6950X @ 4,26GHz (1,28V)
Motherboard X99 SOC Champion (BIOS F23c + bifurcation mod)
Cooling Thermalright Venomous-X + 2x Delta 38mm PWM (Push-Pull)
Memory Patriot Viper Steel 4000MHz CL16 4x8GB (@3240MHz CL12.12.12.24 CR2T @ 1,48V)
Video Card(s) Titan V (~1650MHz @ 0.77V, HBM2 1GHz, Forced P2 state [OFF])
Storage WD SN850X 2TB + Samsung EVO 2TB (SATA) + Seagate Exos X20 20TB (4Kn mode)
Display(s) LG 27GP950-B
Case Fractal Design Meshify 2 XL
Audio Device(s) Motu M4 (audio interface) + ATH-A900Z + Behringer C-1
Power Supply Seasonic X-760 (760W)
Mouse Logitech RX-250
Keyboard HP KB-9970
Software Windows 10 Pro x64
Keep in mind this will decrease max clock on the card hard in games, and probably make it run hotter/louder on top (because this setting makes hard offset on vGPU, at least that's what it does on my Titan Black).
 
Top