TPU's GPU Database Portal & Updates

T4C Fantasy · May 23, 2018

FP16 and FP64 support added
https://www.techpowerup.com/gpudb/3051/titan-v

and added chip numbers under gpu variants for AMD

TRINITAS · May 24, 2018

Hello,

I would like to take advantage of adding some info about computing:

For AMD:

* GCN5: INT8 (4x FP32) - INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/20 FP32)
* GCN4 / 3: INT8 (4x FP32) - INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/20 FP32) - FP16 (= FP32)
* GCN2 / 1: INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/20 FP32)
* TeraScale 3: INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/20 FP32)
* TeraScale 2/1: INT24 (1/4 FP32) - INT32 (1/4 FP32) - INT64 (1/20 FP32)

For NVIDIA:

* Volta GV100: INT24-INT32 (= FP32) - INT64 (1/5 FP32)
* Pascal GeForce: FP16 (1/64) - INT8 (4x FP32) - INT24-INT32 (1/3 FP32) - INT64 (1/15 FP32)
* Maxwell: Same as Pascal, but not FP16 and INT8.
* Kepler: INT24-INT32 (1/5 FP32) - INT64 (1/20 FP32)
* Fermi GF100 / 110: INT24-INT32 (1/2 FP32) - INT64 (1/8 FP32)
* Other Fermi: INT24-INT32 (1/3 FP32) - INT64 (1/12 FP32)
* Tesla: INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/24 FP32)

AIDA64 software has served me a lot to know the values of calculations INT.

I hope this will enrich the database

T4C Fantasy · May 24, 2018

TRINITAS said:
Hello,

I would like to take advantage of adding some info about computing:

For AMD:

* GCN5: INT8 (4x FP32) - INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/20 FP32)
* GCN4 / 3: INT8 (4x FP32) - INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/20 FP32) - FP16 (= FP32)
* GCN2 / 1: INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/20 FP32)
* TeraScale 3: INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/20 FP32)
* TeraScale 2/1: INT24 (1/4 FP32) - INT32 (1/4 FP32) - INT64 (1/20 FP32)

For NVIDIA:

* Volta GV100: INT24-INT32 (= FP32) - INT64 (1/5 FP32)
* Pascal GeForce: FP16 (1/64) - INT8 (4x FP32) - INT24-INT32 (1/3 FP32) - INT64 (1/15 FP32)
* Maxwell: Same as Pascal, but not FP16 and INT8.
* Kepler: INT24-INT32 (1/5 FP32) - INT64 (1/20 FP32)
* Fermi GF100 / 110: INT24-INT32 (1/2 FP32) - INT64 (1/8 FP32)
* Other Fermi: INT24-INT32 (1/3 FP32) - INT64 (1/12 FP32)
* Tesla: INT24 (= FP32) - INT32 (1/5 FP32) - INT64 (1/24 FP32)

AIDA64 software has served me a lot to know the values of calculations INT.

I hope this will enrich the database

our 32 and 64 should be accurate for nvidia and most of amd

https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#maximize-instruction-throughput

our pascal FP16 is incorrect right now but volta GV100 is 1/2 FP32

can you link a similar amd document

TRINITAS · May 24, 2018

It is strange that the results of the AIDA64 tests do not reflect the data entrusted by NVIDIA. Unless it's a question of instructions, where there will not really be any ratio sets.

T4C Fantasy · May 24, 2018

TRINITAS said:
It is strange that the results of the AIDA64 tests do not reflect the data entrusted by NVIDIA. Unless it's a question of instructions, where there will not really be any ratio sets.

AIDA64 is wrong for that specific thing i mentioned

right now my concern is FP16 for pascal i know we are not correct, but our 32 and 64 should be 100%

and amd ill find out more but im 80% sure its ok now that you mention gcn 3 as 1:1 for FP16

TRINITAS · May 24, 2018

Do you know any benchmarking software for FP16?

T4C Fantasy · May 24, 2018

TRINITAS said:
Do you know any benchmarking software for FP16?

i wish!! FP16 is my new project now, FP64 seems to be useless unless you have ECC memory and i lost interest in it. but FP16 can improve games and im interested

help me find out if GCN 3.0 really has 1:1 FP16 i need documents though

TRINITAS · May 24, 2018

Source (in French): https://www.hardware.fr/articles/968-3/specifications-cartes.html

For FP64, i understand. And for other: INT8-16-24-32-64 ?

TRINITAS · May 25, 2018

Indeed: All Pascal (Except GP100) have FP16 (1/64 FP32), but all Pascal have INT8 (4:1) and INT16 (2:1)

T4C Fantasy · May 25, 2018

TRINITAS said:
Indeed: All Pascal (Except GP100) have FP16 (1/64 FP32), but all Pascal have INT8 (4:1) and INT16 (2:1)

yeah its going to take more time to fix but i know the calculation now xD

TRINITAS · May 25, 2018

More infos for Intel HD Graphics

:

=>Each EU has a 128-bit wide FPU that natively executes eight 16-bit or four 32-bit operations per clock cycle (Clarkdale, Arrandale, Sandy Bridge, and after)
=>FP64 (1/4 FP32) (Bay Trail, Ivy Bridge, Haswell, Braswell, Broadwell, Skylake, Gemini Lake, Kaby Lake, Coffee Lake)
=>FP64 (1/8 FP32) (Apollo Lake)
=>FP16 (2:1 FP32) (Skylake, Gemini Lake, Kaby Lake, Coffee Lake)

T4C Fantasy · May 25, 2018

TRINITAS said:
More infos for Intel HD Graphics :

=>Each EU has a 128-bit wide FPU that natively executes eight 16-bit or four 32-bit operations per clock cycle (Clarkdale, Arrandale, Sandy Bridge, and after)
=>FP64 (1/4 FP32) (Bay Trail, Ivy Bridge, Haswell, Braswell, Broadwell, Skylake, Gemini Lake, Kaby Lake, Coffee Lake)
=>FP64 (1/8 FP32) (Apollo Lake)
=>FP16 (2:1 FP32) (Skylake, Gemini Lake, Kaby Lake, Coffee Lake)

I will add all of that but please provide proof buddy

TRINITAS · May 25, 2018

Wikipédia for the moment

T4C Fantasy · May 25, 2018

TRINITAS said:
Wikipédia for the moment

all intel chips with support updated

TRINITAS · May 25, 2018

RX Vega M-GH/GL are already Polaris? False Vega?

T4C Fantasy · May 25, 2018

TRINITAS said:
RX Vega M-GH/GL are already Polaris? False Vega?

yes its more Polaris than Vega and under NDA says its Polaris 22

they thought that just because it has HBCC that they could call it Vega, its a GFX 8 chip, Vega is GFX 9

RX Vega M
Graphics/Compute: GFX8 (gfx804)
Display Core Engine: 11.2
Unified Video Decoder: 6.3
Video Compression Engine: 3.4
ROCm Support

RX Vega 64
Graphics/Compute: GFX9 (gfx900)
Display Core Engine: 12.0
Unified Video Decoder: 7.0
Video Compression Engine: 4.0
ROCm Support

TRINITAS · May 25, 2018

Ok

Ah, for other GPU:

Fermi GF110/GF100-GL (Quadro/Tesla): FP64 (1/2 FP32)
Fermi GF110/GF100 (GeForce): FP64 (1/8 FP32)
Fermi GF11x/GF10x (GeForce/Quadro): FP64 (1/12 FP32)
Tesla (GT200 only - GeForce/Quadro/Tesla): FP64 (1/8 FP32)

T4C Fantasy · May 25, 2018

TRINITAS said:
Ok

Ah, for other GPU:

Fermi GF110/GF100-GL (Quadro/Tesla): FP64 (1/2 FP32)
Fermi GF110/GF100 (GeForce): FP64 (1/8 FP32)
Fermi GF11x/GF10x (GeForce/Quadro): FP64 (1/12 FP32)
Tesla (GT200 only - GeForce/Quadro/Tesla): FP64 (1/8 FP32)

all nvidia calculations are by cuda version, we should be all set for nvidia (besides pascal FP16 atm)

TRINITAS · May 25, 2018

Ok,

I see OpenCL for Radeon HD2000 and HD3000. These generation don't support OpenCL, only from HD4000. They use ATI CAL (Used up to GCN1)

T4C Fantasy · May 25, 2018

TRINITAS said:
Ok,

I see OpenCL for Radeon HD2000 and HD3000. These generation don't support OpenCL, only from HD4000. They use ATI CAL (Used up to GCN1)

https://www.techpowerup.com/forums/threads/amd-graphics-ip.243974/
OpenCL was in starting on the R600 series (CTM) Close to metal

never mind CTM is its seperate thing. ill fix after

TRINITAS · May 25, 2018

Ah? So why no software like LuxMark and other don't detect HD3870 in OpenCL with the same driver than my HD4890, where it is detected instead?? strange

T4C Fantasy · May 25, 2018

TRINITAS said:
Ah? So why no software like LuxMark and other don't detect HD3870 in OpenCL with the same driver than my HD4890, where it is detected instead?? strange

it was my bad, ATi made a software called Close to Metal for R600 Series and switched to CL later on, i thought CTM was the beta name for CL

i updated the Graphics IP page

TRINITAS said:
Ah? So why no software like LuxMark and other don't detect HD3870 in OpenCL with the same driver than my HD4890, where it is detected instead?? strange

your right we dont have anything Cuda 2.0 and below, Cuda 1.3 is 1/8 its GT200 etc, you confused me with the GF100 stuff because you repeat it.

find the Cuda version for the fermis you said and that is the unified rate. for that version

TRINITAS · May 25, 2018

Ok, i understand

I forget for IGP AMD APU Excavator "Carrizo" and "Bristol Ridge": FP64 (1:2 FP32)

T4C Fantasy · May 25, 2018

TRINITAS said:
Ok, i understand

I forget for IGP AMD APU Excavator "Carrizo" and "Bristol Ridge": FP64 (1:2 FP32)

i fixed R600 to be no CL support and R700 now has 1.0 --> 1.1

TRINITAS said:
Ok, i understand

I forget for IGP AMD APU Excavator "Carrizo" and "Bristol Ridge": FP64 (1:2 FP32)

FP16 fixed in Pascal, Carrizo etc. fixed

TRINITAS · May 25, 2018

For Nvidia, it is clear that it is easier to find the information, since they communicate a lot on their GPU.

But AMD is really heartbreaking, especially that in the same generation of architecture, we can have different ratios, as the case of Bristol Ridge against others of its version as Fiji or Tonga.

System Name	Whaaaat Kiiiiiiid!
Processor	Intel Core i9-14900K @ Default
Motherboard	Gigabyte Z690 AORUS Elite AX DDR4
Cooling	Corsair H150i AIO Cooler
Memory	Corsair Dominator Platinum 128GB DDR4-3200
Video Card(s)	EVGA GeForce RTX 3080 FTW3 ULTRA @ Default
Storage	Samsung 970 PRO 512GB + Crucial MX500 2TB x3 + Crucial MX500 4TB + Samsung 980 PRO 1TB
Display(s)	27" LG 27MU67-B 4K, + 27" Acer Predator XB271HU 1440P
Case	Thermaltake Core X9 Snow
Audio Device(s)	Logitech G PRO X 2 Lightspeed
Power Supply	SeaSonic Platinum 1050W Snow Silent
Mouse	Logitech G903 Lightspeed
Keyboard	Logitech G915 X Lightspeed
Software	Windows 11 Pro
Benchmark Scores	FFXV: 19329

System Name	Game computer
Processor	AMD RyZen 7 5800X3D 4.35GHZ
Motherboard	ASRock X470 Taichi
Cooling	be quiet! Pure Rock 2 Black
Memory	32768 Mo DDR4-3200 G-Skill CL16
Video Card(s)	AMD Radeon RX 7900 GRE (x2)
Storage	SSD Samsung 970 EVO M2 250 Go, Samsung 970 EVO M2 500 Go, Samsung 850 EVO SATA 500 Go, Toshiba 4 To
Display(s)	AOC 24' 1440p 144 Hz DisplayPort + ACER KG251Q 24' 1080p 144 Hz DisplayPort
Case	NZXT Phantom Black
Audio Device(s)	Corsair Gaming VOID Pro RGB Wireless Special Edition
Power Supply	BeQuiet Straight Power 11 1000W
Mouse	Roccat Kone XTD
Keyboard	BTC USB
Software	Windows 11 24H2 Pro x64

System Name	Whaaaat Kiiiiiiid!
Processor	Intel Core i9-14900K @ Default
Motherboard	Gigabyte Z690 AORUS Elite AX DDR4
Cooling	Corsair H150i AIO Cooler
Memory	Corsair Dominator Platinum 128GB DDR4-3200
Video Card(s)	EVGA GeForce RTX 3080 FTW3 ULTRA @ Default
Storage	Samsung 970 PRO 512GB + Crucial MX500 2TB x3 + Crucial MX500 4TB + Samsung 980 PRO 1TB
Display(s)	27" LG 27MU67-B 4K, + 27" Acer Predator XB271HU 1440P
Case	Thermaltake Core X9 Snow
Audio Device(s)	Logitech G PRO X 2 Lightspeed
Power Supply	SeaSonic Platinum 1050W Snow Silent
Mouse	Logitech G903 Lightspeed
Keyboard	Logitech G915 X Lightspeed
Software	Windows 11 Pro
Benchmark Scores	FFXV: 19329

System Name	Game computer
Processor	AMD RyZen 7 5800X3D 4.35GHZ
Motherboard	ASRock X470 Taichi
Cooling	be quiet! Pure Rock 2 Black
Memory	32768 Mo DDR4-3200 G-Skill CL16
Video Card(s)	AMD Radeon RX 7900 GRE (x2)
Storage	SSD Samsung 970 EVO M2 250 Go, Samsung 970 EVO M2 500 Go, Samsung 850 EVO SATA 500 Go, Toshiba 4 To
Display(s)	AOC 24' 1440p 144 Hz DisplayPort + ACER KG251Q 24' 1080p 144 Hz DisplayPort
Case	NZXT Phantom Black
Audio Device(s)	Corsair Gaming VOID Pro RGB Wireless Special Edition
Power Supply	BeQuiet Straight Power 11 1000W
Mouse	Roccat Kone XTD
Keyboard	BTC USB
Software	Windows 11 24H2 Pro x64

System Name	Whaaaat Kiiiiiiid!
Processor	Intel Core i9-14900K @ Default
Motherboard	Gigabyte Z690 AORUS Elite AX DDR4
Cooling	Corsair H150i AIO Cooler
Memory	Corsair Dominator Platinum 128GB DDR4-3200
Video Card(s)	EVGA GeForce RTX 3080 FTW3 ULTRA @ Default
Storage	Samsung 970 PRO 512GB + Crucial MX500 2TB x3 + Crucial MX500 4TB + Samsung 980 PRO 1TB
Display(s)	27" LG 27MU67-B 4K, + 27" Acer Predator XB271HU 1440P
Case	Thermaltake Core X9 Snow
Audio Device(s)	Logitech G PRO X 2 Lightspeed
Power Supply	SeaSonic Platinum 1050W Snow Silent
Mouse	Logitech G903 Lightspeed
Keyboard	Logitech G915 X Lightspeed
Software	Windows 11 Pro
Benchmark Scores	FFXV: 19329

TPU's GPU Database Portal & Updates

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS

Attachments

T4C Fantasy

CPU & GPU DB Maintainer

TRINITAS