techPowerUp! Forums

Go Back   techPowerUp! Forums > Our Software > GPU-Z

Reply
 
Thread Tools
Old Nov 8, 2012, 05:19 PM   #1
jasonx
 
Join Date: Nov 2012
Posts: 2 (0.01/day)
Thanks: 0
Thanked 0 Times in 0 Posts

GPU-Z readings mismatch

GPU-Z shows 2 different pixel Fill Rates, between 2 different versions.

from 0.5.7 to 0.6.6 - 10.1 GPixles/s
0.5.6- 23 GPixles/s

which one is correct

jasonx is offline  
Reply With Quote
Old Nov 8, 2012, 05:37 PM   #2
W1zzard
Benevolent Dictator
 
W1zzard's Avatar
 
Join Date: May 2004
Location: Stuttgart, Germany
Posts: 13,793 (4.18/day)
Thanks: 184
Thanked 10,293 Times in 3,176 Posts
Send a message via ICQ to W1zzard Send a message via AIM to W1zzard Send a message via MSN to W1zzard

System Specs

http://googleitfor.me/?q=gpu-z+gtx+460+fillrate
W1zzard is offline  
Reply With Quote
The Following User Says Thank You to W1zzard For This Useful Post:
Old Nov 8, 2012, 05:48 PM   #3
temp02
200 Posts
 
Join Date: Mar 2009
Posts: 490 (0.32/day)
Thanks: 0
Thanked 171 Times in 158 Posts

That or this juicy first post.

And yes, the second screen-shot (most recent version of GPU-Z) is the correct one.
temp02 is offline  
Reply With Quote
Old Nov 8, 2012, 06:18 PM   #4
T4C Fantasy
CPU & GPU DB Maintainer
 
T4C Fantasy's Avatar
 
Join Date: May 2012
Location: USA, Rhode Island
Posts: 873 (2.28/day)
Thanks: 208
Thanked 366 Times in 215 Posts
Send a message via AIM to T4C Fantasy Send a message via MSN to T4C Fantasy Send a message via Skype™ to T4C Fantasy

System Specs

Quote:
Originally Posted by W1zzard View Post
hahahah I laughed so hard at this comment even if it wasn't meant to be funny
__________________
GPU Database <-- Want Me To Add A Specific Non-Reference GPU Model Not Listed? Send Me A PM
GPU Database Thread
Rare GPUs / Unreleased GPUs
List Your Favorite GPU
GPU-Z 0.6.7+ Screenshots!

CPU Database
T4C Fantasy is offline  
Reply With Quote
Old Nov 8, 2012, 07:20 PM   #5
jasonx
 
Join Date: Nov 2012
Posts: 2 (0.01/day)
Thanks: 0
Thanked 0 Times in 0 Posts

thanx for the answer and the trolling at the same time, i did search maybe i missed the correct search query or look in the wrong sub forum, but anyways thanx
jasonx is offline  
Reply With Quote
Old Nov 8, 2012, 07:36 PM   #6
95Viper
3500 Posts
 
95Viper's Avatar
 
Join Date: Oct 2008
Location: στο άλφα έως ωμέγα
Posts: 3,843 (2.28/day)
Thanks: 2,034
Thanked 1,418 Times in 1,117 Posts

System Specs

Welcome to TPU, jasonx!

W1zzard gets to troll or anything W1zzard wants to do... it is his site.

However, there are a lot of other Trolls here and you can hit the little triangle, with the exclamation in it, to report them or any violation... the mods here will respond and are very fair.

Again, Welcome, and feel free to contribute.
95Viper is offline  
Reply With Quote
Old Nov 8, 2012, 10:30 PM   #7
W1zzard
Benevolent Dictator
 
W1zzard's Avatar
 
Join Date: May 2004
Location: Stuttgart, Germany
Posts: 13,793 (4.18/day)
Thanks: 184
Thanked 10,293 Times in 3,176 Posts
Send a message via ICQ to W1zzard Send a message via AIM to W1zzard Send a message via MSN to W1zzard

System Specs

Quote:
Originally Posted by jasonx View Post
thanx for the answer and the trolling at the same time, i did search maybe i missed the correct search query or look in the wrong sub forum, but anyways thanx
I must have answered that question like 20 times and keep wondering why people can't find it. no offense, welcome to the forums
W1zzard is offline  
Reply With Quote
The Following User Says Thank You to W1zzard For This Useful Post:
Old Nov 8, 2012, 10:35 PM   #8
Maban
1000 Posts
 
Maban's Avatar
 
Join Date: Mar 2008
Location: Minnesota
Posts: 1,925 (1.01/day)
Thanks: 805
Thanked 554 Times in 362 Posts

System Specs

Perhaps add something about it in the fillrate tooltip only for Fermi? Though I'm sure it would go unnoticed by most.
Maban is offline  
5 Million points folded for TPU
Reply With Quote
The Following User Says Thank You to Maban For This Useful Post:
Old Jan 2, 2013, 06:24 PM   #9
Flickspeed
 
Join Date: Jan 2013
Posts: 4 (0.03/day)
Thanks: 0
Thanked 0 Times in 0 Posts

Pixel Fillrate Calculation

Hey are you sure you are using the correct way to calculate pixel fillrate in the current versions? I read the other threads and see some inconsistencies.

It seems the pixel fillrate is still not calculated properly for Fermi Cards. Are you taking into account the following information?

Each Streaming Multiprocessor(SM) in the GPU of GF100 architecture contains 32 SPs and 4 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF104/106/108 architecture contains 48 SPs and 8 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF110 architecture contains 32 SPs and 4 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF114/116/118/119 architecture contains 48 SPs and 8 SFUs.

Each SP can fulfill up to two single precision operations FMA per clock. Each SFU can fulfill up to four operations SF per clock. The approximate ratio of operations FMA to operations SF is equal: for GF100 4:1 and for GF104/106/108 3:1. The theoretical shader performance in single-precision floating point operations(FMA) [FLOPSsp, GFLOPS] of the graphics card with shader count [n] and shader frequency [f, GHz], is estimated by the following: FLOPSsp ≈ f × n × 2. Alternative formula: for GF100 FLOPSsp ≈ f × m × (32 SPs × 2(FMA)) and for GF104/106/108 FLOPSsp ≈ f × m × (48 SPs × 2(FMA)). [m] - SM count. Total Processing Power: for GF100 FLOPSsp ≈ f × m ×(32 SPs × 2(FMA) + 4 × 4 SFUs) and for GF104/106/108 FLOPSsp ≈ f × m × (48 SPs × 2(FMA) + 4 × 8 SFUs) or for GF100 FLOPSsp ≈ f × n × 2.5 and for GF104/106/108 FLOPSsp ≈ f × n × 8 / 3.[16] where:
SP - Shader Processor (Unified Shader, CUDA Core), SFU - Special Function Unit, SM - Streaming Multiprocessor, FMA - Fused MUL+ADD.

Based on this information the current calculation method is wrong!

Last edited by Flickspeed; Jan 2, 2013 at 07:12 PM.
Flickspeed is offline  
Reply With Quote
Old Jan 2, 2013, 06:27 PM   #10
T4C Fantasy
CPU & GPU DB Maintainer
 
T4C Fantasy's Avatar
 
Join Date: May 2012
Location: USA, Rhode Island
Posts: 873 (2.28/day)
Thanks: 208
Thanked 366 Times in 215 Posts
Send a message via AIM to T4C Fantasy Send a message via MSN to T4C Fantasy Send a message via Skype™ to T4C Fantasy

System Specs

Quote:
Originally Posted by Flickspeed View Post
Hey are you sure you are using the correct way to calculate pixel fillrate in the current versions? I read the other threads and see some inconsistencies.

It seems the pixel fillrate is still not calculated properly for Fermi Cards. Are you taking into account the following information?

Each Streaming Multiprocessor(SM) in the GPU of GF100 architecture contains 32 SPs and 4 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF104/106/108 architecture contains 48 SPs and 8 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF110 architecture contains 32 SPs and 4 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF114/116/118/119 architecture contains 48 SPs and 8 SFUs.

Each SP can fulfill up to two single precision operations FMA per clock. Each SFU can fulfill up to four operations SF per clock. The approximate ratio of operations FMA to operations SF is equal: for GF100 4:1 and for GF104/106/108 3:1. The theoretical shader performance in single-precision floating point operations(FMA) [FLOPSsp, GFLOPS] of the graphics card with shader count [n] and shader frequency [f, GHz], is estimated by the following: FLOPSsp ≈ f × n × 2. Alternative formula: for GF100 FLOPSsp ≈ f × m × (32 SPs × 2(FMA)) and for GF104/106/108 FLOPSsp ≈ f × m × (48 SPs × 2(FMA)). [m] - SM count. Total Processing Power: for GF100 FLOPSsp ≈ f × m ×(32 SPs × 2(FMA) + 4 × 4 SFUs) and for GF104/106/108 FLOPSsp ≈ f × m × (48 SPs × 2(FMA) + 4 × 8 SFUs) or for GF100 FLOPSsp ≈ f × n × 2.5 and for GF104/106/108 FLOPSsp ≈ f × n × 8 / 3.[16] where:
SP - Shader Processor (Unified Shader, CUDA Core), SFU - Special Function Unit, SM - Streaming Multiprocessor, FMA - Fused MUL+ADD.

Based on this information the current calculation method is wrong! Please recheck. For example the GTX 460 has 7 SM's for a total of 7*48 = 336 SP's!!!
check out the gpu database, it uses the latest known calculation for Fermi
http://www.techpowerup.com/gpudb/265...e_GTX_460.html
__________________
GPU Database <-- Want Me To Add A Specific Non-Reference GPU Model Not Listed? Send Me A PM
GPU Database Thread
Rare GPUs / Unreleased GPUs
List Your Favorite GPU
GPU-Z 0.6.7+ Screenshots!

CPU Database
T4C Fantasy is offline  
Reply With Quote
Old Jan 2, 2013, 06:29 PM   #11
Flickspeed
 
Join Date: Jan 2013
Posts: 4 (0.03/day)
Thanks: 0
Thanked 0 Times in 0 Posts

Quote:
Originally Posted by T4C Fantasy View Post
check out the gpu database, it uses the latest known calculation for Fermi
http://www.techpowerup.com/gpudb/265...e_GTX_460.html
Yep and I say that's wrong. Because the assumption was that Each SM could do 2 operations, but it's not each SM, it's each SP
Flickspeed is offline  
Reply With Quote
Old Jan 2, 2013, 06:30 PM   #12
Maban
1000 Posts
 
Maban's Avatar
 
Join Date: Mar 2008
Location: Minnesota
Posts: 1,925 (1.01/day)
Thanks: 805
Thanked 554 Times in 362 Posts

System Specs

Quote:
Originally Posted by Flickspeed View Post
Hey are you sure you are using the correct way to calculate pixel fillrate in the current versions? I read the other threads and see some inconsistencies.

It seems the pixel fillrate is still not calculated properly for Fermi Cards. Are you taking into account the following information?

Each Streaming Multiprocessor(SM) in the GPU of GF100 architecture contains 32 SPs and 4 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF104/106/108 architecture contains 48 SPs and 8 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF110 architecture contains 32 SPs and 4 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF114/116/118/119 architecture contains 48 SPs and 8 SFUs.

Each SP can fulfill up to two single precision operations FMA per clock. Each SFU can fulfill up to four operations SF per clock. The approximate ratio of operations FMA to operations SF is equal: for GF100 4:1 and for GF104/106/108 3:1. The theoretical shader performance in single-precision floating point operations(FMA) [FLOPSsp, GFLOPS] of the graphics card with shader count [n] and shader frequency [f, GHz], is estimated by the following: FLOPSsp ≈ f × n × 2. Alternative formula: for GF100 FLOPSsp ≈ f × m × (32 SPs × 2(FMA)) and for GF104/106/108 FLOPSsp ≈ f × m × (48 SPs × 2(FMA)). [m] - SM count. Total Processing Power: for GF100 FLOPSsp ≈ f × m ×(32 SPs × 2(FMA) + 4 × 4 SFUs) and for GF104/106/108 FLOPSsp ≈ f × m × (48 SPs × 2(FMA) + 4 × 8 SFUs) or for GF100 FLOPSsp ≈ f × n × 2.5 and for GF104/106/108 FLOPSsp ≈ f × n × 8 / 3.[16] where:
SP - Shader Processor (Unified Shader, CUDA Core), SFU - Special Function Unit, SM - Streaming Multiprocessor, FMA - Fused MUL+ADD.

Based on this information the current calculation method is wrong! Please recheck. For example the GTX 460 has 7 SM's for a total of 7*48 = 336 SP's!!!
What the hell are you going on about? This thread is about pixel fill rate not shader count. You are obviously trying to figure floating point performance. That is entirely out of the scope of this thread.
Maban is offline  
5 Million points folded for TPU
Reply With Quote
Old Jan 2, 2013, 06:34 PM   #13
Flickspeed
 
Join Date: Jan 2013
Posts: 4 (0.03/day)
Thanks: 0
Thanked 0 Times in 0 Posts

Quote:
Originally Posted by Maban View Post
What the hell are you going on about? This thread is about pixel fill rate not shader count. You are obviously trying to figure floating point performance. That is entirely out of the scope of this thread.
Maban you can disregard anything after the last sentence in red.

I am just making corrections to the following post which wizzard made a base for calculations. It has a fundental error.

Quote:
Originally Posted by 3dc_member View Post
The pixel fillrate in GPU-Z is displayed wrong for Nvidia Fermi based graphics cards. The pixel fillrate seems to be calculated by multiplying the number of ROPs and the GPU clock. But in case of Fermi gpus the pixel fillrate is generally not limited by the number of ROPs but by the number of streaming multiprocessors. Each streaming multiprocessor is capable of processing two pixels per clock. So if there are 16 SMs and 48 ROPs like in the GeForce GTX 580, the SMs limit the pixel fillrate. This is the case for all Fermi based graphics cards i know.
Having more ROPs than pixels that can be processed per clock help to sustain a high pixel fillrate when using multiple samples per pixel (i.e. multisampling antialiasing) but the peak pixel fillrate is limited by the stream processors.
Check out these benchmarks by hardware.fr (scroll down to section 'Fillrate'): http://www.hardware.fr/articles/806-...x-580-sli.html.
The measured peak pixel fillrate of the GeForce GTX 580 is 23,3 GPixel/s. Simply multiplying the 48 ROPs with the 772 MHz gpu clock would give you a peak pixel fillrate of 37,1 GPixel/s. But as the pixel fillrate is limited by the streaming multiprocessors, the peak fillrate is only 16*2*772 MPixel/s = 24,7 GPixel/s. This number corresponds well to the measurement taken by hardware.fr.
If you look at non fermi graphics cards you will see that the measured peak pixel fillrate corresponds well to the product of number of ROPs and gpu clock.

Many reviews cite the wrong peak pixel fillrate for Fermi cards and Nvidia doesn't publish pixel fillrate numbers on the product pages. But knowing the Fermi architectural properties you can easily calculate the right peak pixel fillrate. I hope that GPU-Z will be fixed in a way to show the right peak pixel fillrate on Nvidia Fermi graphics cards.
It is not the SM that is limiting anything Thats the fundamental error I marked it in red So at the end of the day, it is still ROPs times MHz. Prove me wrong and give me the source saying an SM can only process 2 Pixels per clock.

Last edited by Flickspeed; Jan 2, 2013 at 06:46 PM.
Flickspeed is offline  
Reply With Quote
Old Jan 2, 2013, 06:50 PM   #14
Maban
1000 Posts
 
Maban's Avatar
 
Join Date: Mar 2008
Location: Minnesota
Posts: 1,925 (1.01/day)
Thanks: 805
Thanked 554 Times in 362 Posts

System Specs

Quote:
Originally Posted by Flickspeed View Post
Maban you can disregard anything after the last sentence in red.

I am just making corrections to the following post which wizzard made a base for calculations. It has a fundental error.



It is not the SM that is limiting anything Thats the fundamental error I marked it in red So at the end of the day, it is still ROPs times MHz. Prove me wrong and give me the source saying an SM can only process 2 Pixels per clock.
Read the white papers.
Maban is offline  
5 Million points folded for TPU
Reply With Quote
Old Jan 2, 2013, 06:54 PM   #15
Flickspeed
 
Join Date: Jan 2013
Posts: 4 (0.03/day)
Thanks: 0
Thanked 0 Times in 0 Posts

Quote:
Originally Posted by Maban View Post
Read the white papers.
Show me where is says an SM is limited to two pixels per clock in the white papers. Links please.....

I don't work for nvidia and I am not an nvidia fan boy, I am just not liking the fact of misreporting theoretical pixel fillrate values without any valid proof. The start would be to prove that an SM can only do 2 pixels per clock, I couldn't find this anywhere on www except in some post here

Maban instead of posting useless comments you can start here: http://www.nvidia.com/content/PDF/fe...Whitepaper.pdf good luck and have fun.

Also if there is no proof I would like to ask w1zzard to make GPU-Z calculate Theoretical Pixel Fillrates based on the old formula.

Last edited by Flickspeed; Jan 2, 2013 at 07:21 PM.
Flickspeed is offline  
Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump

Similar Threads
Thread Thread Starter Forum Replies Last Post
gpu-z asus rog 0.6.5. gpu readings DarkStalker GPU-Z 10 Oct 12, 2012 07:21 PM
Gpu-z Readings questions shankly1985 GPU-Z 7 Sep 25, 2012 08:50 PM
GPU-Z 0.6.0 sensor readings sykozis GPU-Z 2 Apr 15, 2012 08:44 PM
GPU mismatch Everestar NVIDIA 7 Feb 25, 2012 06:23 AM
strange gpu clock readings on 3870 sike89 AMD / ATI 3 Oct 25, 2008 03:31 AM


All times are GMT. The time now is 03:31 AM.


Powered by vBulletin® Version 3.8.6
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.
no new posts