View Single Post
Old Jan 2, 2013, 06:27 PM   #10
T4C Fantasy
CPU & GPU DB Maintainer
 
T4C Fantasy's Avatar
 
Join Date: May 2012
Location: USA, Rhode Island
Posts: 895 (2.19/day)
Thanks: 215
Thanked 381 Times in 224 Posts
Send a message via AIM to T4C Fantasy Send a message via MSN to T4C Fantasy Send a message via Skype™ to T4C Fantasy

System Specs

Quote:
Originally Posted by Flickspeed View Post
Hey are you sure you are using the correct way to calculate pixel fillrate in the current versions? I read the other threads and see some inconsistencies.

It seems the pixel fillrate is still not calculated properly for Fermi Cards. Are you taking into account the following information?

Each Streaming Multiprocessor(SM) in the GPU of GF100 architecture contains 32 SPs and 4 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF104/106/108 architecture contains 48 SPs and 8 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF110 architecture contains 32 SPs and 4 SFUs.
Each Streaming Multiprocessor(SM) in the GPU of GF114/116/118/119 architecture contains 48 SPs and 8 SFUs.

Each SP can fulfill up to two single precision operations FMA per clock. Each SFU can fulfill up to four operations SF per clock. The approximate ratio of operations FMA to operations SF is equal: for GF100 4:1 and for GF104/106/108 3:1. The theoretical shader performance in single-precision floating point operations(FMA) [FLOPSsp, GFLOPS] of the graphics card with shader count [n] and shader frequency [f, GHz], is estimated by the following: FLOPSsp ≈ f × n × 2. Alternative formula: for GF100 FLOPSsp ≈ f × m × (32 SPs × 2(FMA)) and for GF104/106/108 FLOPSsp ≈ f × m × (48 SPs × 2(FMA)). [m] - SM count. Total Processing Power: for GF100 FLOPSsp ≈ f × m ×(32 SPs × 2(FMA) + 4 × 4 SFUs) and for GF104/106/108 FLOPSsp ≈ f × m × (48 SPs × 2(FMA) + 4 × 8 SFUs) or for GF100 FLOPSsp ≈ f × n × 2.5 and for GF104/106/108 FLOPSsp ≈ f × n × 8 / 3.[16] where:
SP - Shader Processor (Unified Shader, CUDA Core), SFU - Special Function Unit, SM - Streaming Multiprocessor, FMA - Fused MUL+ADD.

Based on this information the current calculation method is wrong! Please recheck. For example the GTX 460 has 7 SM's for a total of 7*48 = 336 SP's!!!
check out the gpu database, it uses the latest known calculation for Fermi
http://www.techpowerup.com/gpudb/265...e_GTX_460.html
__________________
GPU Database <-- Want Me To Add A Specific Non-Reference GPU Model Not Listed? Send Me A PM
GPU Database Thread
Rare GPUs / Unreleased GPUs
List Your Favorite GPU
GPU-Z 0.6.7+ Screenshots!

CPU Database
T4C Fantasy is online now  
Reply With Quote