• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Nvidia GTX300 series Supposed Benchmarks....and what it could mean for ATI's 5800's

Joined
Oct 6, 2009
Messages
2,820 (0.53/day)
Location
Midwest USA
System Name My Gaming System
Processor Intel i7 4770k @ 4.4 Ghz
Motherboard Asus Maximus VI Impact (ITX)
Cooling Custom Full System Water cooling Loop
Memory G.Skill 1866 Mhz Sniper 8 Gb
Video Card(s) EVGA GTX 780 ti SC
Storage Samsung SSD EVO 120GB - Samsung SSD EVO 500GB
Display(s) ASUS W246H - 24" - widescreen TFT active matrix LCD
Case Bitfenix Prodigy
Power Supply Corsair AX760 Modular PSU
Software Windows 8.1 Home Primeum
3 GPUs on a single card and not a CAD program to use them. I doubt they'd spend the money to R&D that trainwreck.

Touche my good friend....... But there was nothing for Xfire at first either:) Nothing would ever work with out first a dream to do it.

They slap a 400W cooler on the 5970, and that is why it is so long

Yeah speaking of that since every Nvidia card I have ever owned has run hotter than ATI's (maybe just my experience) I wonder how hot that bad boy of Femi is going to run?

60C idle and 90C to 100C under load LOL
 
Joined
Sep 25, 2007
Messages
5,965 (0.99/day)
Location
New York
Processor AMD Ryzen 9 5950x, Ryzen 9 5980HX
Motherboard MSI X570 Tomahawk
Cooling Be Quiet Dark Rock Pro 4(With Noctua Fans)
Memory 32Gb Crucial 3600 Ballistix
Video Card(s) Gigabyte RTX 3080, Asus 6800M
Storage Adata SX8200 1TB NVME/WD Black 1TB NVME
Display(s) Dell 27 Inch 165Hz
Case Phanteks P500A
Audio Device(s) IFI Zen Dac/JDS Labs Atom+/SMSL Amp+Rivers Audio
Power Supply Corsair RM850x
Mouse Logitech G502 SE Hero
Keyboard Corsair K70 RGB Mk.2
VR HMD Samsung Odyssey Plus
Software Windows 10
the GTX380 will without a doubt be faster than the HD5870 but the question is will ATI release the HD6870 next year which will use the same architecture as the GTX380(mimd based) and stomp it into the ground, now this is news today, AMD is going to aim at releasing a Globalfoundries for GPU and CPU production, bypassing TSMC, and there are rumors that there will be no HD6870 released next year now, but there probably not true, since actual development on the 6K's are locked in specification.
 

Binge

Overclocking Surrealism
Joined
Sep 15, 2008
Messages
6,979 (1.23/day)
Location
PA, USA
System Name Molly
Processor i5 3570K
Motherboard Z77 ASRock
Cooling CooliT Eco
Memory 2x4GB Mushkin Redline Ridgebacks
Video Card(s) Gigabyte GTX 680
Case Coolermaster CM690 II Advanced
Power Supply Corsair HX-1000
Touche my good friend....... But there was nothing for Xfire at first either:) Nothing would ever work with out first a dream to do it.

There still isn't a multi-gpu CAD program that is used in the industry. ATI loses big bucks just to save face against NV with these dual GPU solutions.

Yeah speaking of that since every Nvidia card I have ever owned has run hotter than ATI's (maybe just my experience) I wonder how hot that bad boy of Femi is going to run?

60C idle and 90C to 100C under load LOL

Funny this 5870 I have in my rig right now is the first ATI card, besides the 5850, that's been more cool than 55nm nVidia products. I'm speaking in terms of degree C per FPS.
 
Last edited:
Joined
Oct 6, 2009
Messages
2,820 (0.53/day)
Location
Midwest USA
System Name My Gaming System
Processor Intel i7 4770k @ 4.4 Ghz
Motherboard Asus Maximus VI Impact (ITX)
Cooling Custom Full System Water cooling Loop
Memory G.Skill 1866 Mhz Sniper 8 Gb
Video Card(s) EVGA GTX 780 ti SC
Storage Samsung SSD EVO 120GB - Samsung SSD EVO 500GB
Display(s) ASUS W246H - 24" - widescreen TFT active matrix LCD
Case Bitfenix Prodigy
Power Supply Corsair AX760 Modular PSU
Software Windows 8.1 Home Primeum
the GTX380 will without a doubt be faster than the HD5870 but the question is will ATI release the HD6870 next year which will use the same architecture as the GTX380(mimd based) and stomp it into the ground, now this is news today, AMD is going to aim at releasing a Globalfoundries for GPU and CPU production, bypassing TSMC, and there are rumors that there will be no HD6870 released next year now, but there probably not true.

Where did you see this not that I doubt you....

Funny this 5870 I have in my rig right now is the first card, besides the 5850, that's been more cool than 55nm nVidia products.

That's why I said maybe it's me..... Because every Nvidia card I have ever owned has been hotter than my ATI cards..... Hence 8800GT vs. my 3850 and GTX 385 FTW vs. 4890. Both of those cards at least for me were way cooler. That might of been your experience but that has never been mine.
 

Binge

Overclocking Surrealism
Joined
Sep 15, 2008
Messages
6,979 (1.23/day)
Location
PA, USA
System Name Molly
Processor i5 3570K
Motherboard Z77 ASRock
Cooling CooliT Eco
Memory 2x4GB Mushkin Redline Ridgebacks
Video Card(s) Gigabyte GTX 680
Case Coolermaster CM690 II Advanced
Power Supply Corsair HX-1000
I was just confirming that it's you. :laugh:
 
Joined
Sep 25, 2007
Messages
5,965 (0.99/day)
Location
New York
Processor AMD Ryzen 9 5950x, Ryzen 9 5980HX
Motherboard MSI X570 Tomahawk
Cooling Be Quiet Dark Rock Pro 4(With Noctua Fans)
Memory 32Gb Crucial 3600 Ballistix
Video Card(s) Gigabyte RTX 3080, Asus 6800M
Storage Adata SX8200 1TB NVME/WD Black 1TB NVME
Display(s) Dell 27 Inch 165Hz
Case Phanteks P500A
Audio Device(s) IFI Zen Dac/JDS Labs Atom+/SMSL Amp+Rivers Audio
Power Supply Corsair RM850x
Mouse Logitech G502 SE Hero
Keyboard Corsair K70 RGB Mk.2
VR HMD Samsung Odyssey Plus
Software Windows 10

next gen discrete graphics for ati(HD6K) is rumored to come in 2011(this could mean Nov2010-Jan 2011 though)

http://www.brightsideofnews.com/new...h-globalfoundries-gpu-production-in-2011.aspx

but the cards could also be delayed for development on the GPU for the Xbox 3
 
Joined
Oct 6, 2009
Messages
2,820 (0.53/day)
Location
Midwest USA
System Name My Gaming System
Processor Intel i7 4770k @ 4.4 Ghz
Motherboard Asus Maximus VI Impact (ITX)
Cooling Custom Full System Water cooling Loop
Memory G.Skill 1866 Mhz Sniper 8 Gb
Video Card(s) EVGA GTX 780 ti SC
Storage Samsung SSD EVO 120GB - Samsung SSD EVO 500GB
Display(s) ASUS W246H - 24" - widescreen TFT active matrix LCD
Case Bitfenix Prodigy
Power Supply Corsair AX760 Modular PSU
Software Windows 8.1 Home Primeum
I was just confirming that it's you.

I don't want to get on this too much because it is off topic...... But from some of the posts I checked I'm not the only one who thinks that:) But I'm sure if you look around it would probably even it's self out...... Some individual cards run hotter than others.

But just from my experience it's been that way.

next gen discrete graphics for ati(HD6K) is rumored to come in 2011(this could mean Nov2010-Jan 2011 though)

Ahh CC I have seen that but I called that Two years really maybe it should be looked at in a year and a half. 2009 to 2011.

But it could be brought out sooner if they need too I think you are right around Nov 2010

But that brings me back to my previous point ..... "possible Nvidia is making these to compete with the 6800 series in mind."
 
Last edited:

jjFarking

New Member
Joined
Nov 30, 2009
Messages
456 (0.09/day)
Location
Brisbane, Australia
System Name Frankenstein C3
Processor AMD Phenom II X4 965BE C3
Motherboard Asus M4A79T DeLuxe
Cooling Aywun A1-V10
Memory Corsair XMS3 DDR3 (2x2GB @ 1333MHz)
Video Card(s) XFX HD 5850 @ 775/1125
Storage Seagate 250gb/500gb/1tb
Display(s) BENQ E2200HD
Case Thermaltake Element S (VK60001W2Z)
Audio Device(s) X-Fi Elite Pro
Power Supply Zalman ZM600-HP
Software Windows 7 Ultimate 32-bit
I think that the more closely you look at the actual figures, the more fake they have to be.
Just on one figure alone, if, on paper, the 'GTX380' looks like a beast, how come that same paper then kind of contradicts itself to being a 1.74 teraflops card, when the hd5870 is already 2.72 teraflops (the 5970 is 5+ teraflops)?
Also, with a fairly mild OC, the hd5850 will outdo a 'GTX360'.. Not exactly comforting stats for nVidia when you think about it.
Additionally, there's going to be an issue that ATI already came across with the 5970 - the ATX standard. Once you go over 300w, you go beyond that standard. I am unaware of any change happening soon to that standard, unless someone else here has more info?

I think someone had their hand(s) on their genitals when posting those stats..
:rolleyes:
 
Joined
Apr 30, 2008
Messages
4,875 (0.84/day)
Location
Multidimensional
System Name Boomer Master Race
Processor AMD Ryzen 7 7735HS APU
Motherboard BareBones Mini PC MB
Cooling Mini PC Cooling
Memory Crucial 32GB 4800MHz
Video Card(s) AMD Radeon 680M 8GB IGPU
Storage Crucial 500GB M.2 SSD + 2TB Ext HDD
Display(s) Sony 4K Bravia X85J 43Inch TV 120Hz
Case Beelink Mini PC Chassis
Audio Device(s) Built In Realtek Digital Audio HD
Power Supply 120w Power Brick
Mouse Logitech G203 Lightsync
Keyboard Atrix RGB Slim Keyboard
VR HMD ( â—” Ę–ĚŻ â—” )
Software Windows 10 Home 64bit
Benchmark Scores Don't do them anymore.
yeah true about ATI's cards having more teraflops than Nvidias cards but i just thought it was different architecture and driver support!
 
Joined
May 25, 2009
Messages
236 (0.04/day)
Processor AMD R5 7600X
Motherboard Asrock X670E Pro RS
Cooling Noctua NH-15S
Memory 2*16 GB 5600 CL34
Video Card(s) XFX 6800XT 319 Merc
Storage Samsung 970 Evo
Power Supply Super Flower 850 Gold
Mouse Steelseries 310
I think that the more closely you look at the actual figures, the more fake they have to be.
Just on one figure alone, if, on paper, the 'GTX380' looks like a beast, how come that same paper then kind of contradicts itself to being a 1.74 teraflops card, when the hd5870 is already 2.72 teraflops (the 5970 is 5+ teraflops)?
Also, with a fairly mild OC, the hd5850 will outdo a 'GTX360'.. Not exactly comforting stats for nVidia when you think about it.
Additionally, there's going to be an issue that ATI already came across with the 5970 - the ATX standard. Once you go over 300w, you go beyond that standard. I am unaware of any change happening soon to that standard, unless someone else here has more info?

I think someone had their hand(s) on their genitals when posting those stats..
:rolleyes:

even @1000 a 5870 might lag even on neutral ground (unless Cat 9.12/10.1 unlocks the idle 160 SP) but still +1 :slap:
 
Joined
Apr 30, 2008
Messages
4,875 (0.84/day)
Location
Multidimensional
System Name Boomer Master Race
Processor AMD Ryzen 7 7735HS APU
Motherboard BareBones Mini PC MB
Cooling Mini PC Cooling
Memory Crucial 32GB 4800MHz
Video Card(s) AMD Radeon 680M 8GB IGPU
Storage Crucial 500GB M.2 SSD + 2TB Ext HDD
Display(s) Sony 4K Bravia X85J 43Inch TV 120Hz
Case Beelink Mini PC Chassis
Audio Device(s) Built In Realtek Digital Audio HD
Power Supply 120w Power Brick
Mouse Logitech G203 Lightsync
Keyboard Atrix RGB Slim Keyboard
VR HMD ( â—” Ę–ĚŻ â—” )
Software Windows 10 Home 64bit
Benchmark Scores Don't do them anymore.
Joined
Aug 30, 2009
Messages
4,012 (0.75/day)
Location
Sarasota, Florida, USA
System Name Awesomesauce 4.3 | Laptop (MSI GE72VR 6RF Apache Pro-023)
Processor Intel Core i7-5820K 4.16GHz 1.28v/3GHz 1.05v uncore | Intel Core i7-6700HQ @ 3.1GHz
Motherboard Gigabyte GA-X99-UD5 WiFi LGA2011-v3| Stock
Cooling Corsair H100i v2 w/ 2x EK Vardar F4-120ER + various 120/140mm case fans | Stock
Memory G.Skill RJ-4 16GB DDR4-2666 CL15 quad channel | 12GB DDR4-2133
Video Card(s) EVGA GTX 1080 Ti Hybrid SC2 11GB @ 2012/5151 boost | NVIDIA GTX 1060 6GB +200/+500 + Intel 530
Storage Samsung 840 EVO 500GB + Seagate 3TB 7200RPM + others | Kingston 256GB M.2 SATA + 1TB 7200RPM
Display(s) Acer G257HU 1440p 60Hz AH-IPS 4ms | 17.3" 1920*1080 60Hz wide angle TN notebook panel
Case Fractal Design Define XL R2 | MSI
Audio Device(s) Creative Sound Blaster Z | Realtek with quad stereo speakers and subwoofer
Power Supply Corsair HX850i Platinum | 19.5v 180w Delta brick
Software Windows 10 Pro x64 | Windows 10 Home x64
Joined
Nov 27, 2006
Messages
2,106 (0.33/day)
System Name Norfree
Processor i5 3570k @4.4
Motherboard Gigabyte UD5H
Cooling 212 Evo
Memory 4x4GB Kingston 1600 @ 1833 9cl
Video Card(s) Sapphire Nitro Fury
Storage Corsair SSD, WD Black
Display(s) 1080p TV
Case Corsair 300-R
Audio Device(s) Auzentech Prelude > Fidelio X2s and AD-700s
Power Supply PCP&C Silent 950w
Software Win 10 Pro 64
edit. sigh, not needed.
 
Joined
Jun 1, 2006
Messages
1,745 (0.27/day)
Location
The Nevada Wasteland
System Name 9th Level
Processor AMD Ryzen 5 5600X
Motherboard MSI X570 Carbon wifi
Cooling EK Basic 360, x2 250mm, x1 140mm, x1 120mm fans.
Memory 32GB Corsair Vengeance 3200mhz.
Video Card(s) EVGA RTX 3080 12GB FTW3
Storage 500gb ssd, 2tb ssd, 6tb HD.
Display(s) MSI 27" Curved 1440p@165hz
Case HAF 932
Power Supply Corsair HX850W
Software Windows 10 64bit
It means nothing these are fabricated by nvidia. I don't believe anything until wiz benchmarks it.
 

Benetanegia

New Member
Joined
Sep 11, 2009
Messages
2,680 (0.50/day)
Location
Reaching your left retina.
Just on one figure alone, if, on paper, the 'GTX380' looks like a beast, how come that same paper then kind of contradicts itself to being a 1.74 teraflops card, when the hd5870 is already 2.72 teraflops (the 5970 is 5+ teraflops)?
Also, with a fairly mild OC, the hd5850 will outdo a 'GTX360'.. Not exactly comforting stats for nVidia when you think about it.

Raw peak GFlops means nothing. Actual sustained Gflops is what matters.

Here you can see actual Gflop numbers from SGEMM matrix multiplication benchmark: http://www.brightsideofnews.com/new...-1tflops---27x-faster-than-nvidia-gt200!.aspx

But as of SC09, the top five performing products for SGEMM 4K x 4K are as follows [do note that multi-GPU products are excluded as they don't run SGEMM]:
1. Intel Larrabee [LRB, 45nm] - 1006 GFLOPS
2. EVGA GeForce GTX 285 FTW - 425 GFLOPS
3. nVidia Tesla C1060 [GT200, 65nm] - 370 GFLOPS
4. AMD FireStream 9270 [RV770, 55nm] - 300 GFLOPS

5. IBM PowerXCell 8i [Cell, 65nm] - 164 GFLOPS

Firestream RV770, advertised 1200 Gflops, actual performance 300 Gflops, 300/1200 = 0.25
Tesla C1060 GT200, advertised 933 Gflops with dual issue and 622 gflops without, actual 370 Gflops, 370/622 = 0.6
GTX285 GT200, advertised 1062/708, actual 425 Gflops, 425/708 = 0.6

Next generation, projected performance on SGEMM:

RV870 --> 2.72 Gflops * 0.25 = 680 Gflops real
GT300 --> 1.74 Gflops * 0.6 = 1044 Gflops real
 
Joined
Feb 18, 2006
Messages
5,147 (0.78/day)
Location
AZ
System Name Thought I'd be done with this by now
Processor i7 11700k 8/16
Motherboard MSI Z590 Pro Wifi
Cooling Be Quiet Dark Rock Pro 4, 9x aigo AR12
Memory 32GB GSkill TridentZ Neo DDR4-4000 CL18-22-22-42
Video Card(s) MSI Ventus 2x Geforce RTX 3070
Storage 1TB MX300 M.2 OS + Games, + cloud mostly
Display(s) Samsung 40" 4k (TV)
Case Lian Li PC-011 Dynamic EVO Black
Audio Device(s) onboard HD -> Yamaha 5.1
Power Supply EVGA 850 GQ
Mouse Logitech wireless
Keyboard same
VR HMD nah
Software Windows 10
Benchmark Scores no one cares anymore lols
Raw peak GFlops means nothing. Actual sustained Gflops is what matters.

Here you can see actual Gflop numbers from SGEMM matrix multiplication benchmark: http://www.brightsideofnews.com/new...-1tflops---27x-faster-than-nvidia-gt200!.aspx



Firestream RV770, advertised 1200 Gflops, actual performance 300 Gflops, 300/1200 = 0.25
Tesla C1060 GT200, advertised 933 Gflops with dual issue and 622 gflops without, actual 370 Gflops, 370/622 = 0.6
GTX285 GT200, advertised 1062/708, actual 425 Gflops, 425/708 = 0.6

Next generation, projected performance on SGEMM:

RV870 --> 2.72 Gflops * 0.25 = 680 Gflops real
GT300 --> 1.74 Gflops * 0.6 = 1044 Gflops real

numbers on dual precision aren't being backed up by even the most basic of google searches.

http://en.wikipedia.org/wiki/Nvidia_Tesla

http://www.engadget.com/2008/06/16/amds-firestream-9250-first-processor-to-top-1-teraflop/

on single precision nvidia advertizes that 933 number you list yet you randomly divided by a number I see nowhere else on the net. Again for ati on single precision they list 1200gflops so as an apples to apples comparison those are the numbers to use. That data is looking like an nvdia bias where none was needed, nvidia was already faster.

370/933 = .396
300/1200 = .25

much better ratio than ati's .25 (though I'm impressed they were able to maintain 25% of advertized on raw computing when they can only utilize 20% of the advertized stream processors at one time)

So to do this ridiculous assumption without a random bias dataset that appears nowhere else on the net.

RV870 --> 2.72 Gflops * 0.25 = 680 Gflops real
GT300 --> 1.74 Gflops * 0.396 = 689.04 Gflops real

again we must indicate that this is single precision, on dual precision the gt200 advertized 78gflops whereas the rv770 advertised 200 gflops. If anyone has a dual precision bench, please share it.
 
Joined
May 15, 2008
Messages
2,863 (0.49/day)
Location
Brasil
System Name Sovereign // HTPC
Processor i7 3770k 4.2 // i7 3770k 4.2
Motherboard Maximus V Gene // Sabertooth Z77
Cooling Noctua D14 // Intel HSF
Memory 16GB Samsung // 16GB VengeanceLP
Video Card(s) Deciding // 660 DC2
Storage OS (X25-M), Data (Seagate 1TB) // Samsung 840 120GB & bunch of drives
Display(s) Samsung T240HD // LG TV
Case TJ08e // Grandia GD08
Audio Device(s) DT880 Pro 250 ohm // TV speakers
Power Supply Seasonic Plat 1000 // Seasonic Gold 760
Software Windows 8 Pro x64 // Windows 7 Pro x64
Fake FAKE FaKe.... Bad graphs bad
 
Joined
Sep 25, 2007
Messages
5,965 (0.99/day)
Location
New York
Processor AMD Ryzen 9 5950x, Ryzen 9 5980HX
Motherboard MSI X570 Tomahawk
Cooling Be Quiet Dark Rock Pro 4(With Noctua Fans)
Memory 32Gb Crucial 3600 Ballistix
Video Card(s) Gigabyte RTX 3080, Asus 6800M
Storage Adata SX8200 1TB NVME/WD Black 1TB NVME
Display(s) Dell 27 Inch 165Hz
Case Phanteks P500A
Audio Device(s) IFI Zen Dac/JDS Labs Atom+/SMSL Amp+Rivers Audio
Power Supply Corsair RM850x
Mouse Logitech G502 SE Hero
Keyboard Corsair K70 RGB Mk.2
VR HMD Samsung Odyssey Plus
Software Windows 10
one thing about the SP's on Radeon cards is that there is ongoing debate if whether the FLOPS are calculated correctly, I for one think its flawed but thats me, I personally think ATI needs to quit with the whole, oh we have 800 or 1600 shaders, im not saying its not true but theres more accurate ways to say it than that.
 

Binge

Overclocking Surrealism
Joined
Sep 15, 2008
Messages
6,979 (1.23/day)
Location
PA, USA
System Name Molly
Processor i5 3570K
Motherboard Z77 ASRock
Cooling CooliT Eco
Memory 2x4GB Mushkin Redline Ridgebacks
Video Card(s) Gigabyte GTX 680
Case Coolermaster CM690 II Advanced
Power Supply Corsair HX-1000
Fake FAKE FaKe.... Bad graphs bad

Thanks for the stunningly original statement! Man I can do this all day. Correct me if I'm wrong, but haven't we covered the possibility from the moment the graphs were posted?
 
Joined
Jul 14, 2006
Messages
2,411 (0.37/day)
Location
People's Republic of America
System Name It's just a computer
Processor i9-9900K Direct Die
Motherboard eVGA Z390 Dark
Cooling Dual D5T Vario, XSPC BayRes, Nemesis GTR560, NF-A14-iPPC3000PWM, NF-A14-iPPC2000, HK IV Pro Nickel
Memory G.Skill F4-4500C19D-16GTZKKE or G.Skill F4-3600C16D-16GTZ or G.Skill F4-4000C19D-32GTZSW
Video Card(s) eVGA RTX2080 FTW3 Ultra
Storage Samsung 960 EVO M.2
Display(s) LG 32GK650F
Case Thermaltake Xaser VI
Audio Device(s) Auzentech X-Meridian 7.1 2G/Z-5500
Power Supply Seasonic Prime PX-1300
Mouse Logitech
Keyboard Logitech
Software Win7 Ultimate x64 SP1
Joined
May 4, 2009
Messages
1,970 (0.36/day)
Location
Bulgaria
System Name penguin
Processor R7 5700G
Motherboard Asrock B450M Pro4
Cooling Some CM tower cooler that will fit my case
Memory 4 x 8GB Kingston HyperX Fury 2666MHz
Video Card(s) IGP
Storage ADATA SU800 512GB
Display(s) 27' LG
Case Zalman
Audio Device(s) stock
Power Supply Seasonic SS-620GM
Software win10
It's all about programing. If your software can utilize the low level hardware, then you can use all of the SPs and hit those 2,7 Gflops. There was some guy who reached 1 Gflop with the HD4870 a while back ( or more precisely half a 4870x2 :) )

Edit: Found it http://forum.beyond3d.com/showthread.php?t=54842
 
Last edited:
Joined
Nov 13, 2007
Messages
10,232 (1.70/day)
Location
Austin Texas
Processor 13700KF Undervolted @ 5.6/ 5.5, 4.8Ghz Ring 200W PL1
Motherboard MSI 690-I PRO
Cooling Thermalright Peerless Assassin 120 w/ Arctic P12 Fans
Memory 48 GB DDR5 7600 MHZ CL36
Video Card(s) RTX 4090 FE
Storage 2x 2TB WDC SN850, 1TB Samsung 960 prr
Display(s) Alienware 32" 4k 240hz OLED
Case SLIGER S620
Audio Device(s) Yes
Power Supply Corsair SF750
Mouse Xlite V2
Keyboard RoyalAxe
Software Windows 11
Benchmark Scores They're pretty good, nothing crazy.
It's all about programing. If your software can utilize the low level hardware, then you can use all of the SPs and hit those 2,7 Gflops. There was some guy who reached 1 Gflop with the HD4870 a while back

Edit: Found it http://forum.beyond3d.com/showthread.php?t=54842

thats what I thought... SGEMM is just an algorithm, and how you hit that tflop rating is up to the software...

HPC labs are always tweaking their software to get the fastest speeds, and they are doing the same matrix operations as the bench.

thanks for confirming :toast:
 

troyrae360

New Member
Joined
Feb 2, 2009
Messages
1,129 (0.20/day)
Location
Christchurch New Zealand
System Name My Computer!
Processor AMD 6400+ Black @ 3.5
Motherboard Gigabyte AM2+ GA-MA790X DS4
Cooling Gigabyte G-Power 2 pro
Memory 2x2 gig Adata 800
Video Card(s) HD3870x2 @ 900gpu and 999mem
Storage 2x wd raid edition 120gig + 1 samsung 320 + samsung 250
Display(s) Samsung 40inch series6 full HD 1080p
Case NZXT Lexa
Audio Device(s) ALC889A HD audio with Enables a Superior Audio Experience (on board)
Power Supply Vantec ION2+ 550w
Software Vista Home pream 64
Joined
Jun 16, 2008
Messages
3,175 (0.55/day)
Location
Brockport, NY
System Name Is rly gud
Processor Intel Core i5 11600kf
Motherboard Asus Prime Z590-V ATX
Memory (48GB total) 16GB (2x8GB) Crucial Ballistix Sport 3000MHZ and G. Skill Ripjaws 32GB 3200MHZ (2x16GB)
Video Card(s) GIGABYTE RTX 3060 12GB
Storage 1TB MSI Spatium M370 NVMe M.2 SSD
Display(s) 32" Viewsonic 4k, 34" Samsung 3440x1440, XP Pen Creative Pro 13.3
Power Supply EVGA 600 80+ Gold
VR HMD Meta Quest Pro, Tundra Trackers
Software Windows 10

Benetanegia

New Member
Joined
Sep 11, 2009
Messages
2,680 (0.50/day)
Location
Reaching your left retina.
It's all about programing. If your software can utilize the low level hardware, then you can use all of the SPs and hit those 2,7 Gflops. There was some guy who reached 1 Gflop with the HD4870 a while back ( or more precisely half a 4870x2 :) )

Edit: Found it http://forum.beyond3d.com/showthread.php?t=54842

thats what I thought... SGEMM is just an algorithm, and how you hit that tflop rating is up to the software...

HPC labs are always tweaking their software to get the fastest speeds, and they are doing the same matrix operations as the bench.

thanks for confirming :toast:

Yeah, but those are unrealistic. You can always find a benchmark that will make a higher use of the resources of a single architecture, pure matrix multiplication is one of the kind, but that is never going to be the case with real applications. The guy on the link is trying to find the best scenario for 4096x4096 matrix-multiplication on the Ati cards (finding the best block size, etc.), get out of that* and performance decreases dramatically. SGEMM tries to represent the performance that you would find in real applications.

*Only using different block size that could be required so that the code is portable to other GPUs or even the CPU in a setup with Ati cards, decreases performance dramatically, as you can see in the link. Add in the fact that different matrix size will benefit from different block size and add in some linear code or some transcendental functions and performance drops dramatically in the case of Ati.

Also the kind of optimization the guy is making there, goes absolutely against the purpose of API's like OpenCL. What's the point of making your code in OpenCL (because it runs in any platform) if you are going to make a different executable for every architecture? In that case you might want to port your code to the native API of each architecture and have the benefits aswell.
 
Top