techPowerUp! Forums

Go Back   techPowerUp! Forums > Hardware > Graphics Cards > AMD / ATI

Reply
 
Thread Tools
Old Nov 14, 2008, 11:01 PM   #1
lemonadesoda
Eligible for custom title
 
lemonadesoda's Avatar
 
Join Date: Aug 2006
Posts: 5,337 (2.17/day)
Thanks: 749
Thanked 960 Times in 710 Posts

System Specs

R700 has too many shaders and not enough ROPs

OK, so here is a quick analysis of the data at http://www.hexus.net/content/item.php?item=16300



And now for a quick analysis: look at the Excel table below. The numbers in black are data from Hexus. The numbers in red are my calculations.


Clock standardised FPS takes the FPS and scales it as though all clocks were at 4850 speeds, ie 625Mhz. It shows (approx) what the FPS would be if we OC the 4830 and underclock the 4870.

FPS per shader takes this standardised FPS and divides by the number of shaders on the GPU.

LOOK! The 4830 is the most efficient card (100%), with the crossfire (2x) 4830 (92%) coming next and the 4850/4870 somewhere behind (78%/81%). In COD4, the 4830 is about 25% more effective per clock per shader than the 4850/4870! Why is the 4870 better than 4850 if they have the same shaders and we standardised the clocks? Because of the faster memory.

BUT even with faster memory, is still underperforms the 4830 clock for clock, shader for shader. The 4830 is 25% faster.

So what gives? Easy, there are only 640 shaders on the 4830 but 40 texture units and 16 ROPs just like 4850/70.

THEREFORE, the 4830 is a more optimal combination of shaders, texture units and ROPs. Since texture units are 40 compared to 16 on the previous gen cards 3xxx, but the ROPs remain at 16, then I conclude that the ROPs is the primary bottleneck on the card, followed by the texture units. Why because adding more shaders on the 4850/70 doesnt increase the relative performance.

We need more ROPs and more texture units in R800! Not more shaders! (Relatively speaking).

Last edited by lemonadesoda; Nov 14, 2008 at 11:14 PM.
lemonadesoda is offline  
Reply With Quote
The Following 8 Users Say Thank You to lemonadesoda For This Useful Post:
Old Nov 14, 2008, 11:07 PM   #2
[I.R.A]_FBi
Eligible for custom title
 
[I.R.A]_FBi's Avatar
 
Join Date: May 2007
Location: c:\programs\kitteh.exe
Posts: 6,152 (2.80/day)
Thanks: 724
Thanked 556 Times in 461 Posts

System Specs

good find
__________________
Rig 1+1
Athlon XP 2200+, MSI KM2M Combo, ATI 9200SE 128 MB DDR, 2 X 512 MB DDR333, 250GB + 80 HDD?

“try intel cpu, amd is only good for going to nude sites” -firehawkxd
“go for the 5850 now and play games while the nvidiots wait for the ceo to show an actual working product” -W1zzard
“An MSI logo? This offends my retina. I await your apology.” -MRCL

www.autolounge.com.jm
[I.R.A]_FBi is offline  
Reply With Quote
Old Nov 14, 2008, 11:08 PM   #3
phanbuey
Eligible for custom title
 
phanbuey's Avatar
 
Join Date: Nov 2007
Location: Miami
Posts: 5,010 (2.48/day)
Thanks: 1,484
Thanked 959 Times in 812 Posts

System Specs

very nice
phanbuey is offline  
Reply With Quote
Old Nov 14, 2008, 11:11 PM   #4
Duffman
500 Posts
 
Duffman's Avatar
 
Join Date: Dec 2007
Location: Mt. Pocono, PA
Posts: 812 (0.41/day)
Thanks: 63
Thanked 68 Times in 65 Posts
Send a message via AIM to Duffman Send a message via MSN to Duffman Send a message via Yahoo to Duffman

System Specs

there is a 4830x2? or is that two of them in Xfire?
__________________

My Rig,Black Knight.
Buy My Silverstone TJ09
Duffman is offline  
Reply With Quote
Old Nov 14, 2008, 11:12 PM   #5
spearman914
2000 Posts
 
spearman914's Avatar
 
Join Date: Apr 2008
Location: Brooklyn, New York 11223
Posts: 3,000 (1.61/day)
Thanks: 339
Thanked 510 Times in 482 Posts
Send a message via AIM to spearman914

System Specs

Quote:
Originally Posted by Duffman View Post
there is a 4830x2? or is that two of them in Xfire?
It isn't released so i think it's either future info or just 2 x xfire
__________________
.................................................. ..
“Originally Posted by mlee49 So pain level on a 1 to 10 scale, 10 being having a wart removed on your butthole with pliers.”
spearman914 is offline  
Reply With Quote
Old Nov 14, 2008, 11:15 PM   #6
Duffman
500 Posts
 
Duffman's Avatar
 
Join Date: Dec 2007
Location: Mt. Pocono, PA
Posts: 812 (0.41/day)
Thanks: 63
Thanked 68 Times in 65 Posts
Send a message via AIM to Duffman Send a message via MSN to Duffman Send a message via Yahoo to Duffman

System Specs

i think my eyes were playing tricks on me. I swore it read 4830x2 not x24830
__________________

My Rig,Black Knight.
Buy My Silverstone TJ09
Duffman is offline  
Reply With Quote
Old Nov 14, 2008, 11:22 PM   #7
wolf2009
Guest
 
Posts: n/a (0/day)

nice info !
 
Reply With Quote
Old Nov 14, 2008, 11:28 PM   #8
lemonadesoda
Eligible for custom title
 
lemonadesoda's Avatar
 
Join Date: Aug 2006
Posts: 5,337 (2.17/day)
Thanks: 749
Thanked 960 Times in 710 Posts

System Specs

I guess another comment is that crossfire really has come on a long way. Remember when it was launched? Somewhere between 0% and 40% performance improvement in the best cases. Now we are often seeing 80% FPS improvement on a crossfire setup. ATI have done a good job getting the hardware and drivers to work.
lemonadesoda is offline  
Reply With Quote
Old Nov 14, 2008, 11:46 PM   #9
swaaye
200 Posts
 
Join Date: May 2005
Posts: 210 (0.07/day)
Thanks: 4
Thanked 14 Times in 14 Posts

ATI has been running lopsided like this since R580/X1900 (well, X1600 actually). They always bank on software needing more shader ALU power than texturing/fill-rate. RV770 didn't change that even though it may look like it did on the surface.

Believe it or not but RV770 outperforms even GTX 280 by a long shot on most game shader code. Games just aren't that heavy with it for it to matter and other bottlenecks take over.

More info:
http://forum.beyond3d.com/showthread.php?t=49327

Last edited by swaaye; Nov 15, 2008 at 12:03 AM.
swaaye is offline  
Reply With Quote
Old Nov 14, 2008, 11:53 PM   #10
AddSub
500 Posts
 
AddSub's Avatar
 
Join Date: Aug 2006
Posts: 990 (0.40/day)
Thanks: 7
Thanked 144 Times in 109 Posts

System Specs

Excellent post lemonadesoda. I've been saying this for years (link below) but it is hard to be heard through all the fanboyism on a forum like this. Raw pixel fillrate and throughput is still very important and it will stay as such until all the functions of a GPU are truly streamlined into a some sort of a unified array architecture. Unified shader architecture was one of the steps in that direction, however, plenty of operative functions on-GPU are still done on the side. Many of them through ROP's, as well totally separate subsections dedicated to certain types of texturing, and similar. All outside of shader realm.

As for why ATI/AMD GPU's are ROP/pixled fillrate starved and have been stuck on 16 ROP's since 2004? I answered that question in the post/link below, months ago. (In short, cost.)

Heck, I answered it many times over, here and on other forums but fanboysim noise is hard to evade. As soon as you start putting out names of corporations (AMD/ATI/nVdia, etc) you attract a certain type of "enthusiast" into the conversation and all intelligent exchange is pretty much finished.

http://forums.techpowerup.com/showpo...&postcount=169

Last edited by AddSub; Nov 15, 2008 at 12:11 AM.
AddSub is offline  
Reply With Quote
Old Nov 14, 2008, 11:58 PM   #11
MKmods
Case Mod Guru
 
MKmods's Avatar
 
Join Date: Feb 2008
Location: Nevada
Posts: 4,711 (2.46/day)
Thanks: 2,879
Thanked 1,766 Times in 1,364 Posts

System Specs

we are gonna be hearing great things about the 4830..Nice post lemonadesoda
__________________
Buy tin snips FIRST!
MKmods is offline  
Reply With Quote
Old Nov 15, 2008, 12:26 AM   #12
LAN_deRf_HA
3500 Posts
 
LAN_deRf_HA's Avatar
 
Join Date: Apr 2008
Posts: 4,024 (2.14/day)
Thanks: 290
Thanked 831 Times in 588 Posts

System Specs

So what is the actual performance difference between a 4830 and a 4850 when you clock them both at 625/1986?

Edit*
Just checked the reviews, at the same speeds it doesn't really close the gap much, you need 700 MHz on the core for it to match the 4850, and a number of 4830s can't clock that high... so I'd rather pay $20 more for a 4850.

Last edited by LAN_deRf_HA; Nov 15, 2008 at 12:48 AM.
LAN_deRf_HA is offline  
Reply With Quote
Old Nov 15, 2008, 12:46 AM   #13
a111087
2000 Posts
 
a111087's Avatar
 
Join Date: Apr 2007
Location: US
Posts: 2,521 (1.12/day)
Thanks: 240
Thanked 190 Times in 157 Posts

System Specs

this was the same for HD2xxx series as well
__________________
a111087 is offline  
Reply With Quote
Old Nov 15, 2008, 10:08 PM   #14
lemonadesoda
Eligible for custom title
 
lemonadesoda's Avatar
 
Join Date: Aug 2006
Posts: 5,337 (2.17/day)
Thanks: 749
Thanked 960 Times in 710 Posts

System Specs

I wonder what the results would look like running the 560 shader 4830 (http://forums.techpowerup.com/showth...highlight=4830).

If we similar game performance as the 640 shader 4830 (not just the synthetic shader only benchmark that W1z did) then we know that the RV770 is very seriously ROP handicapped.

I think R8xx needs to be up there at 32 ROPs. Any less and there is going to be that nasty performance wall at 1920x1200 and especially at 2560x1600. (Oh, and that NEW samsung resolution, 2048x1156 http://forums.techpowerup.com/showth...t=wide+monitor)

I was always surprised at how hard that wall was. It didnt make sense. Now it does. It's not a shader limit, is the ROPs and texture units. They just can't cope with ultra-resolutions. (And why crossfire is so effective at these resolutions... it's NOT the extra shaders, it's the extra ROPs and texture units).
lemonadesoda is offline  
Reply With Quote
Old Nov 16, 2008, 10:07 AM   #15
christof139
75 Posts
 
Join Date: Nov 2008
Location: Detroit, Motown, Michigan.
Posts: 118 (0.07/day)
Thanks: 10
Thanked 20 Times in 19 Posts

System Specs

Great idea and charts showing the equalized ratio/percent!!!!

I hope some manufacturer decide to put 512MB GDDR4 on a 4830 (if they have any GDDR4 left in stock, and if so simply to use it up) to make a limited edition model. Even though the GDDR4 won't make a big difference it will give a small gain and it the limited editon model is slightly OC'd from the factory that would just add a bit more performance, maybe 5% overall performance gain for $5 -$15 additional cost.

Anyone seen ATI's roadmap for the next gen. cards?? 12 (model 5000), 16(5200), 20(5400), 24(5600), 30(5800) and 32(5850) ROP's would be the way to go as you all say for a low to high end line of models. 36 (5900) could be possible. That would just be great as long as the price stayed down, and with the world economy as it is the price may not stay down, although it just may not.

A Special Edition GDDR4 HD 4830 or 48xx would also be great now, maybe even with 4 more ROPS added, and/or a 4830x2. Heck, a 4670x2 would be great. A 4670 with 12 ROPs would also be great. There was a 1GB 2600XT x2 card made by Saphire and Visiontech and ATI.

Why doesn't/didn't ATI/AMD add even a mere 32 or 64 or or 96 or 128 bits to the memory interface?? With the present 4000 series cards that would make a decent difference I bet. I guess it is in the die size production problem area; how to size and cut up the wafers most efficiently.

Same for memory. 576MB or or 608MB or 640MB would be interesting. Just another cost-efficiency per wafer cutting deal I guess.

NVIDI and partner manufacturers did and does this so maybe ATI/AMD and/or one or more of their partners may try something like this.

More diversity might be OK. Limited Editions of several models in a number series might also be OK and economically feasible. Beats me.

Chris

Last edited by christof139; Nov 16, 2008 at 11:21 AM.
christof139 is offline  
Reply With Quote
Old Nov 16, 2008, 10:17 AM   #16
btarunr
Editor & Senior Moderator
 
btarunr's Avatar
 
Join Date: Oct 2007
Location: Hyderabad, India
Posts: 14,983 (7.29/day)
Thanks: 788
Thanked 12,911 Times in 5,655 Posts
Send a message via AIM to btarunr Send a message via MSN to btarunr

System Specs



$250 for two HD 4830 cards is better than $400 for one GTX 280. The argument is just like that of two GTX 260 being better than one HD 4870 X2.

http://www.hexus.net/content/item.php?item=16300&page=1

640 SPs is the right combination for the ROPs. 8 ALU clusters.
__________________

Gadgets, Phones, Tablets, Cameras, TVs, HiFi...NextPowerUp
btarunr is offline  
Reply With Quote
Old Nov 16, 2008, 10:26 AM   #17
ShogoXT
500 Posts
 
ShogoXT's Avatar
 
Join Date: Feb 2008
Location: Cincinnati, Ohio
Posts: 935 (0.49/day)
Thanks: 38
Thanked 92 Times in 72 Posts

System Specs

Im not so sure.
ShogoXT is offline  
Reply With Quote
Old Nov 16, 2008, 11:19 AM   #18
EastCoasthandle
Eligible for custom title
 
EastCoasthandle's Avatar
 
Join Date: Apr 2005
Posts: 6,244 (2.11/day)
Thanks: 382
Thanked 1,526 Times in 865 Posts

System Specs

^^Agreed, 2 4830 (with MIR) are $200. A GTX280 would cost $359.99 after MIR. Although some may prefer another brand at $419.99 after MIR. A difference of $160 or $220 if a person prefers another brand.

A 4870 X2 costs $479.99 (after MIR). 2 GTX 260 216s cost $229.99 each totaling $459.98 after MIR. Another brand can cost $10 more a piece after MIR. A difference of $20 or costing similar to the 4870 X2 depending on what people prefer.

So by using a popular online retailer in the states the difference is quite substantial. Therefore, IMO I don't think that buying 2 4830s are the same as buying 2 260 216s when compared to it's competitor high end part.

Keep in mind that prices may change.

Last edited by EastCoasthandle; Nov 16, 2008 at 12:10 PM.
EastCoasthandle is offline  
Reply With Quote
Old Nov 16, 2008, 06:17 PM   #19
ShogoXT
500 Posts
 
ShogoXT's Avatar
 
Join Date: Feb 2008
Location: Cincinnati, Ohio
Posts: 935 (0.49/day)
Thanks: 38
Thanked 92 Times in 72 Posts

System Specs

I mean I understand the logic in the arguement. The thing is though alot of tech website forums (not so much this one) were speculating on the specs of the 4800 series and saying why it would fail against nvidia. They said because of the ROPs it would suck and never stand a chance to the Nvidia 9k series. And would be stomped by the GTX 200s in value right when they came out....

Oh wait a minute.
ShogoXT is offline  
Reply With Quote
Old Nov 16, 2008, 06:22 PM   #20
EastCoasthandle
Eligible for custom title
 
EastCoasthandle's Avatar
 
Join Date: Apr 2005
Posts: 6,244 (2.11/day)
Thanks: 382
Thanked 1,526 Times in 865 Posts

System Specs

Quote:
Originally Posted by ShogoXT View Post
I mean I understand the logic in the arguement. The thing is though alot of tech website forums (not so much this one) were speculating on the specs of the 4800 series and saying why it would fail against nvidia. They said because of the ROPs it would suck and never stand a chance to the Nvidia 9k series. And would be stomped by the GTX 200s in value right when they came out....

Oh wait a minute.
Yup, this is what happened back then. Turned out to be a fabrication.
EastCoasthandle is offline  
Reply With Quote
Old Nov 20, 2008, 11:54 AM   #21
douglatins
2000 Posts
 
douglatins's Avatar
 
Join Date: May 2008
Location: Brasil
Posts: 2,608 (1.42/day)
Thanks: 295
Thanked 268 Times in 204 Posts
Send a message via MSN to douglatins Send a message via Skype™ to douglatins

System Specs

R700 is the HD4870X2 only!
douglatins is offline  
Reply With Quote
Old Nov 20, 2008, 12:56 PM   #22
SiliconSlick
25 Posts
 
Join Date: Nov 2008
Location: USA
Posts: 44 (0.03/day)
Thanks: 15
Thanked 2 Times in 2 Posts

System Specs

Thank you for the linked post of yours

Quote:
Originally Posted by AddSub View Post
Excellent post lemonadesoda. I've been saying this for years (link below) but it is hard to be heard through all the fanboyism on a forum like this. Raw pixel fillrate and throughput is still very important and it will stay as such until all the functions of a GPU are truly streamlined into a some sort of a unified array architecture. Unified shader architecture was one of the steps in that direction, however, plenty of operative functions on-GPU are still done on the side. Many of them through ROP's, as well totally separate subsections dedicated to certain types of texturing, and similar. All outside of shader realm.

As for why ATI/AMD GPU's are ROP/pixled fillrate starved and have been stuck on 16 ROP's since 2004? I answered that question in the post/link below, months ago. (In short, cost.)

Heck, I answered it many times over, here and on other forums but fanboysim noise is hard to evade. As soon as you start putting out names of corporations (AMD/ATI/nVdia, etc) you attract a certain type of "enthusiast" into the conversation and all intelligent exchange is pretty much finished.

http://forums.techpowerup.com/showpo...&postcount=169
Thanks for telling the truth - and good luck not getting your head chopped off.
(things seem rather nice here actually compared to most spots{all spots I've ever seen} I hope it stays that way)
SiliconSlick is offline  
Reply With Quote
Old Nov 20, 2008, 01:21 PM   #23
W1zzard
Benevolent Dictator
 
W1zzard's Avatar
 
Join Date: May 2004
Location: Stuttgart, Germany
Posts: 13,792 (4.18/day)
Thanks: 184
Thanked 10,289 Times in 3,175 Posts
Send a message via ICQ to W1zzard Send a message via AIM to W1zzard Send a message via MSN to W1zzard

System Specs

Quote:
Originally Posted by AddSub View Post
As for why ATI/AMD GPU's are ROP/pixled fillrate starved and have been stuck on 16 ROP's since 2004?
16 rops, 2 pixels per clock = 32 rops effective
W1zzard is offline  
Reply With Quote
Old Nov 20, 2008, 01:44 PM   #24
christof139
75 Posts
 
Join Date: Nov 2008
Location: Detroit, Motown, Michigan.
Posts: 118 (0.07/day)
Thanks: 10
Thanked 20 Times in 19 Posts

System Specs

A few days ago I read an article that stated ATI shaders are less complex than NVIDIA ones, therefore ATI simply adds more shaders to make up for this, and it worked fine in the 4000 series.

Chris
christof139 is offline  
Reply With Quote
Old Nov 20, 2008, 01:48 PM   #25
tigger
I'm the only one
 
tigger's Avatar
 
Join Date: Mar 2006
Location: HU5 1LL
Posts: 7,212 (2.75/day)
Thanks: 473
Thanked 951 Times in 780 Posts
Send a message via MSN to tigger

System Specs

If anyone has a 4830 we could do our own test with it against a 4850 downclocked to the same clocks as the 30.

Is that the same for the 4830 w1zzard,32 effective?
tigger is online now  
Reply With Quote
Reply


Currently Active Users Viewing This Thread: 1 (0 members and 1 guests)
 
Thread Tools

Posting Rules
You may not post new threads
You may not post replies
You may not post attachments
You may not edit your posts

BB code is On
Smilies are On
[IMG] code is On
HTML code is Off

Forum Jump


All times are GMT. The time now is 11:21 PM.


Powered by vBulletin® Version 3.8.6
Copyright ©2000 - 2013, Jelsoft Enterprises Ltd.
no new posts