• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

HD 5870 Discussion thread.

Status
Not open for further replies.

wolf

Performance Enthusiast
Joined
May 7, 2007
Messages
7,747 (1.25/day)
System Name MightyX
Processor Ryzen 5800X3D
Motherboard Gigabyte X570 I Aorus Pro WiFi
Cooling Scythe Fuma 2
Memory 32GB DDR4 3600 CL16
Video Card(s) Asus TUF RTX3080 Deshrouded
Storage WD Black SN850X 2TB
Display(s) LG 42C2 4K OLED
Case Coolermaster NR200P
Audio Device(s) LG SN5Y / Focal Clear
Power Supply Corsair SF750 Platinum
Mouse Corsair Dark Core RBG Pro SE
Keyboard Glorious GMMK Compact w/pudding
VR HMD Meta Quest 3
Software case populated with Artic P12's
Benchmark Scores 4k120 OLED Gsync bliss
True I had forgotten about that the GTX ever existed. My brother has a 9800GTX+ but we always call it just GTX, that's why.

It only peaked my interest as I actually owned a 512mb original run 9800GTX from 'back in the day', oh how I wished it was 1gb....

:toast:
 

Bo_Fox

New Member
Joined
May 29, 2009
Messages
480 (0.09/day)
Location
Barack Hussein Obama-Biden's Nation
System Name Flame Vortec Fatal1ty (rig1), UV Tourmaline Confexia (rig2)
Processor 2 x Core i7's 4+Gigahertzzies
Motherboard BL00DR4G3 and DFI UT-X58 T3eH8
Cooling Thermalright IFX-14 (better than TRUE) 2x push-push, Customized TT Big Typhoon
Memory 6GB OCZ DDR3-1600 CAS7-7-7-1T, 6GB for 2nd rig
Video Card(s) 8800GTX for "free" S3D (mtbs3d.com), 4870 1GB, HDTV Wonder (DRM-free)
Storage WD RE3 1TB, Caviar Black 1TB 7.2k, 500GB 7.2k, Raptor X 10k
Display(s) Sony GDM-FW900 24" CRT oc'ed to 2560x1600@68Hz, Dell 2405FPW 24" PVA (HDCP-free)
Case custom gutted-out painted black case, silver UV case, lots of aesthetics-souped stuff
Audio Device(s) Sonar X-Fi MB, Bernstein audio riser.. what??
Power Supply OCZ Fatal1ty 700W, Iceberg 680W, Fortron Booster X3 300W for GPU
Software 2 partitions WinXP-32 on 2 drives per rig, 2 of Vista64 on 2 drives per rig
Benchmark Scores 5.9 Vista Experience Index... yay!!! What??? :)
Not to sound like a broken record, but just like with a 5770, giving the 5870 more bandwidth will unleash a lot more out of the card. A 4890 is 20% faster than a 5770 despite a 5770 having slight architecture optimizations. Also, a 4890 has 62.5% greater memory bandwidth than a 5770. That's the only thing holding a 5770 back.

Let's give a 5870 100% greater bandwidth by doubling the bus to 512-bit. Perhaps ATI will actually release it in a few months. We'll be seeing 25-30 performance gains with a 100% greater memory bandwidth, if its "half-sized" derivative still could not reach within 20% of a 4890 that had 62.5% more bandwidth.

I think that it could actually be up to a 35% gain overall, as a bigger chip is in an even greater need of memory bandwidth in order to chug along without stuttering hiccups of any kind (in some cases, when the bottleneck is removed, there are much bigger jumps in performance). Nvidia fully realizes this and is probably secretly delaying with re-spins to ensure that it will still beat a beefied-up 512-bit 5870. Of course there's the 5970 (5870X2) to be released for sure, before we'd ever be seeing anything 512-bit again for a short while.

ATI does not need to do 512-bit right now because of the lack of competition. Rich people do not care too much because they can buy 3 or 4 of those 5870's and pray that whatever games they play still scale nicely with 3x or 4x Xfire. ATI does not want to do this until a 5870X2 is out first (which would be too complicated to do 512-bit anyways).

It's nice to try to solve the mystery of a 5870 performing below expectations from as many angles as possible, but there has been overwhelming evidence with a 5770 versus a 4890. Only logic can solve the mystery.


EDIT: It's only my opinion, really.
 
Last edited:
Joined
Apr 2, 2009
Messages
582 (0.11/day)
System Name Flow
Processor AMD Phenom II 955 BE
Motherboard MSI 790fx GD70
Cooling Water
Memory 8gb Crucial Ballistix Tracer Blue ddr3 1600 c8
Video Card(s) 2 x XFX 6850 - Yet to go under water.
Storage Corsair s128
Display(s) HP 24" 1920x1200
Case Custom Lian-Li V2110b
Audio Device(s) Auzentech X-Fi Forte 7.1
Power Supply Corsair 850HX
yeah, but the bandwidth is less important with the 5870 - the "lack" of performance increase matching that of dual gpus is more because of diminishing returns with the scaling of the architecture and drivers not addressing the resources fully than bandwidth.

I dont think the 5770/4890 is an applicable comparison to the 5870/4870x2
 

Bo_Fox

New Member
Joined
May 29, 2009
Messages
480 (0.09/day)
Location
Barack Hussein Obama-Biden's Nation
System Name Flame Vortec Fatal1ty (rig1), UV Tourmaline Confexia (rig2)
Processor 2 x Core i7's 4+Gigahertzzies
Motherboard BL00DR4G3 and DFI UT-X58 T3eH8
Cooling Thermalright IFX-14 (better than TRUE) 2x push-push, Customized TT Big Typhoon
Memory 6GB OCZ DDR3-1600 CAS7-7-7-1T, 6GB for 2nd rig
Video Card(s) 8800GTX for "free" S3D (mtbs3d.com), 4870 1GB, HDTV Wonder (DRM-free)
Storage WD RE3 1TB, Caviar Black 1TB 7.2k, 500GB 7.2k, Raptor X 10k
Display(s) Sony GDM-FW900 24" CRT oc'ed to 2560x1600@68Hz, Dell 2405FPW 24" PVA (HDCP-free)
Case custom gutted-out painted black case, silver UV case, lots of aesthetics-souped stuff
Audio Device(s) Sonar X-Fi MB, Bernstein audio riser.. what??
Power Supply OCZ Fatal1ty 700W, Iceberg 680W, Fortron Booster X3 300W for GPU
Software 2 partitions WinXP-32 on 2 drives per rig, 2 of Vista64 on 2 drives per rig
Benchmark Scores 5.9 Vista Experience Index... yay!!! What??? :)
It's not exactly about comparing a 5870 to a 4870X2. It's just that a half-sized derivative of a 5870 (5770) is obviously held back by half the amount of 5870's bandwidth.

A 5870 could very well be held back even more, as it would need even more to overcome diminishing returns as related to the bottleneck scenario. Just my 2 cents.
 
Joined
Oct 6, 2009
Messages
2,820 (0.53/day)
Location
Midwest USA
System Name My Gaming System
Processor Intel i7 4770k @ 4.4 Ghz
Motherboard Asus Maximus VI Impact (ITX)
Cooling Custom Full System Water cooling Loop
Memory G.Skill 1866 Mhz Sniper 8 Gb
Video Card(s) EVGA GTX 780 ti SC
Storage Samsung SSD EVO 120GB - Samsung SSD EVO 500GB
Display(s) ASUS W246H - 24" - widescreen TFT active matrix LCD
Case Bitfenix Prodigy
Power Supply Corsair AX760 Modular PSU
Software Windows 8.1 Home Primeum
I didn't get to benchmark my 5870 till recently! While My 5870 does play all the games at really nice FPS. In benchmarking that I just did my overall 3d Vantage score is 13880 at stock speeds with Q9550 @ 3.4. I know I will be able to get more out of the card with more over clocking of the card it's self. But I am also really starting to think that Something else on the outside is holding back the true power of this card! Weather it be Drivers or something else. I just wish if it is a ploy by ATI .... that they start implementing the improvements soon!

It's a conspiracy!
 

Bo_Fox

New Member
Joined
May 29, 2009
Messages
480 (0.09/day)
Location
Barack Hussein Obama-Biden's Nation
System Name Flame Vortec Fatal1ty (rig1), UV Tourmaline Confexia (rig2)
Processor 2 x Core i7's 4+Gigahertzzies
Motherboard BL00DR4G3 and DFI UT-X58 T3eH8
Cooling Thermalright IFX-14 (better than TRUE) 2x push-push, Customized TT Big Typhoon
Memory 6GB OCZ DDR3-1600 CAS7-7-7-1T, 6GB for 2nd rig
Video Card(s) 8800GTX for "free" S3D (mtbs3d.com), 4870 1GB, HDTV Wonder (DRM-free)
Storage WD RE3 1TB, Caviar Black 1TB 7.2k, 500GB 7.2k, Raptor X 10k
Display(s) Sony GDM-FW900 24" CRT oc'ed to 2560x1600@68Hz, Dell 2405FPW 24" PVA (HDCP-free)
Case custom gutted-out painted black case, silver UV case, lots of aesthetics-souped stuff
Audio Device(s) Sonar X-Fi MB, Bernstein audio riser.. what??
Power Supply OCZ Fatal1ty 700W, Iceberg 680W, Fortron Booster X3 300W for GPU
Software 2 partitions WinXP-32 on 2 drives per rig, 2 of Vista64 on 2 drives per rig
Benchmark Scores 5.9 Vista Experience Index... yay!!! What??? :)
I didn't get to benchmark my 5870 till recently! While My 5870 does play all the games at really nice FPS. In benchmarking that I just did my overall 3d Vantage score is 13880 at stock speeds with Q9550 @ 3.4. I know I will be able to get more out of the card with more over clocking of the card it's self. But I am also really starting to think that Something else on the outside is holding back the true power of this card! Weather it be Drivers or something else. I just wish if it is a ploy by ATI .... that they start implementing the improvements soon!

It's a conspiracy!

Yeah, the grand conspiracy of being unable to do 512-bit with GDDR5 memory!!! :laugh::toast:
 
Joined
Oct 6, 2009
Messages
2,820 (0.53/day)
Location
Midwest USA
System Name My Gaming System
Processor Intel i7 4770k @ 4.4 Ghz
Motherboard Asus Maximus VI Impact (ITX)
Cooling Custom Full System Water cooling Loop
Memory G.Skill 1866 Mhz Sniper 8 Gb
Video Card(s) EVGA GTX 780 ti SC
Storage Samsung SSD EVO 120GB - Samsung SSD EVO 500GB
Display(s) ASUS W246H - 24" - widescreen TFT active matrix LCD
Case Bitfenix Prodigy
Power Supply Corsair AX760 Modular PSU
Software Windows 8.1 Home Primeum
Yeah, the grand conspiracy of being unable to do 512-bit with GDDR5 memory!!!

See you heard about it too. I knew it was true LOL :)
 
D

Deleted member 67555

Guest
Can Someone Post A Conclusion

I don't feel like reading this whole POST...
Can someone do a quick conclusion please?;)
 

Mussels

Freshwater Moderator
Staff member
Joined
Oct 6, 2004
Messages
58,413 (8.19/day)
Location
Oystralia
System Name Rainbow Sparkles (Power efficient, <350W gaming load)
Processor Ryzen R7 5800x3D (Undervolted, 4.45GHz all core)
Motherboard Asus x570-F (BIOS Modded)
Cooling Alphacool Apex UV - Alphacool Eisblock XPX Aurora + EK Quantum ARGB 3090 w/ active backplate
Memory 2x32GB DDR4 3600 Corsair Vengeance RGB @3866 C18-22-22-22-42 TRFC704 (1.4V Hynix MJR - SoC 1.15V)
Video Card(s) Galax RTX 3090 SG 24GB: Underclocked to 1700Mhz 0.750v (375W down to 250W))
Storage 2TB WD SN850 NVME + 1TB Sasmsung 970 Pro NVME + 1TB Intel 6000P NVME USB 3.2
Display(s) Phillips 32 32M1N5800A (4k144), LG 32" (4K60) | Gigabyte G32QC (2k165) | Phillips 328m6fjrmb (2K144)
Case Fractal Design R6
Audio Device(s) Logitech G560 | Corsair Void pro RGB |Blue Yeti mic
Power Supply Fractal Ion+ 2 860W (Platinum) (This thing is God-tier. Silent and TINY)
Mouse Logitech G Pro wireless + Steelseries Prisma XL
Keyboard Razer Huntsman TE ( Sexy white keycaps)
VR HMD Oculus Rift S + Quest 2
Software Windows 11 pro x64 (Yes, it's genuinely a good OS) OpenRGB - ditch the branded bloatware!
Benchmark Scores Nyooom.
I don't feel like reading this whole POST...
Can someone do a quick conclusion please?;)

people think the 5870 aint as fast as it should be, and are speculating wildly as to why, and if it will get better via drivers.
 
D

Deleted member 67555

Guest
people think the 5870 aint as fast as it should be, and are speculating wildly as to why, and if it will get better via drivers.
Ok, I'm sure it's a driver thing as there is no real Hurry to make these Cards as spectacular as possible for Now:D as it stands i don't think any games will really Tax these things + the competition has release coming soon Right?
 

wolf

Performance Enthusiast
Joined
May 7, 2007
Messages
7,747 (1.25/day)
System Name MightyX
Processor Ryzen 5800X3D
Motherboard Gigabyte X570 I Aorus Pro WiFi
Cooling Scythe Fuma 2
Memory 32GB DDR4 3600 CL16
Video Card(s) Asus TUF RTX3080 Deshrouded
Storage WD Black SN850X 2TB
Display(s) LG 42C2 4K OLED
Case Coolermaster NR200P
Audio Device(s) LG SN5Y / Focal Clear
Power Supply Corsair SF750 Platinum
Mouse Corsair Dark Core RBG Pro SE
Keyboard Glorious GMMK Compact w/pudding
VR HMD Meta Quest 3
Software case populated with Artic P12's
Benchmark Scores 4k120 OLED Gsync bliss
people think the 5870 aint as fast as it should be, and are speculating wildly as to why, and if it will get better via drivers.

:respect: way to sum up 9 pages in 25 words and a number.
 

grimeleven

New Member
Joined
Oct 10, 2009
Messages
19 (0.00/day)
Processor Intel Core i7@3.5Ghz
Motherboard eVGA X58SLI
Cooling TRUE 120 Xtreme
Memory 6GB Aeneon 1866Mhz
Video Card(s) 4870X2 2GB /w AC Xtreme cooler
Storage Vertex 120g
Display(s) Samsung 32 inch LCD 1080p
Case HAF932
Audio Device(s) SB X-Fi
Power Supply Antec TP3 650W
512bit memory bus "upgrade" on 5870's ain't possible, they would need to re-do the architecture hence future products.

Tho in this article they talk CPU's http://www.physorg.com/news151158992.html (it is still similar explanation towards ATI SPU's)

"But the lack of immediate access to individualized memory caches — the "food" of each processor — slows the process down instead of speeding it up once the number of cores exceeds eight, according to a simulation of high-performance computers by Sandia's Richard Murphy, Arun Rodrigues and former student Megan Vance.

"To some extent, it is pointing out the obvious — many of our applications have been memory-bandwidth-limited even on a single core," says Rodrigues. "However, it is not an issue to which industry has a known solution, and the problem is often ignored."

QFT

*Edit* hehe bring back HD2900 pro/XT 1024-bit bi-directional ring bus (512-bit read and 512-bit write,8 64-bit memory channels for a total bus width of 512-bits)! pricey i know.. :D
http://www.anandtech.com/showdoc.aspx?i=3367&p=5
 
Last edited:

Bo_Fox

New Member
Joined
May 29, 2009
Messages
480 (0.09/day)
Location
Barack Hussein Obama-Biden's Nation
System Name Flame Vortec Fatal1ty (rig1), UV Tourmaline Confexia (rig2)
Processor 2 x Core i7's 4+Gigahertzzies
Motherboard BL00DR4G3 and DFI UT-X58 T3eH8
Cooling Thermalright IFX-14 (better than TRUE) 2x push-push, Customized TT Big Typhoon
Memory 6GB OCZ DDR3-1600 CAS7-7-7-1T, 6GB for 2nd rig
Video Card(s) 8800GTX for "free" S3D (mtbs3d.com), 4870 1GB, HDTV Wonder (DRM-free)
Storage WD RE3 1TB, Caviar Black 1TB 7.2k, 500GB 7.2k, Raptor X 10k
Display(s) Sony GDM-FW900 24" CRT oc'ed to 2560x1600@68Hz, Dell 2405FPW 24" PVA (HDCP-free)
Case custom gutted-out painted black case, silver UV case, lots of aesthetics-souped stuff
Audio Device(s) Sonar X-Fi MB, Bernstein audio riser.. what??
Power Supply OCZ Fatal1ty 700W, Iceberg 680W, Fortron Booster X3 300W for GPU
Software 2 partitions WinXP-32 on 2 drives per rig, 2 of Vista64 on 2 drives per rig
Benchmark Scores 5.9 Vista Experience Index... yay!!! What??? :)
512bit memory bus "upgrade" on 5870's ain't possible, they would need to re-do the architecture hence future products.

Tho in this article they talk CPU's http://www.physorg.com/news151158992.html (it is still similar explanation towards ATI SPU's)



QFT

*Edit* hehe bring back HD2900 pro/XT 1024-bit bi-directional ring bus (512-bit read and 512-bit write,8 64-bit memory channels for a total bus width of 512-bits)! pricey i know.. :D
http://www.anandtech.com/showdoc.aspx?i=3367&p=5

Yeah, pricey, but a HD2900XT was not any more expensive than a 5870 really. Well, maybe just a tiny bit more expensive, but a 5870 is being jacked up in prices due to demand/supply.

Here's a bit from a guy who wrote well concerning this subject, but he had a few errors (1024-bit ringbus, not 512-bit, etc..) and a wild assumption that using 512-bit would take up a quarter of the silicon real estate.

For all those people clamoring on about why ATI didn't go with a 512-bit memory controller I'm going to chime in here with some ATI 512-bit experience. If you're a sharp one reading this, you have already guessed that means I'm going to talk about the R600. Now you can hate the card all you want, but I quite enjoyed this card. First of all, in the R600 it was the last ring-bus. It was a true 512-bit, and large memory controller. I'm not certain on the amount, but I believe it owned about a quarter of the realstate on the die. That's a lot. It also was some of the cause for the chip running hot and why UVD was scrapped from that chip to save room.

Now, to keep that 512-bit ring-bus fed, you needed to push large amounts of data to it. The more you increased system bandwidth, the faster the card would be in any task. I've run hundreds of benchmarks over the years and I'm pretty sure Jared and Anand can attest to this. Same goes for overclocking the card. Raising the core didn't do much, but cranking up the VRAM to feed that hungry ring-bus sure did. Prices anyone? I believe $450 and up depending on where you were located. It was on heck of a pricey chip for ATI to make. Enter the die shrunk 3000 series with the 256-bit memory controller and voila. A cheaper chip to make. It never came close to the theoretical performance of the 2900XT, but the 3870 was about 90% of the performance for a lot cheaper. Yeas I know the cores were tweaked and so on in the 3000 series, but they are very similar.

If ATI ever went to a 512-bit bus, which means more PCB layers, higher cost in manufacturing and a larger die, I'd think they'd do it on something like Juniper or wait till 32nm. It's not feasible right now. They technically could go the MCM route with Juniper and get a mashed up version of a 512-bit bus, but I don't think the chips have been designed with that in mind.

Anyways, most computers out there are starved to feed something like the 5870 and higher cards with a 512-bit bus. I just replaced my R600 with an RV740 (hah, went from 80nm to 40nm) and now I don't need to OC the heck out of my bus to keep the card fed. I'm running an old FX-60 setup due to a glowing review on here back in early 2006. Am I the norm? NO, I'm waiting to upgrade. Is the Core i7 9xx the norm? No. You have to build a card to a certain set of people. I'm building my pal a new computer and he's happy with the 5850. The 5870 is overkill for him. It's 80% of the 5870 but a hundred bucks cheaper. Now, I'm sure ATI looked at the 512-bit bus in much the same way. "Wow, that 512-bit bus sure flies, look at those numbers! Oh, it's going to cost us this much die space and more manufacturing costs.... Well, those 256-bit bus numbers are still pretty imperssive and within 80% of the gaming benchmark scores, so we'll go that way"

Or something along those lines....I'm sure that's why nVIDIA's GTX300 is delayed. It's a massive chip, 512-bit bus and so on. Great, they'll take the performance crown again. Will they take my money? If they have something in the $200-$300 range, they have a fighting chance, just like ATI does, or soon to be Intel. Best price for performance will win me over there. I don't care what the bus size is, or how the card could have been better, just as long as I'm happy with the performance for my money. In which case, I'll be here looking forward to a GPU roundoup in the best bang for buck in that price range. Of course it will have DX11, or else there's no point in me upgrading again.
-by PorscheRacer (http://www.anandtech.com/video/showdoc.aspx?i=3643&cp=30#comments)
 
Joined
Oct 1, 2006
Messages
4,884 (0.76/day)
Location
Hong Kong
Processor Core i7-12700k
Motherboard Z690 Aero G D4
Cooling Custom loop water, 3x 420 Rad
Video Card(s) RX 7900 XTX Phantom Gaming
Storage Plextor M10P 2TB
Display(s) InnoCN 27M2V
Case Thermaltake Level 20 XT
Audio Device(s) Soundblaster AE-5 Plus
Power Supply FSP Aurum PT 1200W
Software Windows 11 Pro 64-bit
Yeah, pricey, but a HD2900XT was not any more expensive than a 5870 really. Well, maybe just a tiny bit more expensive, but a 5870 is being jacked up in prices due to demand/supply.

Here's a bit from a guy who wrote well concerning this subject, but he had a few errors (1024-bit ringbus, not 512-bit, etc..) and a wild assumption that using 512-bit would take up a quarter of the silicon real estate.

-by PorscheRacer (http://www.anandtech.com/video/showdoc.aspx?i=3643&cp=30#comments)
On the other hand, the 512-bit bus did not prevent the 2900XT from getting destroyed by the G80.
That leads to AMD redesigning the R600 to RV670 or the 3870 which in many ways surpasses the R600, and the R770 further opitmized the arch.
The Cypress is simply building ontop of the R770's success.

Isn't it intresting that the HD 4670 with 128-bit GDDR3 performs close to the HD 2900 Pro?
In fact the HD 4670 surpasses the 2900XT when AA is used, the R600/RV670 simply gets destroyed in AA.
 
Last edited:

grimeleven

New Member
Joined
Oct 10, 2009
Messages
19 (0.00/day)
Processor Intel Core i7@3.5Ghz
Motherboard eVGA X58SLI
Cooling TRUE 120 Xtreme
Memory 6GB Aeneon 1866Mhz
Video Card(s) 4870X2 2GB /w AC Xtreme cooler
Storage Vertex 120g
Display(s) Samsung 32 inch LCD 1080p
Case HAF932
Audio Device(s) SB X-Fi
Power Supply Antec TP3 650W
Yeah i've found that post earlier too and check this out, (from Eric Demers, architecture lead on R600)
Does a 512-bit bus require a die size that's going to be in the neighbourhood (or bigger) of R600 going forward?
No, through multiple layers of pads, or through distributed pads or even through stacked dies, large memory bit widths are certainly possible. Certainly a certain size and a minimum number of consumers is required to enable this technology, but it's not required to have a large die
... was he fired or something? i guess they went the "cheap" route.
http://www.beyond3d.com/content/interviews/39/5
 
Joined
Nov 4, 2005
Messages
11,676 (1.73/day)
System Name Compy 386
Processor 7800X3D
Motherboard Asus
Cooling Air for now.....
Memory 64 GB DDR5 6400Mhz
Video Card(s) 7900XTX 310 Merc
Storage Samsung 990 2TB, 2 SP 2TB SSDs and over 10TB spinning
Display(s) 56" Samsung 4K HDR
Audio Device(s) ATI HDMI
Mouse Logitech MX518
Keyboard Razer
Software A lot.
Benchmark Scores Its fast. Enough.
This is for eveyone theorizing about a fact that has been proven again and again in this thread.

It doesn't need more bandwidth. There is enough. We need to move forward with the idea that the setup engine is not providing double the amount of information, but throught the obvious use of the tessellation hardware this card & ATI is intending on using acceptable framerates with uber high end graphics. Did they try and steal the performance crown from Nvidia with the 4XXX series? No they openly said so in interviews. They went for the mainstream market, once they had that they threw two chips on a board and went for the top.


Everyone is forgetting this is a new spin on a old design that worked well. The old chips were not bandwidth limited, but the tessellation needs more surface work done, and that is not a primary storage cache bandwidth issue, it is a internal cache issue, which is where they came up with the insane bandwidth the internal cache has. The raw amount of data needed for good resolution and great looking visuals is no longer going to need to be increase by a insane amout, instead the hardware takes care of generating thousands of extra triangles and surface rendering, leaving the models smaller and more compact, thus using .........less bandwidth.


So, we have a much better looking surface, with few original traingles and fewer orginal vertex points, the tessellator being used to generate thousands if not millions of extra points, and lines.
 

Attachments

  • verysmartverystupid.jpg
    verysmartverystupid.jpg
    24.4 KB · Views: 361
Joined
Oct 1, 2006
Messages
4,884 (0.76/day)
Location
Hong Kong
Processor Core i7-12700k
Motherboard Z690 Aero G D4
Cooling Custom loop water, 3x 420 Rad
Video Card(s) RX 7900 XTX Phantom Gaming
Storage Plextor M10P 2TB
Display(s) InnoCN 27M2V
Case Thermaltake Level 20 XT
Audio Device(s) Soundblaster AE-5 Plus
Power Supply FSP Aurum PT 1200W
Software Windows 11 Pro 64-bit
Yeah i've found that post earlier too and check this out, (from Eric Demers, architecture lead on R600)
... was he fired or something? i guess they went the "cheap" route.
http://www.beyond3d.com/content/interviews/39/5
I would have certainly fire the guy simply the R600 got its ass handed to it by the G80. :nutkick:
The engineers simply don't know what they need to make the GPU perform. :shadedshu
 

Bo_Fox

New Member
Joined
May 29, 2009
Messages
480 (0.09/day)
Location
Barack Hussein Obama-Biden's Nation
System Name Flame Vortec Fatal1ty (rig1), UV Tourmaline Confexia (rig2)
Processor 2 x Core i7's 4+Gigahertzzies
Motherboard BL00DR4G3 and DFI UT-X58 T3eH8
Cooling Thermalright IFX-14 (better than TRUE) 2x push-push, Customized TT Big Typhoon
Memory 6GB OCZ DDR3-1600 CAS7-7-7-1T, 6GB for 2nd rig
Video Card(s) 8800GTX for "free" S3D (mtbs3d.com), 4870 1GB, HDTV Wonder (DRM-free)
Storage WD RE3 1TB, Caviar Black 1TB 7.2k, 500GB 7.2k, Raptor X 10k
Display(s) Sony GDM-FW900 24" CRT oc'ed to 2560x1600@68Hz, Dell 2405FPW 24" PVA (HDCP-free)
Case custom gutted-out painted black case, silver UV case, lots of aesthetics-souped stuff
Audio Device(s) Sonar X-Fi MB, Bernstein audio riser.. what??
Power Supply OCZ Fatal1ty 700W, Iceberg 680W, Fortron Booster X3 300W for GPU
Software 2 partitions WinXP-32 on 2 drives per rig, 2 of Vista64 on 2 drives per rig
Benchmark Scores 5.9 Vista Experience Index... yay!!! What??? :)
On the other hand, the 512-bit bus did not prevent the 2900XT from getting destroyed by the G80.
That leads to AMD redesigning the R600 to RV670 or the 3870 which in many ways surpasses the R600, and the R770 further opitmized the arch.
The Cypress is simply building ontop of the R770's success.

Isn't it intresting that the HD 4670 with 128-bit GDDR3 performs close to the HD 2900 Pro?
In fact the HD 4670 surpasses the 2900XT when AA is used, the R600/RV670 simply gets destroyed in AA.

Already stated a few posts ago that R600 chip had "flawed" architecture in which the antialiasing work was offloaded onto the shader units (and there were only 320 shaders). The performance was bottlenecked by the TMU's and shaders, not the memory. Once again, to quote my words from a few posts ago, from memory, a HD2900XT was probably the only card that was not bandwidth-limited. I wish to say it with a bit more clarity.. it was the chip that was the least likely to be bandwidth-limited in any games/resolutions/settings.

ATI was just doing whatever they could with that poorly-designed R600 chip that couldn't hold a candle to an 8800GTX that was several months older. But ATI does not need to do that with a 5870, due to the lack of competition. The R600 architecture was designed with the future in mind, when the shader units would then be doubled or quadrupled in order for the architecture to make sense. It has probably damaged the reputation of 512-bit memory for a few of the enthusiasts.

Yeah i've found that post earlier too and check this out, (from Eric Demers, architecture lead on R600)
... was he fired or something? i guess they went the "cheap" route.
http://www.beyond3d.com/content/interviews/39/5

I wish I could thank you twice for this quote that proves that using 512-bit does not require much more silicon die size area. :toast:



Just as a 5770 would have done at least 20% better with 4890's bandwidth (which is 60% greater), a 5870 would do ~30% better with 512-bit bandwidth (which is 100% greater than the 256-bit one on a 5870 right now).

Actually, in the chart above, the 4890 is only 19% faster than a 5770 with identical specs (except for lower memory bandwidth), but that is also because of slight architecture optimizations with the 58xx series. So I'd add in about 3% for the optimizations and say that a 5770 would be 22% faster if it had the same bandwidth as a 4890.

Now, look at the chart below, at how increasing the memory bandwidth for a 4890 still gave consistent increases (except for Fallout3 which was probably already core-limited by either TMU's or shaders or possibly even ROP's):


source: Anandtech

That's a 4890 with the bandwith of a 5870. A chip with half the "theoretical fillrate", half the "theoretical TFLOPs", half the TMU's, half the ROP's, half the shaders, and without slight architecture optimizations, still benefited from greater bandwidth in every single game.

Imagine a 5770 with 5870's bandwidth (256-bit memory, 4.8GHz). It'd be showing more than a 22% increase for sure (plus additional 3% due to the difference between a 4890 and 5870's bandwidth as shown in the chart above).

A 5770 is exactly half of a 5870, and it still benefits from having 5870's bandwidth. Imagine how starved a 5870 chip (2x of a 5770) is for more bandwidth!



A sales pitch: Memory does matter!

By the way, overclocking the core of a 4890 up to 1GHz shows how it's even more starved for memory bandwidth, with much bigger gains across the chart when increasing the memory from 3.9Gbps to 4.8Gbps:


source: Anandtech
 
Last edited:
Joined
Nov 4, 2005
Messages
11,676 (1.73/day)
System Name Compy 386
Processor 7800X3D
Motherboard Asus
Cooling Air for now.....
Memory 64 GB DDR5 6400Mhz
Video Card(s) 7900XTX 310 Merc
Storage Samsung 990 2TB, 2 SP 2TB SSDs and over 10TB spinning
Display(s) 56" Samsung 4K HDR
Audio Device(s) ATI HDMI
Mouse Logitech MX518
Keyboard Razer
Software A lot.
Benchmark Scores Its fast. Enough.
yes, the 18 and three quarter percent increas in memory overclocking had the huge 3% maximum effect.........or mebey just the increase at timings gives the benefit.....same as DDR2 did to DDR, same as DDR3 is doing to DDR2 and is just the evolution of the beast. the "theoretical" has no place when you have hard facts to work with.


For example lets consider DDR at 400Mhz with 5.5.5.15 timings. I had a set when that was cream. It lost to DDR2 with 10.12.10.32 timgs in bandwidth just do to the speed, unless you filled it to the last row, column, and chip on the furthest stick, then it might have come close.

You must realise unless you are dealing with the last row, last column, last chip the speed increase in this case in always going to benefit latentcy/cache misses and thus the wait time for the core will be less, and has absolute shit to do with the bandwidth.

So when we divide out whatever the timings are the number of ns it takes to get data, by the increase in speed and the shorter delay we will find that the resuolt is close to the "theoretical" increase shown in the chart. And nothign to do with being able to deliver 2GB of data in one second less.
 
Last edited by a moderator:

Mussels

Freshwater Moderator
Staff member
Joined
Oct 6, 2004
Messages
58,413 (8.19/day)
Location
Oystralia
System Name Rainbow Sparkles (Power efficient, <350W gaming load)
Processor Ryzen R7 5800x3D (Undervolted, 4.45GHz all core)
Motherboard Asus x570-F (BIOS Modded)
Cooling Alphacool Apex UV - Alphacool Eisblock XPX Aurora + EK Quantum ARGB 3090 w/ active backplate
Memory 2x32GB DDR4 3600 Corsair Vengeance RGB @3866 C18-22-22-22-42 TRFC704 (1.4V Hynix MJR - SoC 1.15V)
Video Card(s) Galax RTX 3090 SG 24GB: Underclocked to 1700Mhz 0.750v (375W down to 250W))
Storage 2TB WD SN850 NVME + 1TB Sasmsung 970 Pro NVME + 1TB Intel 6000P NVME USB 3.2
Display(s) Phillips 32 32M1N5800A (4k144), LG 32" (4K60) | Gigabyte G32QC (2k165) | Phillips 328m6fjrmb (2K144)
Case Fractal Design R6
Audio Device(s) Logitech G560 | Corsair Void pro RGB |Blue Yeti mic
Power Supply Fractal Ion+ 2 860W (Platinum) (This thing is God-tier. Silent and TINY)
Mouse Logitech G Pro wireless + Steelseries Prisma XL
Keyboard Razer Huntsman TE ( Sexy white keycaps)
VR HMD Oculus Rift S + Quest 2
Software Windows 11 pro x64 (Yes, it's genuinely a good OS) OpenRGB - ditch the branded bloatware!
Benchmark Scores Nyooom.
i think your examples got messed. DDR2 was where CL5 came in, and CL10 for DDR3
 
Joined
Nov 4, 2005
Messages
11,676 (1.73/day)
System Name Compy 386
Processor 7800X3D
Motherboard Asus
Cooling Air for now.....
Memory 64 GB DDR5 6400Mhz
Video Card(s) 7900XTX 310 Merc
Storage Samsung 990 2TB, 2 SP 2TB SSDs and over 10TB spinning
Display(s) 56" Samsung 4K HDR
Audio Device(s) ATI HDMI
Mouse Logitech MX518
Keyboard Razer
Software A lot.
Benchmark Scores Its fast. Enough.
i think your examples got messed. DDR2 was where CL5 came in, and CL10 for DDR3

:toast: This is me with a few shots, just ogt back from ......uhh.......yeah.
 

Bo_Fox

New Member
Joined
May 29, 2009
Messages
480 (0.09/day)
Location
Barack Hussein Obama-Biden's Nation
System Name Flame Vortec Fatal1ty (rig1), UV Tourmaline Confexia (rig2)
Processor 2 x Core i7's 4+Gigahertzzies
Motherboard BL00DR4G3 and DFI UT-X58 T3eH8
Cooling Thermalright IFX-14 (better than TRUE) 2x push-push, Customized TT Big Typhoon
Memory 6GB OCZ DDR3-1600 CAS7-7-7-1T, 6GB for 2nd rig
Video Card(s) 8800GTX for "free" S3D (mtbs3d.com), 4870 1GB, HDTV Wonder (DRM-free)
Storage WD RE3 1TB, Caviar Black 1TB 7.2k, 500GB 7.2k, Raptor X 10k
Display(s) Sony GDM-FW900 24" CRT oc'ed to 2560x1600@68Hz, Dell 2405FPW 24" PVA (HDCP-free)
Case custom gutted-out painted black case, silver UV case, lots of aesthetics-souped stuff
Audio Device(s) Sonar X-Fi MB, Bernstein audio riser.. what??
Power Supply OCZ Fatal1ty 700W, Iceberg 680W, Fortron Booster X3 300W for GPU
Software 2 partitions WinXP-32 on 2 drives per rig, 2 of Vista64 on 2 drives per rig
Benchmark Scores 5.9 Vista Experience Index... yay!!! What??? :)
Hey Steevo, it's not about who's getting bent. You have no way of knowing what the latencies are.

Anyways, I already stated several posts ago that the latencies on an X1950XTX's GDDR4 memory was proven to be quite similar to that of an X1900XTX's GDDR3 memory, according to identical performance when the GDDR4 was downclocked to the same speed.

That's pretty much all I can say. It's not a matter of being right or wrong. It's not a matter of winning over others. It's only a matter of knowledge and understanding.. perhaps you really think that you understand the insignificance of memory bandwidth, but this is not where the truth lies. To put money where the mouth is, look at how much more an X1950XTX sold for than an X1900XTX. Would you trade your 4890 for my 5770 (if we had those cards in our possession)? No, I do not think so.

I'm just trying to pacify something that's going on here. That's all there is to it.

I mean, why are we even discussing latencies here in the first place? Why is there such a need to try as hard as possible to blame everything else but the memory bandwidth?
 

bobzilla2009

New Member
Joined
Oct 7, 2009
Messages
455 (0.09/day)
System Name Bobzilla the second
Processor AMD Phenom II 940
Motherboard Asus M3A76-CM
Cooling 3*120mm case fans
Memory 4GB 1066GHz DDR2 Kingston HyperX
Video Card(s) Sapphire Radeon HD5870 1GB
Storage Seagate 7200RPM 500GB
Display(s) samsung T220HD
Case Guardian 921
Power Supply OCZ MODXSTREAM Pro 700w (2*25A 12v rail)
Software Windows 7 Beta
Benchmark Scores 19753 3dmark06 15826 3dmark vantage 38.4Fps crysis benchmarking tool (1680x1050, 4xAA)
Right, I'm just going to do some investigations into the effect of memory overclocking on my hd5870, the core clock on the card will be set to 900MHz for all the following results, i'll update this post as i do more benchmarks.

First up is left 4 dead.
Time demo was recorded for the first level of no mercy, from the rooftop start point to entering the safe room door, framerates recorded with fraps. Any aspect of human error has been included with results. settings: 1680x1050, maximum, 8xmsaa and no vsync.

900/1200: 118fps average +/- 2
900/1250: 117fps average +/- 2
900/1300: 118fps average +/- 2

So for the first test we see no gain from memory overclocking outside the bounds of any human error that could occur. From an 8% increase, we would at least expect to see a small increase in the region of 3-4 fps in a memory bottlenecked system. It is worthwhile to note that gpu usage was around 60% at most, while cpu usage was 50% at most, so neither were pushed to the limit, therefore the l4d test should be taken with a pinch of salt.

UNIGINE DX11 BENCHMARK

moving on to one of the more extreme gpu tests for the hd5870, this benchmark pushed the gpu usage up to 100% as expected, so the memory bandwidth should become more of a factor than in the l4d test. Settings used were 1680x1050, 4xMSAA, all settings highest and tessellation on. (Vsync turned off, ofc).

900/1200: FPS 31.9 [score 803]
900/1250: FPS 32.3 [score 814]
900/1300: FPS 32.8 [score 825]

this time around we see a linear increase when increasing the memory speed. However, the increase is only in the region of around 1% in each case. If the card were truly bottlenecked, we would expect to see a more drastic increase, but as it stands it does appear that the speed of the card seems to be pretty well matched against the memory speed. Of course, that may all change as ati improve the drivers to unleash more power from the card. Comparing the card to other generations is not feasible either, if the hd4890 was bottlenecked by it's memory bus, how can the hd5870 be so much faster if it is using the same memory? the increase in bandwidth is only around 23% but the increase in overall speed is around 60-80%. The numbers just don't add up, also remember that ati aren't idiots, if they had an epic card that would be severely bottlenecked by a 256 bit bus, they wouldn't used one. The card would cost more, that's for sure, but it would also be a helluva lot faster if it was severely bottlenecked. My experimenting with memory clocks has not yielded anything to indicated such a bottleneck thus far (and i have done others as well as the ones shown here), but as stated earlier, driver improvements in the future will likely prove whether or not the bottleneck exists.

However it could be entirely likely that the memory speed has a far lesser effect at the resolutions i am testing at, especially in l4d. If that is the case, i'm not too worried about the whole thing :)
 
Last edited:
Joined
Nov 4, 2005
Messages
11,676 (1.73/day)
System Name Compy 386
Processor 7800X3D
Motherboard Asus
Cooling Air for now.....
Memory 64 GB DDR5 6400Mhz
Video Card(s) 7900XTX 310 Merc
Storage Samsung 990 2TB, 2 SP 2TB SSDs and over 10TB spinning
Display(s) 56" Samsung 4K HDR
Audio Device(s) ATI HDMI
Mouse Logitech MX518
Keyboard Razer
Software A lot.
Benchmark Scores Its fast. Enough.
WTFBBQ!!!!!!


The increase was from the marginaly faster cache miss data delivery than from the extra bandwidth. XYZOMG!!!!


No can we please discuss the new hardware the card has?


can you run s set with tessellation on and off Bob?
 
Joined
Oct 1, 2006
Messages
4,884 (0.76/day)
Location
Hong Kong
Processor Core i7-12700k
Motherboard Z690 Aero G D4
Cooling Custom loop water, 3x 420 Rad
Video Card(s) RX 7900 XTX Phantom Gaming
Storage Plextor M10P 2TB
Display(s) InnoCN 27M2V
Case Thermaltake Level 20 XT
Audio Device(s) Soundblaster AE-5 Plus
Power Supply FSP Aurum PT 1200W
Software Windows 11 Pro 64-bit
WTFBBQ!!!!!!


The increase was from the marginaly faster cache miss data delivery than from the extra bandwidth. XYZOMG!!!!


No can we please discuss the new hardware the card has?


can you run s set with tessellation on and off Bob?
You know, some people just can't get over themselves you know :p
They always think the 5870 needs more bandwidth because the nVidia cards are supposed to have more.
 
Status
Not open for further replies.
Top