• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Orochi ''Bulldozer'' Die Holds 16 MB Cache

JF-AMD

AMD Rep (Server)
Joined
Dec 18, 2009
Messages
163 (0.03/day)
To be honest i kinda of wish the bulldozer would come in a g34 socket (1974 pin) so that it came with quad channel ram... although i doubt it would make much different apart from benchmarks and maybe virtual machines.

What if you got much greater throughput without having to increase memory channels?
 

bear jesus

New Member
Joined
Aug 12, 2010
Messages
1,534 (0.31/day)
Location
Britland
System Name Gaming temp// HTPC
Processor AMD A6 5400k // A4 5300
Motherboard ASRock FM2A75 PRO4// ASRock FM2A55M-DGS
Cooling Xigmatek HDT-D1284 // stock phenom II HSF
Memory 4GB 1600mhz corsair vengeance // 4GB 1600mhz corsair vengeance low profile
Storage 64gb sandisk pulse SSD and 500gb HDD // 500gb HDD
Display(s) acer 22" 1680x1050
Power Supply Seasonic G-450 // Corsair CXM 430W
What if you got much greater throughput without having to increase memory channels?

Then that would be perfect, even more so with high density moduals becoming more normal as 8gb over a dual channel with a higher bandwith/throughput would be great imo.
 

cheezburger

New Member
Joined
Sep 6, 2010
Messages
265 (0.05/day)
System Name no bases
Processor E8400/e5300/qx9770
Motherboard rampage formula/DG41TY/p5q DELUXE
Cooling stock DTC cooler&copper core
Memory titanium XTC DDR2 800 2gbx4/2gbx2/ballistix 2GBx4 DDR2-800
Video Card(s) evga gtx 460 oc/zotac 9600gt amp/evga gtx 580
Storage WD cavior black 2TB 16mb eSATA 2/500gb 16mb ATA133/ OCZSSD2-1ONX32G + samsung 320gb 8mb ESATA
Case cm 690/GZ-x2/antec qaudro 1200w
Power Supply antec quattro 1200w/zumax 500w v2/antec HCG 900w
Software windows server 2008 sp2/windows xp x64 pro sp2c/windows server 2008 sp1
I remember the "massive cache" Gallatin P4's over Northwood. Didn't make more than 5% difference clock for clock except in very special circumstances.

So let's wait for benchmarks.

gallatin has 2mb l3 but the l2 is cut in half which was only 256kb compare to northwood's 512kb. the difference in performance per clock is not increase but decrease as result. but only advantage of gallatin is the clock is very high compare to northwood's 3.06ghz limit


I would have thought there would be better gains by rethinking cache and memory entirely, possibly producing a separate socket for L3 cache just like in the old days. It would be so much cheaper to do it that way, you could easily pack 256MB cache. Yes, the latency would be worse than current on-die L3 cache, but with the space, heat and transistors saved, you could bump up L1 and L2 cache and win back any performance losses. Plus you could build your L3 cache to order.

it would be the worst scenario to do so. i still remember how terrible a 850mhz slot1 pentium iii couldn't even pace a 533mhz coppermine because of 1/3 speed cache and extremely high latency. going back to slot will be stupid just like going back from core to netburst architecture. cheap price don't mean anything when you don't even have basic performance...plus why do we need these external low performance cache if we already have high speed ram available?

First orochi is 4 module - 8 core design. Second not only the size but how fast is the cache. Third it is very important how the prediction of instructions will work, if the design is good then you dont need big L1 cache which increase cost and die size. And yes 2mb per module 1 mb per core is the amount that bulldozer will have.

the problem is that 64kb l1 instruction cache and l2 cache are uncore. that is a huge difference. it will make each of bulldozer core have theoretically only 8kb l1 cache while no l2 cache built in. it makes bulldozer quitetly different from its counterpart as intel wrap everything inside each core except pcie ctrl, memory ctrl and l3 cache. they need larger l1 cache because their l1 cache is way slower than intel's cache. and now their l1 cache on each core only 8kb. it will be hard to imagine they can outperform any intel line...

instruction prediction, same thing that intel had done long time ago when back to netburst time. such feature only work when you have ridiculous number of pipeline and a trace cache. but despite everything they had done with it they still end up performing pathetic in every benches
 
Last edited:

1Kurgan1

The Knife in your Back
Joined
Dec 8, 2008
Messages
10,421 (1.86/day)
Location
Duluth, Minnesota
System Name My Comp | Fiancees Comp
Processor i7 5820k @ 4.6Ghz 1.285v| i5 2500k
Motherboard MSI x99 SLI Plus | AsRock Z77 Pro 3
Cooling Watercooled
Memory 16GB DDR4 2400 @ 2666 | 12GB DDR3 1600
Video Card(s) AMD R9 290x | MSI 5850 OC
Storage 128gb SSD + 2x 2TB | 2TB
Display(s) Asus 27" LCD | 25" Hanns G
Case CM Storm | CM Elite 430
Audio Device(s) Creative Recon 3D PCIe
Power Supply Enermax Galaxy 1250W | Rosewill 630w
Mouse Logitech G700s | Logitech G100s
Keyboard Logitech G901 | Logitech G105
Software Win 8.1 Ultimate x64 | Win 8.1 Ultimate x64
Benchmark Scores 3D Mark - Fire Strike Extreme - 4403
Very cool to see this, can't wait to see what the Bulldozer can really do. Loving my 6 core, hopefully they will have some lower priced ones, thats why I've always been a fan of AMD.
 

CDdude55

Crazy 4 TPU!!!
Joined
Jul 12, 2007
Messages
8,178 (1.34/day)
Location
Virginia
System Name CDdude's Rig!
Processor AMD Athlon II X4 620
Motherboard Gigabyte GA-990FXA-UD3
Cooling Corsair H70
Memory 8GB Corsair Vengence @1600mhz
Video Card(s) XFX HD 6970 2GB
Storage OCZ Agility 3 60GB SSD/WD Velociraptor 300GB
Display(s) ASUS VH232H 23" 1920x1080
Case Cooler Master CM690 (w/ side window)
Audio Device(s) Onboard (It sounds fine)
Power Supply Corsair 850TX
Software Windows 7 Home Premium 64bit SP1
Shaping out to be an awesome architecture, hopefully it can actually walk above i7 while maintaining a decent price tag. If that's the case and i actually have a job by then, i definitely will be considering moving up to this.:)
 

cheezburger

New Member
Joined
Sep 6, 2010
Messages
265 (0.05/day)
System Name no bases
Processor E8400/e5300/qx9770
Motherboard rampage formula/DG41TY/p5q DELUXE
Cooling stock DTC cooler&copper core
Memory titanium XTC DDR2 800 2gbx4/2gbx2/ballistix 2GBx4 DDR2-800
Video Card(s) evga gtx 460 oc/zotac 9600gt amp/evga gtx 580
Storage WD cavior black 2TB 16mb eSATA 2/500gb 16mb ATA133/ OCZSSD2-1ONX32G + samsung 320gb 8mb ESATA
Case cm 690/GZ-x2/antec qaudro 1200w
Power Supply antec quattro 1200w/zumax 500w v2/antec HCG 900w
Software windows server 2008 sp2/windows xp x64 pro sp2c/windows server 2008 sp1
Shaping out to be an awesome architecture, hopefully it can actually walk above i7 while maintaining a decent price tag. If that's the case and i actually have a job by then, i definitely will be considering moving up to this.:)

wait until bench come up first. but i doubt 8kb l1 cache on each core can do much of shit.......
 
Joined
Oct 16, 2009
Messages
510 (0.10/day)
Location
UK South
System Name AMD FX
Processor AMD FX 8350 @ 4.8Ghz
Motherboard Asus Sabretooth 990FX R2.0
Cooling Corsair H100
Memory 16GB Corsair Vegance 1866
Video Card(s) AMD HD7970 Gigabyte
Storage Sandisk Extreme SSD, 500gb SG Sata
Display(s) Samsung 2333sw
Case HAF 922
Audio Device(s) Realtek HD Audio
Power Supply Thermaltake Toughpower 750w
Software Windows 10 Professional x64
What if you got much greater throughput without having to increase memory channels?

is that a hint? bulldozer could be dual channel
 

CDdude55

Crazy 4 TPU!!!
Joined
Jul 12, 2007
Messages
8,178 (1.34/day)
Location
Virginia
System Name CDdude's Rig!
Processor AMD Athlon II X4 620
Motherboard Gigabyte GA-990FXA-UD3
Cooling Corsair H70
Memory 8GB Corsair Vengence @1600mhz
Video Card(s) XFX HD 6970 2GB
Storage OCZ Agility 3 60GB SSD/WD Velociraptor 300GB
Display(s) ASUS VH232H 23" 1920x1080
Case Cooler Master CM690 (w/ side window)
Audio Device(s) Onboard (It sounds fine)
Power Supply Corsair 850TX
Software Windows 7 Home Premium 64bit SP1
wait until bench come up first. but i doubt 8kb l1 cache on each core can do much of shit.......

I am waiting for the benchmarks most definitely.
 

ROad86

New Member
Joined
Sep 24, 2010
Messages
21 (0.00/day)
Processor AMD Phenom II x4 B55
Motherboard Gigabyte MA790XT-UD4P
Cooling SilverStone Nitrogon NT06 Evolution+Noiseblocker BlackSilentPro
Memory Corsair XMS3 4GB
Video Card(s) Saphire Radeon 4870
Storage WD 640 Black + WD 500 Blue
Case Antec P193
Power Supply Corsair CMPSU-650TX
Software Win 7 Professional 64bit
wait until bench come up first. but i doubt 8kb l1 cache on each core can do much of shit.......


http://techreport.com/r.x/bulldozer-uarch/bulldozer-frontend.jpg

The module's front end includes a prediction pipeline, which predicts what instructions will be used next. A separate fetch pipeline then populates the two instruction queues—one for each thread—with those instructions. The decoders convert complex x86 instructions into the CPU's simpler internal instructions. Bulldozer has four of these, like Nehalem, while Barcelona has three.

Each module has a trio of schedulers, one for each integer core and one for the FPU.

This is from techreport and explains just fine. There is no 8kb L1 cache per core. If i am making a mistake please correct me.

And since we have JF-AMD at the forum please explain this clearly!
 

cheezburger

New Member
Joined
Sep 6, 2010
Messages
265 (0.05/day)
System Name no bases
Processor E8400/e5300/qx9770
Motherboard rampage formula/DG41TY/p5q DELUXE
Cooling stock DTC cooler&copper core
Memory titanium XTC DDR2 800 2gbx4/2gbx2/ballistix 2GBx4 DDR2-800
Video Card(s) evga gtx 460 oc/zotac 9600gt amp/evga gtx 580
Storage WD cavior black 2TB 16mb eSATA 2/500gb 16mb ATA133/ OCZSSD2-1ONX32G + samsung 320gb 8mb ESATA
Case cm 690/GZ-x2/antec qaudro 1200w
Power Supply antec quattro 1200w/zumax 500w v2/antec HCG 900w
Software windows server 2008 sp2/windows xp x64 pro sp2c/windows server 2008 sp1
http://techreport.com/r.x/bulldozer-uarch/bulldozer-frontend.jpg

The module's front end includes a prediction pipeline, which predicts what instructions will be used next. A separate fetch pipeline then populates the two instruction queues—one for each thread—with those instructions. The decoders convert complex x86 instructions into the CPU's simpler internal instructions. Bulldozer has four of these, like Nehalem, while Barcelona has three.

Each module has a trio of schedulers, one for each integer core and one for the FPU.

This is from techreport and explains just fine. There is no 8kb L1 cache per core. If i am making a mistake please correct me.

And since we have JF-AMD at the forum please explain this clearly!

it was confirm that it would be either 8~16kb incore l1 data cache while the instruction cache is uncored. very unlikely for a typical x86 design. if we all know how slow intel's l3 cache is because it's uncored then why bulldozer put everything out of core and make each of core only has basic functions?. correct5 me about the stage pipeline in bulldozer but it seem to be unlike typical x86 design..... for what i know each prediction pipeline controls two instruction which theoretically make it 4 pipeline per core. but is it really powerful enough just use less pipeline like this? isn't it going to cost the slower clockrate per core? and how is it possible to separate stage pipeline from core and make it uncored?
 

JF-AMD

AMD Rep (Server)
Joined
Dec 18, 2009
Messages
163 (0.03/day)
L1 cache is not 8k. Check my blog in a week or so for the answer. There is l1 instruction shared between two cores, l1 data per core and l2 shared between 2 cores. L3 is shared at the die level
 
W

wahdangun

Guest
If am3+ socket also supports am3 chips, what do you think?

wow, if that was true and not cripple bulldozer performance than thats was great,

do you know if AMD will be release 980G chipset ?
 

bear jesus

New Member
Joined
Aug 12, 2010
Messages
1,534 (0.31/day)
Location
Britland
System Name Gaming temp// HTPC
Processor AMD A6 5400k // A4 5300
Motherboard ASRock FM2A75 PRO4// ASRock FM2A55M-DGS
Cooling Xigmatek HDT-D1284 // stock phenom II HSF
Memory 4GB 1600mhz corsair vengeance // 4GB 1600mhz corsair vengeance low profile
Storage 64gb sandisk pulse SSD and 500gb HDD // 500gb HDD
Display(s) acer 22" 1680x1050
Power Supply Seasonic G-450 // Corsair CXM 430W
I am a server guy, I don't know about client stuff.

I must admit that i love the fact that you are active here and on other forums i visit, the personal touch along with just the fact that a company has people wiling talking to the bottom end customers really makes the difference when it comes to answering questiong and proving the point of "marketing talk" so i just wated to thank you for taking the time to talk to us through multiple forums and even more so out of office hours.
 
Joined
Oct 8, 2009
Messages
2,047 (0.39/day)
Location
Republic of Texas
Processor R9 5950x
Motherboard Asus x570 Crosshair VIII Formula
Cooling EK 360mm AIO D-RGB
Memory G.Skill Trident Z Neo 2x16gb (CL16@3800MHz)
Video Card(s) PNY GeForce RTX 3090 24GB
Storage Samsung 970 EVO Plus 1TB NVMe | Intel 660p 2TB NVMe
Display(s) Acer Predator XB323QK 4K 144Hz
Case Corsair 5000D Airflow
Audio Device(s) Objective2 Amp/DAC | GoXLR | AKG K612PRO | Beyerdynamic DT880| Rode Pod Mic
Power Supply Corsair AX 850w
Mouse Razer DeathAdder Elite V2
Keyboard Corsair K95 Platinum RGB "Cherry MX Brown"
VR HMD Oculus Rift
Software Window 11 Pro
JF needs a title ;) have u contacted the mods?
 

cheezburger

New Member
Joined
Sep 6, 2010
Messages
265 (0.05/day)
System Name no bases
Processor E8400/e5300/qx9770
Motherboard rampage formula/DG41TY/p5q DELUXE
Cooling stock DTC cooler&copper core
Memory titanium XTC DDR2 800 2gbx4/2gbx2/ballistix 2GBx4 DDR2-800
Video Card(s) evga gtx 460 oc/zotac 9600gt amp/evga gtx 580
Storage WD cavior black 2TB 16mb eSATA 2/500gb 16mb ATA133/ OCZSSD2-1ONX32G + samsung 320gb 8mb ESATA
Case cm 690/GZ-x2/antec qaudro 1200w
Power Supply antec quattro 1200w/zumax 500w v2/antec HCG 900w
Software windows server 2008 sp2/windows xp x64 pro sp2c/windows server 2008 sp1
L1 cache is not 8k. Check my blog in a week or so for the answer. There is l1 instruction shared between two cores, l1 data per core and l2 shared between 2 cores. L3 is shared at the die level

incorrect.....the l1 instruction share by one module(2 cores) and l2 is share by two modules and l3 is share by all modules...

and about 8k l1 data....i remember i saw the spec from anandtech three months ago...however i found wiki had 16k l1 cache.....which i'd rather believe anandtech's source...
 
Joined
Mar 6, 2010
Messages
2,188 (0.42/day)
Location
England
Hmm i wonder if they will follow intel's lead (refering to the cooler that comes with the top end i7's) by using a better cooler for the high end cpu's if they run hot, would be nice to see a better cooler than the current one's as i am not really a fan of them.

Meh... the current stock cooler/fan comes with heat pipes, 10 years ago that was unheard off... be greatfull :toast:
 

bear jesus

New Member
Joined
Aug 12, 2010
Messages
1,534 (0.31/day)
Location
Britland
System Name Gaming temp// HTPC
Processor AMD A6 5400k // A4 5300
Motherboard ASRock FM2A75 PRO4// ASRock FM2A55M-DGS
Cooling Xigmatek HDT-D1284 // stock phenom II HSF
Memory 4GB 1600mhz corsair vengeance // 4GB 1600mhz corsair vengeance low profile
Storage 64gb sandisk pulse SSD and 500gb HDD // 500gb HDD
Display(s) acer 22" 1680x1050
Power Supply Seasonic G-450 // Corsair CXM 430W
There is no such thing as a "bottom end customer". There are either customers or people who will be customers. And both are the people that pay my salary.

Thank you for correcting me,you are right and once again i'm just thankful amd employes people like you who are willing to put the effort in with the community.
 

bear jesus

New Member
Joined
Aug 12, 2010
Messages
1,534 (0.31/day)
Location
Britland
System Name Gaming temp// HTPC
Processor AMD A6 5400k // A4 5300
Motherboard ASRock FM2A75 PRO4// ASRock FM2A55M-DGS
Cooling Xigmatek HDT-D1284 // stock phenom II HSF
Memory 4GB 1600mhz corsair vengeance // 4GB 1600mhz corsair vengeance low profile
Storage 64gb sandisk pulse SSD and 500gb HDD // 500gb HDD
Display(s) acer 22" 1680x1050
Power Supply Seasonic G-450 // Corsair CXM 430W
Meh... the current stock cooler/fan comes with heat pipes, 10 years ago that was unheard off... be greatfull :toast:

Ok i admit i am greatful for he copper based hsf wih copper heatpipes.... even if i did just put it on a cpu thats 2 generation old and used my corsair h50 on the cpu the original hfs came with :p
 
Joined
Mar 6, 2010
Messages
2,188 (0.42/day)
Location
England
Thank you for correcting me,you are right and once again i'm just thankful amd employes people like you who are willing to put the effort in with the community.

Ok i admit i am greatful for he copper based hsf wih copper heatpipes.... even if i did just put it on a cpu thats 2 generation old and used my corsair h50 on the cpu the original hfs came with :p

Tut tut... double posting, that's a no no :p

I understand where your coming from though but for any enthusiast, the stock coolers are just not enough, but then again if they was we wouldn't be very good enthusiast's would we :D
 

bear jesus

New Member
Joined
Aug 12, 2010
Messages
1,534 (0.31/day)
Location
Britland
System Name Gaming temp// HTPC
Processor AMD A6 5400k // A4 5300
Motherboard ASRock FM2A75 PRO4// ASRock FM2A55M-DGS
Cooling Xigmatek HDT-D1284 // stock phenom II HSF
Memory 4GB 1600mhz corsair vengeance // 4GB 1600mhz corsair vengeance low profile
Storage 64gb sandisk pulse SSD and 500gb HDD // 500gb HDD
Display(s) acer 22" 1680x1050
Power Supply Seasonic G-450 // Corsair CXM 430W
Tut tut... double posting, that's a no no :p

I understand where your coming from though but for any enthusiast, the stock coolers just are not enough, but then again if they was we wouldn't be very good enthusiast's would we :D

Sorry too much vodka and it beiong past 5am made me get confused too the fact i was posting within the same thread not 2 seperate ones lol.

but i suppose i can add this, i still have the old all aluminum heatsink that came iwth the athlon x2 that i am currently readying to be chopped up for other uses so i am greatful for the heatsinks that come with amd processors and am glad they are now heatpipe coolers as even with chopping them up i cam make good use of them.

(no more drunken double posting... at least tonight lol :p)
 
Top