• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Orochi ''Bulldozer'' Die Holds 16 MB Cache

To be honest i kinda of wish the bulldozer would come in a g34 socket (1974 pin) so that it came with quad channel ram... although i doubt it would make much different apart from benchmarks and maybe virtual machines.

What if you got much greater throughput without having to increase memory channels?
 
What if you got much greater throughput without having to increase memory channels?

Then that would be perfect, even more so with high density moduals becoming more normal as 8gb over a dual channel with a higher bandwith/throughput would be great imo.
 
I remember the "massive cache" Gallatin P4's over Northwood. Didn't make more than 5% difference clock for clock except in very special circumstances.

So let's wait for benchmarks.

gallatin has 2mb l3 but the l2 is cut in half which was only 256kb compare to northwood's 512kb. the difference in performance per clock is not increase but decrease as result. but only advantage of gallatin is the clock is very high compare to northwood's 3.06ghz limit


I would have thought there would be better gains by rethinking cache and memory entirely, possibly producing a separate socket for L3 cache just like in the old days. It would be so much cheaper to do it that way, you could easily pack 256MB cache. Yes, the latency would be worse than current on-die L3 cache, but with the space, heat and transistors saved, you could bump up L1 and L2 cache and win back any performance losses. Plus you could build your L3 cache to order.

it would be the worst scenario to do so. i still remember how terrible a 850mhz slot1 pentium iii couldn't even pace a 533mhz coppermine because of 1/3 speed cache and extremely high latency. going back to slot will be stupid just like going back from core to netburst architecture. cheap price don't mean anything when you don't even have basic performance...plus why do we need these external low performance cache if we already have high speed ram available?

First orochi is 4 module - 8 core design. Second not only the size but how fast is the cache. Third it is very important how the prediction of instructions will work, if the design is good then you dont need big L1 cache which increase cost and die size. And yes 2mb per module 1 mb per core is the amount that bulldozer will have.

the problem is that 64kb l1 instruction cache and l2 cache are uncore. that is a huge difference. it will make each of bulldozer core have theoretically only 8kb l1 cache while no l2 cache built in. it makes bulldozer quitetly different from its counterpart as intel wrap everything inside each core except pcie ctrl, memory ctrl and l3 cache. they need larger l1 cache because their l1 cache is way slower than intel's cache. and now their l1 cache on each core only 8kb. it will be hard to imagine they can outperform any intel line...

instruction prediction, same thing that intel had done long time ago when back to netburst time. such feature only work when you have ridiculous number of pipeline and a trace cache. but despite everything they had done with it they still end up performing pathetic in every benches
 
Last edited:
Very cool to see this, can't wait to see what the Bulldozer can really do. Loving my 6 core, hopefully they will have some lower priced ones, thats why I've always been a fan of AMD.
 
Shaping out to be an awesome architecture, hopefully it can actually walk above i7 while maintaining a decent price tag. If that's the case and i actually have a job by then, i definitely will be considering moving up to this.:)
 
Shaping out to be an awesome architecture, hopefully it can actually walk above i7 while maintaining a decent price tag. If that's the case and i actually have a job by then, i definitely will be considering moving up to this.:)

wait until bench come up first. but i doubt 8kb l1 cache on each core can do much of shit.......
 
What if you got much greater throughput without having to increase memory channels?

is that a hint? bulldozer could be dual channel
 
wait until bench come up first. but i doubt 8kb l1 cache on each core can do much of shit.......

I am waiting for the benchmarks most definitely.
 
wait until bench come up first. but i doubt 8kb l1 cache on each core can do much of shit.......


http://techreport.com/r.x/bulldozer-uarch/bulldozer-frontend.jpg

The module's front end includes a prediction pipeline, which predicts what instructions will be used next. A separate fetch pipeline then populates the two instruction queues—one for each thread—with those instructions. The decoders convert complex x86 instructions into the CPU's simpler internal instructions. Bulldozer has four of these, like Nehalem, while Barcelona has three.

Each module has a trio of schedulers, one for each integer core and one for the FPU.

This is from techreport and explains just fine. There is no 8kb L1 cache per core. If i am making a mistake please correct me.

And since we have JF-AMD at the forum please explain this clearly!
 
http://techreport.com/r.x/bulldozer-uarch/bulldozer-frontend.jpg

The module's front end includes a prediction pipeline, which predicts what instructions will be used next. A separate fetch pipeline then populates the two instruction queues—one for each thread—with those instructions. The decoders convert complex x86 instructions into the CPU's simpler internal instructions. Bulldozer has four of these, like Nehalem, while Barcelona has three.

Each module has a trio of schedulers, one for each integer core and one for the FPU.

This is from techreport and explains just fine. There is no 8kb L1 cache per core. If i am making a mistake please correct me.

And since we have JF-AMD at the forum please explain this clearly!

it was confirm that it would be either 8~16kb incore l1 data cache while the instruction cache is uncored. very unlikely for a typical x86 design. if we all know how slow intel's l3 cache is because it's uncored then why bulldozer put everything out of core and make each of core only has basic functions?. correct5 me about the stage pipeline in bulldozer but it seem to be unlike typical x86 design..... for what i know each prediction pipeline controls two instruction which theoretically make it 4 pipeline per core. but is it really powerful enough just use less pipeline like this? isn't it going to cost the slower clockrate per core? and how is it possible to separate stage pipeline from core and make it uncored?
 
L1 cache is not 8k. Check my blog in a week or so for the answer. There is l1 instruction shared between two cores, l1 data per core and l2 shared between 2 cores. L3 is shared at the die level
 
If am3+ socket also supports am3 chips, what do you think?

wow, if that was true and not cripple bulldozer performance than thats was great,

do you know if AMD will be release 980G chipset ?
 
I am a server guy, I don't know about client stuff.

I must admit that i love the fact that you are active here and on other forums i visit, the personal touch along with just the fact that a company has people wiling talking to the bottom end customers really makes the difference when it comes to answering questiong and proving the point of "marketing talk" so i just wated to thank you for taking the time to talk to us through multiple forums and even more so out of office hours.
 
JF needs a title ;) have u contacted the mods?
 
L1 cache is not 8k. Check my blog in a week or so for the answer. There is l1 instruction shared between two cores, l1 data per core and l2 shared between 2 cores. L3 is shared at the die level

incorrect.....the l1 instruction share by one module(2 cores) and l2 is share by two modules and l3 is share by all modules...

and about 8k l1 data....i remember i saw the spec from anandtech three months ago...however i found wiki had 16k l1 cache.....which i'd rather believe anandtech's source...
 
Hmm i wonder if they will follow intel's lead (refering to the cooler that comes with the top end i7's) by using a better cooler for the high end cpu's if they run hot, would be nice to see a better cooler than the current one's as i am not really a fan of them.

Meh... the current stock cooler/fan comes with heat pipes, 10 years ago that was unheard off... be greatfull :toast:
 
There is no such thing as a "bottom end customer". There are either customers or people who will be customers. And both are the people that pay my salary.

Thank you for correcting me,you are right and once again i'm just thankful amd employes people like you who are willing to put the effort in with the community.
 
Meh... the current stock cooler/fan comes with heat pipes, 10 years ago that was unheard off... be greatfull :toast:

Ok i admit i am greatful for he copper based hsf wih copper heatpipes.... even if i did just put it on a cpu thats 2 generation old and used my corsair h50 on the cpu the original hfs came with :p
 
Thank you for correcting me,you are right and once again i'm just thankful amd employes people like you who are willing to put the effort in with the community.

Ok i admit i am greatful for he copper based hsf wih copper heatpipes.... even if i did just put it on a cpu thats 2 generation old and used my corsair h50 on the cpu the original hfs came with :p

Tut tut... double posting, that's a no no :p

I understand where your coming from though but for any enthusiast, the stock coolers are just not enough, but then again if they was we wouldn't be very good enthusiast's would we :D
 
Tut tut... double posting, that's a no no :p

I understand where your coming from though but for any enthusiast, the stock coolers just are not enough, but then again if they was we wouldn't be very good enthusiast's would we :D

Sorry too much vodka and it beiong past 5am made me get confused too the fact i was posting within the same thread not 2 seperate ones lol.

but i suppose i can add this, i still have the old all aluminum heatsink that came iwth the athlon x2 that i am currently readying to be chopped up for other uses so i am greatful for the heatsinks that come with amd processors and am glad they are now heatpipe coolers as even with chopping them up i cam make good use of them.

(no more drunken double posting... at least tonight lol :p)
 
Back
Top