AMD to Cough Up $12.1 Million to Settle "Bulldozer" Core Count Class-Action Lawsuit

Vya Domus · Aug 29, 2019

A core has cores.
A processor has processors.

Something ain't right. This bizzare recursion needs to be addressed somehow.

An if statement is just a conditional jump that only neends an ALU to work, or an execution core. There is nothing special that is required and that can only be found in some other type of core.

Moreover, the execution of every instruction no matter how complex, can be driven pretty much exclusively by lookup tabels.

You can have a memory where every adress corresponds to an instruction and the value at that address represents the control signals required for it's execution. From then on you just need the ALU and wiring.

Of course actual CPUs aren't really implemented like this but this proves everything else other than the execution core is redundant. That's the only dinamic bit required for a processor.

GreiverBlade · Aug 30, 2019

previously to that settle matter still they defined what is a core ... which is kinda confusing ... well a core is a core be it execution, cuda, tensor, stream or pure simple core...
\
point of view.

jaggerwild · Aug 30, 2019

That's a lot of Ryzen CPU'S 12.1 mil ugh!

seronx · Aug 30, 2019

jaggerwild said:
That's a lot of Ryzen CPU'S 12.1 mil ugh!

Or around FX's cheapest, which is at least 121,000 native eight-core CPUs.

Keviny Oliveira · Aug 31, 2019

It's clear who will win in this process are the lawyers besides he actually has eight cores but were a grambiarra with modular cores, and before the Intel fanboys comment something stupid remember that the Core 2 Quad were two glued Core 2 Duo, but because Windows Scheduler is bad on AMD's processors, it says 4/8, nothing more stupid on Microsoft's part.

FordGT90Concept · Sep 1, 2019

Keviny Oliveira said:
...Core 2 Quad were two glued Core 2 Duo...

Core 2 Duo was a legit module containing two complete processors (aka cores). Core 2 Quad was a multi-chip module where two Core 2 Duo modules were attached to the same wafer exposing a total of four complete processors on the front-side bus.

Here's what the individual dies look like ("core replication is obvious"):

In the case of Bulldozer, each "module" only contained one complete processor (aka core). That's why in the literature, it's called a "conjoined-core."

Kumar et al said:
This paper proposes conjoined-core chip multiprocessing – topologically feasible resource sharing between adjacent cores of a chip multiprocessor to reduce die area with minimal impact on performance and hence improving the overall computational efﬁciency

"conjoined-core" is referring to "chip"-level "core" which is synonymous with "processor" consistent with Pentium D and Athlon 64 X2 (which were out at the time).
"adjacent cores" is referring to execution "cores" which share resources in a "conjoined-core." These do not qualify as "processors."

Vya Domus · Sep 1, 2019

They totally qualify as cores and in the papers the authors refer to the arrangement as being a pair of cores, as in two.

They don't mean two execution cores or waffles or anything else, they mean just two cores.

FordGT90Concept · Sep 1, 2019

Sure they do, further in, have another quote:

Conjoined-core chip multiprocessing deviates from a conventional chip multiprocessor design by sharing selected hardware structures between adjacent cores to improve processor efﬁciency.

They're using two different definitions of "core" interchangeably.

"Conjoined-core" refers to this:

What is a Core?

Computer dictionary definition of what core means, including related links, information, and terms.

www.computerhope.com

A core is part of a CPU that receives instructions and performs calculations, or actions, based on those instructions.

"Adjacent cores" refers to this:

Discussion 10: Execution Unit
The execution unit contains the data registers and the ALU.

The use of the phrase "execution core" is *rare* outside of conjoined-core literature.

So you see the problem? Bulldozer "execution cores" lack the hardware to decode AMD64 instructions which is a function of the "core" (aka processor). "Execution cores" as defined in Bulldozer lack the hardware necessary to be considered a "core:" they are merely "execution units." ...and these are the wheels the turn the gears of false advertising.

Vya Domus · Sep 1, 2019

This is purely an invention of yours, as are most of your arguments.

They simply do not ever make a distinction between the kinds of cores that they are talking about because they don't have to, a core is a core in any circumstance. "Adjacent" simply refers to the pair of cores that share the resources, nothing less nothing more.

Have these quotes in which it's crystal clear what they mean by those adjacent cores in relation to the traditional cores :

Wires connecting the FPU to the left core and the right core can be interdigitated, so no additional horizontal wiring tracks are required

"Connecting the FPU to the left and right core.". An FPU classifies as an execution core, clearly they don't mean it's shared between other execution cores.

A core can alternate accesses between the two banks. It can fetch 4 instructions every cycle but only if their desired bank is available. A core has access to bank 0 one cycle, bank 1 the next, etc., with the other core having the opposite allocation

There you go, each core fetches instructions, in an alternating fashion. It cannot get any more obvious that this, they mean cores as in not execution cores. An execution core can't fetch instructions on it's own.

You are simply wrong, end of story.

FordGT90Concept · Sep 1, 2019

"Conjoined-core" is never plural. Think of another context where "conjoined" is commonly used: "conjoined-twins." Note "twins" is plural because they are, in fact, separate entities but they both share a birth defect: being joined to each other.

If the other's intent was truly to say monolithic-core and conjoined-core were indistinguishable, they would have used the plural form of core: "cores." They do not, because they're not independent processors; they are in fact very dependent on each other. The two combined, therefore, make an indivisible new entity: a conjoined-core.

"left and right core" are referring to execution units, not the whole "conjoined-core."

"A core can alternate accesses between the two banks" is referring to the "conjoined-core" where the "two banks" are the "execution units."

As I said, and you just demonstrated again, the article is using two definitions of "core" interchangeably. It's a technical document that assumes the reader will understand the difference.

Vya Domus · Sep 1, 2019

Execution cores can't fetch instructions. They mean fully functional cores, if they meant one core, then who is the other core that they are talking about ?

You are out of touch with the technical aspects of this papers.

FordGT90Concept · Sep 1, 2019

Vya Domus said:
An execution core can't fetch instructions on it's own.

And this is where Bulldozer is hilarious: there's actually two types of instructions:
1) x86 which is what the "conjoined-core" exposes to the system.
2) microOPs which is what the "adjacent cores" process and aren't directly accessible.

They both fetch their respective instructions. This is probably why they love using two meanings of "core." But only one of them matters to the public.

Not that it matters. In Steamroller, they split instruction decode too but the "module" is still a "conjoined-core" sharing resources--aka a "core" (not plural).

Remember how Sun designed a conjoined-core on steroids? Why do you think they never released it? My guess: poor performance like AMD saw. Even after four generations of conjoined-core designs, AMD abandoned it entirely. Sun's chip likely had the same problems AMD's chip did, but four fold, because they shared a crapload more than AMD did. There was no market for a chip that performs that badly, so they never launched it. The cost to support it (hardware platforms and software) would have compounded the losses.

Vya Domus · Sep 1, 2019

And here you end up contradicting yourself.

You've battled for the last couple of pages to prove execution cores can't be cores because they are just "glorified calculators". But now what do you know, turns out a calculator can even fetch instructions from memory, hmm.

FordGT90Concept said:
A "core" relies on nothing other than memory subsystems to carry out instructions.

It's settled, they are cores.

FordGT90Concept · Sep 1, 2019

How can they calculate if they have no data? Point is, microOPs afford very little capability; hence, glorified calcultors.

Anyway, the x86 decoder (as like all processors), hands the microOPs to the execution units on a silver platter known as L1 Instruction Cache. You know where I'm going with this.

Vya Domus · Sep 1, 2019

Everything a processor executes consists of microOPs. Either everything is a glorified calculator or nothing is.

FordGT90Concept · Sep 1, 2019

The Bulldozer "execution unit" is incapable of processing FADD. There's different types of execution units and the processor (aka "core") has to make sure the appropriate data gets to the appropriate unit then collates the results.

"conjoined-core" is very, very different from "adjacent cores."

Vya Domus · Sep 1, 2019

You are clutching at straws with what a Bullzdozer core can or can't do. It's no question that it's capabilities are more limited compared to a conventional core but it's a core nonetheless, it can fetch, decode and execute instructions on it's own. If any of those stages are blocked by another core, it's a different matter but the two are very much obvious distinct entities.

FordGT90Concept · Sep 1, 2019

AMD disagrees with your assessment:

Vya Domus · Sep 1, 2019

Well, for one there aren't two execution units, there are four. Two for integer, two for floating point and they can be driven independently by two threads with limitations.

That makes it a dual core.

FordGT90Concept · Sep 1, 2019

Now you're confusing execution units for components of them (ALUs, AGUs, MMXs, and FMACs). More detailed slide:

Vya Domus · Sep 1, 2019

You literally have it spelled out for you mate.

Dual 128-bit FMAC pipes.

Plus the two integer clusters, four. Four execution units, two for integer, two for floating point.

If you want to brake them down fine, you'd have :

- 2x two ALUs
- 2x two AGUs
- 2x 128-bit FP units

But they are grouped like that for a reason, because each integer cluster can be used by one thread and the two FP units can either be shared or used by one thread in the case of 256-bit instructions.

FordGT90Concept · Sep 1, 2019

Look at the picture again. These are pipelines which are part of the execution units (two integer, one floating point):
4 ALUs (EX/MUL pipeline + EX/DIV pipeline * 2)
4 AGUs (AGen pipeline * 4)
2 128-bit MMX pipelines
2 128-bit FMAC pipelines

That's a total of 12 pipelines for each Bulldozer conjoined-core. Each thread has 4 pipelines (2 x ALU + 2 x AGU) dedicated to it. When counting the FPU, pipeline usage can expand up to 8 when performing an AVX + 2 MMX instruction. In these instances, the other thread is deprived of progress on FPU tasks.

Still don't know why you insist on carrying on with this train of thought: the decoder and fetcher in Bulldozer is undeniably shared and "cores" don't share logic. It's a "conjoined core" which means the whole of it is a "core," not specific components as AMD would have you believe. AMD intentionally called the execution units "cores" to mislead the public in respect to its performance (overselling the capabilities of its product).

Vya Domus · Sep 1, 2019

I am looking and I see 4 groups, two for integer, two for floating point. This is better illustrated here, one blue block, one green and two yellow. That's the higher level grouping of these execution units.

The problem here is that you are getting confused because your definitions of what is an execution core or whatever fall into a strange twilight zone. It's neither a core nor an ALU, the only thing left it's a collection of ALUs/FPUs of which a Bulldozer module has 4.

Everyone either thinks in terms of cores or execution units (ALUs or FPUs). You are making this unnecessarily difficult in your pursuit of differentiating cores from anything else.

FordGT90Concept said:
Still don't know why you insist on carrying on with this train of thought: the decoder and fetcher in Bulldozer is undeniably shared and "cores" don't share logic.

Because even though logic is shared multiple instructions end up being processed. That's the whole point, get work done with less logic.

FordGT90Concept · Sep 1, 2019

Vya Domus said:
I am looking and I see 4 groups, two for integer, two for floating point. This is better illustrated here, one blue block, one green and two yellow. That's the higher level grouping of these execution units.

View attachment 130550

I see four cores as clearly indicated by fetchers and decoders.

Oh look, Zen looks similar:

Look at the text below the diagram: AMD is referring the whole (from Fetch to L2) as the core (not just the integer execution unit). AMD doesn't get to change the rules for its own advantage on Bulldozer. It was well understood what a "core" was before and after Bulldozer debuted.

Oh look! Zen even has 2 x 256-bit FMACs + 1 x MMX per core! Gee, I wonder why Bulldozer gets dragged through the mud for being pokey. Maybe it's because AMD *really* skimped on floating-point performance in the name of supporting more integer-heavy threads? Considering Zen's design, it's clear AMD believed this was a mistake in Bulldozer.

Vya Domus said:
The problem here is that you are getting confused because your definitions of what is an execution core or whatever fall into a strange twilight zone. It's neither a core nor an ALU, the only thing left it's a collection of ALUs/FPUs of which a Bulldozer module has 4.

These phrases are not my own. They're phrases used in different literature to describe the same circuits. Why I keep changing phrasing is to stay consistent with the sourced documents. To be perfectly clear: "integer cluster" = "execution core" = "adjacent core" which is not to be confused with the singular "core" which is synonymous with "processor."

The best way to describe Bulldozer is thusly:
FX-8350 is a quad-core processor with each core accepting two threads. The integer payload of each thread is executed by a dedicated integer cluster while the floating-point payload is handed off to the shared floating-point cluster. The result of this design is accelerated performance in multi-threaded, integer-heavy scenarios like 7-zip compression; however, any workload that strains the processor cores' shared resources (like AVX), performance tanks.

Vya Domus · Sep 1, 2019

FordGT90Concept said:
I see four cores as clearly indicated by fetchers and decoders.

I see eight cores, each pair of two cores sharing some fetch and decode logic. You can see in the picture posted by yourself that the module has 4 decode units, enough to feed two independent threads, at the very least, and enough execution units to be driven by them.

FordGT90Concept said:
The result of this design is accelerated performance in multi-threaded, integer-heavy scenarios like 7-zip compression; however, any workload that strains the processor cores' shared resources (like AVX), performance tanks.

Again, it's irrelevant how performance tanks or doesn't. CPUs behaved differently because of the way they used resources all throughout history, the first Pentium that had MMX suffered from major performance degradation in other workloads when MMX was used because it would stall other pipelines.

System Name	Good enough
Processor	AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard	ASRock B650 Pro RS
Cooling	2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory	32GB - FURY Beast RGB 5600 Mhz
Video Card(s)	Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage	1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s)	LG UltraGear 32GN650-B + 4K Samsung TV
Case	Phanteks NV7
Power Supply	GPS-750C

System Name	main/SFFHTPCARGH!(tm)/Xiaomi Mi TV Stick/Samsung Galaxy S25/Ally
Processor	Ryzen 7 5800X3D/i7-3770/S905X/Snapdragon 8 Elite/Ryzen Z1 Extreme
Motherboard	MSI MAG B550 Tomahawk/HP SFF Q77 Express/uh?/uh?/Asus
Cooling	Enermax ETS-T50 Axe aRGB /basic HP HSF /errr.../oh! liqui..wait, no:sizable vapor chamber/a nice one
Memory	64gb DDR4 3600/8gb DDR3 1600/2gbLPDDR3/12gbLPDDR5x/16gb(10 sys)LPDDR5 6400
Video Card(s)	Hellhound Spectral White RX 7900 XTX 24gb/GT 730/Mali 450MP5/Adreno 830/Radeon 780M 6gb LPDDR5
Storage	250gb870EVO/500gb860EVO/2tbSandisk/NVMe2tb+1tb/4tbextreme V2/1TB Arion/500gb/8gb/512gb/4tb SN850X
Display(s)	X58222 32" 2880x1620/32"FHDTV/273E3LHSB 27" 1920x1080/6.67"/LTPO AMOLED panel FHD+120hz/7" FHD 120hz
Case	Cougar Panzer Max/Elite 8300 SFF/None/Gorilla Glass Victus 2/front-stock back-JSAUX RGB transparent
Audio Device(s)	Logi Z333/SB Audigy RX/HDMI/HDMI/Dolby Atmos/CVJ NightElf/Moondrop Chu II+BT20S/Nekocake GfL QBZ-191
Power Supply	Chieftec Proton BDF-1000C /HP 240w/12v 1.5A/USAMS GAN PD 33w/USAMS GAN 100w
Mouse	Speedlink Sovos Vertical-Asus ROG Spatha-Logi Ergo M575/Xiaomi XMRM-006/touch/touch
Keyboard	Endorfy Thock 75%/Lofree Edge/none/touch/virtual
VR HMD	Medion Erazer
Software	Win10 64/Win8.1 64/Android TV 8.1/Android 14/Win11 64
Benchmark Scores	bench...mark? i do leave mark on bench sometime, to remember which one is the most comfortable. :o

Processor	5930K
Motherboard	MSI X99 SLI
Cooling	WATER
Memory	16GB DDR4 2132
Video Card(s)	EVGAY 2070 SUPER
Storage	SEVERAL SSD"S
Display(s)	Catleap/Yamakasi 2560X1440
Case	D Frame MINI drilled out
Audio Device(s)	onboard
Power Supply	Corsair TX750
Mouse	DEATH ADDER
Keyboard	Razer Black Widow Tournament
Software	W10HB
Benchmark Scores	PhIlLyChEeSeStEaK

System Name	SolarwindMobile
Processor	AMD FX-9800P RADEON R7, 12 COMPUTE CORES 4C+8G
Motherboard	Acer Wasp_BR
Cooling	It's Copper.
Memory	2 x 8GB SK Hynix/HMA41GS6AFR8N-TF
Video Card(s)	ATI/AMD Radeon R7 Series (Bristol Ridge FP4) [ACER]
Storage	TOSHIBA MQ01ABD100 1TB + KINGSTON RBU-SNS8152S3128GG2 128 GB
Display(s)	ViewSonic XG2401 SERIES
Case	Acer Aspire E5-553G
Audio Device(s)	Realtek ALC255
Power Supply	PANASONIC AS16A5K
Mouse	SteelSeries Rival
Keyboard	Ducky Channel Shine 3
Software	Windows 10 Home 64-bit (Version 1607, Build 14393.969)

System Name	BY-2021
Processor	AMD Ryzen 7 5800X (65w eco profile)
Motherboard	MSI B550 Gaming Plus
Cooling	Scythe Mugen (rev 5)
Memory	2 x Kingston HyperX DDR4-3200 32 GiB
Video Card(s)	AMD Radeon RX 7900 XT
Storage	Samsung 980 Pro, Seagate Exos X20 TB 7200 RPM
Display(s)	Nixeus NX-EDG274K (3840x2160@144 DP) + Samsung SyncMaster 906BW (1440x900@60 HDMI-DVI)
Case	Coolermaster HAF 932 w/ USB 3.0 5.25" bay + USB 3.2 (A+C) 3.5" bay
Audio Device(s)	Realtek ALC1150, Micca OriGen+
Power Supply	Enermax Platimax 850w
Mouse	Nixeus REVEL-X
Keyboard	Tesoro Excalibur
Software	Windows 10 Home 64-bit
Benchmark Scores	Faster than the tortoise; slower than the hare.

AMD to Cough Up $12.1 Million to Settle "Bulldozer" Core Count Class-Action Lawsuit

Vya Domus

GreiverBlade

jaggerwild

seronx

Keviny Oliveira

New Member

FordGT90Concept

"I go fast!1!11!1!"

Vya Domus

FordGT90Concept

"I go fast!1!11!1!"

What is a Core?

Vya Domus

FordGT90Concept

"I go fast!1!11!1!"

Vya Domus

FordGT90Concept

"I go fast!1!11!1!"

Vya Domus

FordGT90Concept

"I go fast!1!11!1!"

Vya Domus

FordGT90Concept

"I go fast!1!11!1!"

Vya Domus

FordGT90Concept

"I go fast!1!11!1!"

Vya Domus

FordGT90Concept

"I go fast!1!11!1!"

Vya Domus

FordGT90Concept

"I go fast!1!11!1!"

Vya Domus

FordGT90Concept

"I go fast!1!11!1!"

Vya Domus