
Your opinion - how much IPC improvement could be squeezed out of a new CPU gen if normal limits are removed?

qubit

Overclocked quantum bit
Joined
Dec 6, 2007
Messages
17,865 (2.79/day)
Location
Quantum Well UK
System Name Quantumville™
Processor Intel Core i7-2700K @ 4GHz
Motherboard Asus P8Z68-V PRO/GEN3
Cooling Noctua NH-D14
Memory 16GB (2 x 8GB Corsair Vengeance Black DDR3 PC3-12800 C9 1600MHz)
Video Card(s) MSI RTX 2080 SUPER Gaming X Trio
Storage Samsung 850 Pro 256GB | WD Black 4TB | WD Blue 6TB
Display(s) ASUS ROG Strix XG27UQR (4K, 144Hz, G-SYNC compatible) | Asus MG28UQ (4K, 60Hz, FreeSync compatible)
Case Cooler Master HAF 922
Audio Device(s) Creative Sound Blaster X-Fi Fatal1ty PCIe
Power Supply Corsair AX1600i
Mouse Microsoft Intellimouse Pro - Black Shadow
Keyboard Yes
Software Windows 10 Pro 64-bit
The article below on an apparent IPC improvement of 10-15% for Zen 2 got me thinking of the following hypothetical scenario.

Today's x86 CPUs are pretty well optimized for IPC, with large caches, out-of-order execution, etc. So, how much further do you think Intel or AMD could improve IPC if they went all out to absolutely maximise it, i.e. if cost, power use, wafer yield, etc. didn't matter?

Do you think 50-100%, or maybe even more might be possible? Perhaps we're reaching the point of diminishing returns and it won't improve much more regardless of what resources are thrown at it? I have no idea, just throwing this out there.

EDIT: IPC = Instructions Per Clock.

https://wccftech.com/amds-zen-2-ipc-uplift-will-be-in-the-10-15-range
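To make the metric in the EDIT above concrete, here's a minimal sketch of what IPC is, using made-up counter values (these are illustrative numbers, not measurements from any real chip):

```python
# Toy illustration of the IPC metric: instructions retired per clock cycle.
# The counter values below are invented for the example, not real measurements.

def ipc(instructions_retired: int, clock_cycles: int) -> float:
    """Instructions Per Clock = retired instructions / elapsed core cycles."""
    return instructions_retired / clock_cycles

# e.g. 8 billion instructions retired over 4 billion cycles -> IPC of 2.0
print(ipc(8_000_000_000, 4_000_000_000))  # 2.0

# A hypothetical 12% IPC uplift finishes the same work at the same clock
# in proportionally fewer cycles:
uplift = 1.12
print(round(8_000_000_000 / (4_000_000_000 / uplift), 2))  # 2.24
```

In practice these counters come from hardware performance monitoring (e.g. `perf stat` on Linux reports instructions and cycles, and their ratio).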
 
Last edited:
I'm not that tech literate, but I think frequency scaling is getting very difficult, and the recent security scares have dialled back the IPC gained from 'predictive' (speculative) tricks. I think we need to move away from silicon as we know it to massively ramp up IPC.

But, like I say, I am not knowledgeable...
 
The good thing is that IPC isn't as important as it was 10 years ago, and this trend will continue as new games use DX12 like they should. More cores is the trend now, and IPC will be just another footnote.
 
I'm not that tech literate, but I think frequency scaling is getting very difficult, and the recent security scares have dialled back the IPC gained from 'predictive' (speculative) tricks. I think we need to move away from silicon as we know it to massively ramp up IPC.

But, like I say, I am not knowledgeable...
Yes, and I don't believe Zen 2 will get to 15% higher IPC than PR (Pinnacle Ridge); on average I'd expect no more than a 10% improvement over SR (Summit Ridge) after accounting for the Spectre fixes. With GlobalFoundries' 7nm, I expect AMD to be able to reach 5GHz on desktop chips, even if only as single-core turbo or overclocked speeds, one or the other.

Moving away from silicon to gain IPC: do you mean frequency?
 
The good thing is that IPC isn't as important as it was 10 years ago, and this trend will continue as new games use DX12 like they should. More cores is the trend now, and IPC will be just another footnote.

The thing is, without proper tools, coding for multi-threading is quite complex, and as long as there's a large installed base of DX11-only GPUs, devs can't make a game that supports only DX12 (or Vulkan).

So IPC is still important, in the sense that even if you have a Threadripper, a Core i3 with higher IPC will nuke its Ryzen counterpart in heavily single-threaded games and apps.

But yeah, it's less important than back in the day, though still relevant.
 
Ryzen has cache performance issues. Ryzen 2 focuses on addressing that. 10-15% is reasonable because the cache really is that bad in first-gen Ryzen.
 
Ryzen has cache performance issues. Ryzen 2 focuses on addressing that.
Ryzen is really a good card for AMD; there's still plenty of room left for improvement on the IPC side, and I think they've already mastered the core-count part.

EDIT: Look @qubit at the next incoming article from TPU! :p What a coincidence.
EDIT2: Can you TL;DR the PM @eidairaman1? :)
 
Last edited:
Check your PMs @qubit for my thoughts.
 
I agree with the comment that we need to move away from silicon. The gates are already getting too small. The smaller the gate (gap), the easier it is for the voltage to jump (arc) across it. And the more gates you jam into the same space, the smaller each gate becomes.

The problem becomes: how do you push enough voltage into the IC while keeping that voltage low enough that it doesn't arc across gates? The smaller the gate, the lower the voltage must be to prevent arcing. Not an easy task at that scale without superconductors.

Affordable zero-resistance conductors need to become a reality before we can go much further.
 
Ryzen was about 30-40% faster than the FX series IIRC, so that would be about the max we could ever expect.

10-20% seems logical between generations when there's competition, 5-10% when there's not.
 
The good thing is that IPC isn't as important as it was 10 years ago, and this trend will continue as new games use DX12 like they should. More cores is the trend now, and IPC will be just another footnote.
I still think IPC is the most important metric for CPU performance, because everything else hangs off it; it's like raw power in a drag race. Imagine a CPU with just a 5% IPC advantage over another: spread over many cores, that advantage multiplies into a significant aggregate gain. No wonder Intel is scrambling to head off AMD, as they understand this better than anyone, and with AMD offering so many cores on top of this, Intel has all the more reason to worry.
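One way to put numbers on the "spread over many cores" point above (all figures invented for illustration; the relative edge per core stays 5%, but the absolute throughput gap grows with core count):

```python
# Back-of-the-envelope: a per-core IPC edge, spread across many cores, becomes
# a growing absolute throughput gap (the relative edge stays the same).
# All numbers are illustrative, not benchmarks of any real CPU.

def aggregate_throughput(ipc: float, cores: int, ghz: float) -> float:
    """Aggregate throughput in billions of instructions/s, assuming perfect scaling."""
    return ipc * cores * ghz

base  = aggregate_throughput(ipc=2.0, cores=16, ghz=4.0)  # 128.0 GIPS
plus5 = aggregate_throughput(ipc=2.1, cores=16, ghz=4.0)  # 134.4 GIPS

print(plus5 - base)      # absolute gap: ~6.4 billion extra instructions/s
print(plus5 / base - 1)  # relative gap: still ~5%
```

The "perfect scaling" assumption is of course generous; contention and serial sections eat into it, as later posts in the thread point out.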
 
It's entirely dependent on the application. The IPC of some types of computation can be improved immensely, while others are close to the limit with the tech we have.

Imagine one core with the register width and ALUs of a quad core: would it do four times the math per clock? Possibly, but it might bottleneck in some scenarios. There is a limit, and it's presently defined by register sizes, IMHO.

A new computing platform might overcome this, such as HP's in-memory computing behemoth.
 
Check your PMs @qubit for my thoughts.

Too many curse words and tentacle-porn references for public posting?

I've no idea, but I highly doubt >50% of what we have now.
 
Too many curse words and tentacle-porn references for public posting?

I've no idea, but I highly doubt >50% of what we have now.

For 1, I do not look at porn; 2, I stopped cursing; 3, you don't know me, so stop assuming you do; 4, since you have nothing constructive to say, go elsewhere instead of insulting a member.

PS: You have been on my ignore list for several years now. You will remain there.
 
Last edited:
Don't take his comments personally @eidairaman1, I'm pretty sure his comment was a light-hearted jest not aimed at you specifically.
 
Don't take his comments personally @eidairaman1, I'm pretty sure his comment was a light-hearted jest not aimed at you specifically.

He has taken jabs at me in the past, so yeah sure "rolling eyes"
 
Leave the drama be, guys; just leave it alone and keep this on topic.
 
Wow, I get a down vote for trying to be the good guy here. :shadedshu:
 
Ryzen was about 30-40% faster than the FX series IIRC, so that would be about the max we could ever expect.

10-20% seems logical between generations when there's competition, 5-10% when there's not.

According to the slides when Ryzen launched, Ryzen had a 52% IPC increase over the Excavator architecture (not Piledriver).

The reason AMD managed this impressive feat is simply that Excavator's IPC was that bad.

10%-20% between generations seems too much, even with competition: I'd say 5%-12%, tops.
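Even the modest per-generation ranges discussed above add up quickly once you compound them. A quick sketch (the percentages are the ranges suggested in this thread, not vendor roadmap figures):

```python
# How modest per-generation IPC gains compound over several generations.
# The per-generation percentages come from the discussion above, nothing official.

def compounded_gain(per_gen: float, generations: int) -> float:
    """Total relative IPC uplift after compounding per-generation gains."""
    return (1 + per_gen) ** generations - 1

# Five generations at 5% vs 12% per generation:
print(round(compounded_gain(0.05, 5) * 100, 1))  # 27.6 (%)
print(round(compounded_gain(0.12, 5) * 100, 1))  # 76.2 (%)
```

So "5%-12%, tops" per generation still roughly doubles single-thread throughput within a decade of releases, which is why these small-sounding numbers matter.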
 
For 1 I do not look at porn, 2 I stopped cursing. 3 You don't know me, so stop assuming you do. 4 Since you have nothing constructive to say go else where instead of insulting a member.

PS You have been on my ignore list for several years now. You will remain there.

Fair enough. But seriously, why didn't you post it here? I just don't see the point.

Wow, I get a down vote for trying to be the good guy here. :shadedshu:

Downvotes are always a bad idea; they just breed groupthink and tribalism.
 
Today's x86 CPUs are pretty well optimized for IPC, with large caches, out-of-order execution, etc. So, how much further do you think Intel or AMD could improve IPC if they went all out to absolutely maximise it, i.e. if cost, power use, wafer yield, etc. didn't matter?

Here's the real kicker: modern CPUs can already execute way more instructions than they will ever be fed under normal circumstances. There are many things that could be done to increase IPC; the problem is you would see little improvement, due to the everlasting problem of slow system memory. Many CPUs would see a massive uplift in performance simply from being paired with memory that could provide instructions and data at the same rate that they can process them. Think about it: memory technology changes at a much slower pace than IPC improves, and CPU manufacturers have to deal with this ever-present issue that keeps on growing.

The issue as of now isn't IPC, as there is still plenty of room for improvement there; the issue is that there is no point in pushing it much further. Out-of-order CPUs have something called an instruction window, which represents how many instructions the core can look at ahead of time to potentially decode and execute. You can design a CPU with a huge instruction window and many ALUs, and therefore insane theoretical IPC, but you will never be able to fetch enough instructions and data to make it worthwhile.
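The memory-wall argument above can be sketched with a simple stall model (this is my own toy model with invented miss rates and latencies, not anything from the post; real out-of-order cores hide part of the miss latency, so treat it as an upper bound on the damage):

```python
# Rough analytic sketch of why memory stalls cap effective IPC.
# Assumes a simple stall model: CPI = ideal CPI + memory stall cycles per
# instruction. Miss rate, penalty, and reference rate are invented examples.

def effective_ipc(peak_ipc: float, miss_rate: float, miss_penalty_cycles: float,
                  mem_refs_per_instr: float = 0.3) -> float:
    """Effective IPC once cache-miss stall cycles are added to the ideal CPI."""
    cpi = 1.0 / peak_ipc + mem_refs_per_instr * miss_rate * miss_penalty_cycles
    return 1.0 / cpi

# A hypothetical 8-wide core (peak IPC 8) with a 2% miss rate to 200-cycle
# DRAM barely beats a 4-wide core sitting on the same memory:
print(round(effective_ipc(8.0, 0.02, 200), 3))  # 0.755
print(round(effective_ipc(4.0, 0.02, 200), 3))  # 0.69
```

Doubling the core's width buys under 10% here, which is exactly the point being made: past a certain width, memory, not the execution engine, sets the ceiling.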
 
@Vya Domus I think memory performance could be improved to keep up with demand. Remember, this is a hypothetical scenario we're talking about, of what could technically be possible with the right motivation, e.g. a "TV competition" to make the fastest CPU.

Wouldn't a huge instruction window increase IPC at the expense of lag, though?
 
Wouldn't a huge instruction window increase IPC at the expense of lag, though?

Latency as in what?
think memory performance could be improved to keep up with demand

That's the thing: it can't. It's too expensive to manufacture memory, in current capacities, that would be fast enough to make big leaps in IPC worthwhile.

@Vya Domus a hypothetical scenario we're talking about here of what could technically be possible with the right motivation, e.g. a "TV competition" to make the fastest CPU

Hypothetically you can make a very fast CPU with high IPC, as I described; you just wouldn't see it inside anything. The point is that CPUs are already "uselessly" fast to a degree, so that wouldn't prove anything.

You can tell that IPC isn't such a challenging thing when you look at mobile ARM-based processors, which saw massive improvements in just a couple of years but are now slowing down because they run into the same issue.
 
Last edited:
According to the slides when Ryzen launched, Ryzen had a 52% IPC increase over the Excavator architecture (not Piledriver).

The reason AMD managed this impressive feat is simply that Excavator's IPC was that bad.

10%-20% between generations seems too much, even with competition: I'd say 5%-12%, tops.
Now that you mention it, how come Intel managed to pull about +40% IPC with Sandy Bridge?
 
CPUs will always be a balance between thread-switching speed and IPC. There is a limit to the number of threads per core you can efficiently address simultaneously before performance goes downhill a LOT. There was a thread on OCN a few years back, and IIRC thread switching started incurring a lot of performance loss past around 20 threads per core. So in theory you don't actually need more than around 4 cores in most situations, though some games can make efficient use of 6+ these days.

Obviously you have to have enough total throughput to keep up with all the tasks, which is why dual cores are basically dead in the water now.

Another thing to consider is that certain loads do not fully utilise all of the resources a CPU core presents. Modern CPUs are actually moving to a point where they are designed with the assumption that not all the resources will be utilised at once (the AVX offset is an example).

With multi-chip CPUs we also have to start considering inter-die latency and the delays incurred by NUMA, which can further bottleneck you to a fraction of a single core's performance.
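The diminishing returns from piling on threads can be sketched with an Amdahl-style model extended with a per-thread coordination cost (the serial fraction and overhead coefficient below are invented purely for illustration, not the OCN figures mentioned above):

```python
# Toy model of diminishing returns from more threads: each extra thread adds
# useful parallel work but also a fixed scheduling/communication overhead.
# The serial fraction and per-thread cost are made-up illustrative values.

def speedup(threads: int, serial_frac: float = 0.05,
            overhead_per_thread: float = 0.01) -> float:
    """Amdahl's law extended with a per-thread coordination cost."""
    t = serial_frac + (1 - serial_frac) / threads + overhead_per_thread * threads
    return 1.0 / t

# Find the thread count where speedup peaks under these assumptions:
best = max(range(1, 65), key=speedup)
print(best)                       # 10
print(round(speedup(best), 2))    # 4.08
```

Past the peak, every extra thread costs more in coordination than it contributes in parallel work, which matches the "goes downhill a LOT" behaviour described above (with the knee's exact position depending entirely on the workload's real coefficients).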
 