• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

PlayStation 3 Emulator Delivers Modest Speed-Ups with Disabled E-Cores on Intel Alder Lake Processors

AleksandarK

Staff member
Joined
Aug 19, 2017
Messages
1,340 (0.85/day)
According to some testing performed by the team behind RPCS3, a free and open-source emulation software for Sony's PlayStation 3, Intel's Alder Lake processors are enjoying a hefty performance boost when E-Cores is disabled. First of all, the Alder Lake processors feature a hybrid configuration with high-performance P-cores and low-power E-cores. The P-cores are based on Golden Cove architecture and can execute AVX-512 instructions with ease. However, the AVX-512 boost is only applicable when E-cores are disabled as software looks at the whole package. Officially, Alder Lake processors don't support AVX-512, as the processor's little E-cores cannot execute AVX-512 instruction.

Thanks to the team behind the RPCS3 emulator, we have some information and tests that suggest that turning E-cores off gives a performance boost to the emulation speed and game FPS. With E-Cores disabled, and only P-cores left, the processor can execute AVX-512 and gets a higher ring ratio. This means that latency in the ring bus is presumably lower. The team benchmarked Intel Core i9-12900K, and Core i9-11900K processors clocked at 5.2 GHz for tests. The Alder Lake chip had disabled E-cores. In God of War: Ascension, the Rocket Lake processor produced 68 FPS, while Alder Lake produced 78 FPS, representing around 15% improvement.



This suggests that more applications can take advantage of disabling E-cores, especially if the application has support for AVX-512 instructions, where only P-cores can execute them. So it remains to be seen throughout trial and error if more cases like this appear.

View at TechPowerUp Main Site
 
Joined
Jan 31, 2011
Messages
2,015 (0.51/day)
System Name Ultima
Processor AMD Ryzen 7 5800X
Motherboard MSI Mag B550M Mortar
Cooling Arctic Liquid Freezer II 240 rev4 w/ Ryzen offset mount
Memory G.SKill Ripjaws V 2x16GB DDR4 3600
Video Card(s) Palit GeForce GTX 1070 Super JetStream 2Ghz
Storage Samsung 970 Evo Plus 500GB nvme, 1TB Crucial MX500 SSD sata, 1TB Seagate Barracuda 7200rpm HDD sata
Display(s) LG 23MP68VQ-P IPS 75HZ 23" 1080p MONITOR
Case DarkFlash DLM21 Mesh
Audio Device(s) Onboard Realtek ALC1200 Audio/Nvidia HD Audio
Power Supply Corsair RM650
Mouse Logitech G304 | Wacom Intuos CTH-480
Keyboard A4Tech B314 Keyboard
Software Windows 10 Pro
This is what ive been waiting for to see, hope others would use RPCS3 as a CPU benchmark like they did before with dolphin

sample test of RPCS3 running Red dead redemption on several CPUs
1637658413310.png
 
Joined
Feb 23, 2019
Messages
3,234 (3.18/day)
Location
Poland
Processor Ryzen 7 3700X
Motherboard Gigabyte X570 Aorus Elite
Cooling BeQuiet Dark Rock 4
Memory 2x16 GB Crucial Ballistix 3600 CL16 Rev E
Video Card(s) EVGA 1060 6GB SSC
Storage SX8200 Pro 1 TB, Plextor M6Pro 256 GB, WD Blue 2TB
Display(s) Acer XB273GP
Case SilverStone Primera PM01 RGB
Audio Device(s) SoundBlaster G6 | Fidelio X2 | Sennheiser 6XX
Power Supply SeaSonic Focus Plus Gold 750W
Mouse SteelSeries Rival 300
Keyboard MK Typist (Kailh Box White)
This is what ive been waiting for to see, hope others would use RPCS3 as a CPU benchmark like they did before with dolphin

sample test of RPCS3 running Red dead redemption on several CPUs
View attachment 226315
Yeah but you would have to retest everything with each new build of RPCS3.
 
Joined
Jan 31, 2011
Messages
2,015 (0.51/day)
System Name Ultima
Processor AMD Ryzen 7 5800X
Motherboard MSI Mag B550M Mortar
Cooling Arctic Liquid Freezer II 240 rev4 w/ Ryzen offset mount
Memory G.SKill Ripjaws V 2x16GB DDR4 3600
Video Card(s) Palit GeForce GTX 1070 Super JetStream 2Ghz
Storage Samsung 970 Evo Plus 500GB nvme, 1TB Crucial MX500 SSD sata, 1TB Seagate Barracuda 7200rpm HDD sata
Display(s) LG 23MP68VQ-P IPS 75HZ 23" 1080p MONITOR
Case DarkFlash DLM21 Mesh
Audio Device(s) Onboard Realtek ALC1200 Audio/Nvidia HD Audio
Power Supply Corsair RM650
Mouse Logitech G304 | Wacom Intuos CTH-480
Keyboard A4Tech B314 Keyboard
Software Windows 10 Pro
Yeah but you would have to retest everything with each new build of RPCS3.
Just stick to one build then for a test, or maybe a common build for benchmark only like what Dolphin did
 

qubit

Overclocked quantum bit
Joined
Dec 6, 2007
Messages
16,886 (3.30/day)
Location
Quantum Well UK
System Name Quantumville™
Processor Intel Core i7-2700K @ 4GHz
Motherboard Asus P8Z68-V PRO/GEN3
Cooling Noctua NH-D14
Memory 16GB (2 x 8GB Corsair Vengeance Black DDR3 PC3-12800 C9 1600MHz)
Video Card(s) MSI RTX 2080 SUPER Gaming X Trio
Storage Samsung 850 Pro 256GB | WD Black 4TB | WD Blue 6TB
Display(s) BenQ XL2720Z (144Hz, 3D Vision 2, 1080p) | Asus MG28UQ (4K, 60Hz, FreeSync compatible)
Case Cooler Master HAF 922
Audio Device(s) Creative Sound Blaster X-Fi Fatal1ty PCIe
Power Supply Corsair HX 850W v1
Mouse Microsoft Intellimouse Pro - Black Shadow
Keyboard Yes
Software Windows 10 Pro 64-bit
Something is badly wrong with a CPU architecture if disabling half the cores results in a performance improvement.

Say what you want about unoptimized software allegedly being the issue, but the bottom line is that we have 16 core CPU with 8 low performance cores rather than the full complement of 16 performance cores as it should be. I really don't like this hybrid design and feel that the consumer (us) is getting cheated out of a lot of performance.

AMD really need to come back with Alder Lake beating performance with all cores being performance cores, or this situation will continue.
 
Joined
Jan 18, 2020
Messages
108 (0.16/day)
Pretty pointless application? You can get a PS3 off Ebay for about £50 and run anything on the native hardware, saving yourself the huge upgrade cost to Alder Lake for this purpose!
 
Joined
Oct 16, 2013
Messages
38 (0.01/day)
Processor i7 4930k
Motherboard Rampage IV Extreme
Cooling Thermalright HR-02 Macho
Memory 4 X 4096 MB G.Skill DDR3 1866 9-10-9-26
Video Card(s) Gigabyte GV-N780OC-3GD
Storage Crucial M4 128GB, M500 240GB, Samsung HD103SJ 1TB
Display(s) Planar PX2710MW 27" 1920x1080
Case Corsair 500R
Power Supply RAIDMAX RX-1200AE
Software Windows 10 64-bit
This is what ive been waiting for to see, hope others would use RPCS3 as a CPU benchmark like they did before with dolphin

sample test of RPCS3 running Red dead redemption on several CPUs
View attachment 226315
That's basically an SIMD test. 2500 as a desktop processor only has 1/3 performance of 7700HQ because the former lacks AVX2 support.
Something is badly wrong with a CPU architecture if disabling half the cores results in a performance improvement.

Say what you want about unoptimized software allegedly being the issue, but the bottom line is that we have 16 core CPU with 8 low performance cores rather than the full complement of 16 performance cores as it should be. I really don't like this hybrid design and feel that the consumer (us) is getting cheated out of a lot of performance.

AMD really need to come back with Alder Lake beating performance with all cores being performance cores, or this situation will continue.
Because P-core only enables AVX512, which wasn't very useful outside of several cases and may cause unexpected throttling
 
Joined
Jan 8, 2017
Messages
7,039 (3.93/day)
System Name Good enough
Processor AMD Ryzen R7 1700X - 4.0 Ghz / 1.350V
Motherboard ASRock B450M Pro4
Cooling Deepcool Gammaxx L240 V2
Memory 16GB - Corsair Vengeance LPX - 3333 Mhz CL16
Video Card(s) OEM Dell GTX 1080 with Kraken G12 + Water 3.0 Performer C
Storage 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) 4K Samsung TV
Case Deepcool Matrexx 70
Power Supply GPS-750C
Something is badly wrong with a CPU architecture if disabling half the cores results in a performance improvement.

There is nothing wrong with the architecture (apart from the horrid power efficiency), what is wrong is that these types of processors have no place in desktop.

AMD really need to come back with Alder Lake beating performance with all cores being performance cores, or this situation will continue.

I wish the same but this is probably a cope, AMD will likely move to big.LITTLE as well because of the better margins.
 
Joined
Oct 10, 2009
Messages
652 (0.15/day)
Location
Madrid, Spain
System Name Rectangulote / Cubito
Processor Core I9-9900KF / Core I7-8700K
Motherboard Asus TUF Z390M / Gigabyte Z370M D3H
Cooling Alphacool Eisbaer Aurora 280 + Eisblock RTX 3090 RE + 2 x 240 ST30 / Noctua NH-D15
Memory 32 GB DDR4 3600mhz CL16 Crucial Ballistix / 32 GB DDR 3000mhz Corsair LPX
Video Card(s) KFA2 RTX 3090 SG / Gigabyte GTX 1070Ti
Storage 1 WD Blue 3D 1TB + 2 x WD Black SN750 1TB / 2 x 1 WD Blue 3D 1TB
Display(s) 2 x Asus ROG Swift PG278QR / LG Q60R
Case Corsair 5000D Airflow / Thermaltake Core V21
Audio Device(s) Evga Nu Audio + Sennheiser HD599SE + Trust GTX 258
Power Supply Corsair RMX850 / Corsair RM750
Mouse Razer Naga Wireless Pro
Keyboard Dierya DK61 Pro
Software Windows 11 Pro
Say what you want about unoptimized software allegedly being the issue, but the bottom line is that we have 16 core CPU with 8 low performance cores rather than the full complement of 16 performance cores as it should be. I really don't like this hybrid design and feel that the consumer (us) is getting cheated out of a lot of performance.
16 full cores for office and gaming use is just overkill. Why would you need 16 p cores for most people? The problem is not having less p cores rather than giving up features to use the e cores, is a bad implementation, not a bad idea.
 
Joined
Jan 31, 2019
Messages
442 (0.43/day)
Something is badly wrong with a CPU architecture if disabling half the cores results in a performance improvement.

Say what you want about unoptimized software allegedly being the issue, but the bottom line is that we have 16 core CPU with 8 low performance cores rather than the full complement of 16 performance cores as it should be. I really don't like this hybrid design and feel that the consumer (us) is getting cheated out of a lot of performance.

AMD really need to come back with Alder Lake beating performance with all cores being performance cores, or this situation will continue.
I agree, it doesn't make sense especially for high performance desktop CPUs, they could use this for laptops or office desktops with say 2 P cores and few E cores (as they are comparable to a 7700K core).
 
Joined
Jan 24, 2020
Messages
107 (0.16/day)
Pretty pointless application? You can get a PS3 off Ebay for about £50 and run anything on the native hardware, saving yourself the huge upgrade cost to Alder Lake for this purpose!

Not to mention you can play every PS3 game for free using cloud service. They are all locked to 720p 30fps natively anyway. I don't believe someone has a reason to spend their time playing 20 ps3 titles. Maybe the ocasional gem here and there, like Red Dead. You complete it and move on to other games
 
Joined
Dec 5, 2020
Messages
74 (0.20/day)
Something is badly wrong with a CPU architecture if disabling half the cores results in a performance improvement.

Say what you want about unoptimized software allegedly being the issue, but the bottom line is that we have 16 core CPU with 8 low performance cores rather than the full complement of 16 performance cores as it should be. I really don't like this hybrid design and feel that the consumer (us) is getting cheated out of a lot of performance.

AMD really need to come back with Alder Lake beating performance with all cores being performance cores, or this situation will continue.
AMD is probably going to follow Intel with Big.Little as it's the best way to scale up multicore CPU performance. The complaints are just based on ignorance. It's not 16 P-cores vs 8P+8E cores but 10 P-cores vs 8P+8E cores. Hybrid brings us more performance as stuff just doesn't scale linearly.

If software wasn't an issue you'd want a CPU that is entirely made up of E-cores as they're just more efficient in die space so you always get more performance. Unfortunately a lot of software just doesn't scale well so there's a benefit to use big cores but they're just not a cost effective use of your silicon. That's also why all upcoming known Intel architectures keep 8 P-cores and scale up the E-cores. The software that P-cores are designed for don't really use more than 8 cores anyway atm.

This discussion is just a repeat of the whole single core vs multicore CPUs. Back then we also sacrificied single core performance for the sake of having more cores. Conroe just had the benefit of being a massive increase in performance.

Also to adress your first sentence: In games and software with limited scaling disabling HT usually also leads to a performance increase. Or if you disable a CCD on a 5950x you generally also gain performance in software that doesn't scale, like games.
 
Joined
Oct 12, 2005
Messages
290 (0.05/day)
The main thing is Intel should have include a downgraded version of AVX512 on the E-Cores. They could run the instruction but much slower than on the P core to reduce the amount of transistors used. This way they could have kept it on the P-Cores and the thread director could have moved the slight AVX512 load on the P core.

I bet the problem is they couldn't make it without investing too much transistors or crippling too much the performance. A downside of this approach is that these emulator aren't your typical AVX512 load. Generally those load are all cores fully multithreaded.
 
Joined
Dec 25, 2020
Messages
107 (0.31/day)
Location
São Paulo, Brazil
System Name Unova
Processor AMD Ryzen 9 5950X 16-Core Processor
Motherboard ASUS ROG STRIX B550-E Gaming
Cooling id-cooling Frostflow X 360 + Thermal Grizzly Aeronaut
Memory 64 GB (4x 16) Corsair Dominator Platinum @ DDR4-3600 16-17-16-34-1 1.375V
Video Card(s) ASUS TUF Gaming GeForce RTX 3090 24 GB GDDR6X OC Edition (Non-LHR)
Storage XPG SPECTRIX S40G 512 GB
Display(s) Sony XBR-55X905F
Case Lian Li PC-O11 Air
Audio Device(s) EVGA Nu Audio (classic)
Power Supply EVGA SuperNOVA 1300 G2 1300W 80+ Gold
Mouse Corsair M55 RGB Pro
Keyboard Logitech G213 Prodigy
VR HMD OG HTC Vive
Software Windows 11 Pro for Workstations
Benchmark Scores Imagine imagining imagination :D
AMD is probably going to follow Intel with Big.Little as it's the best way to scale up multicore CPU performance. The complaints are just based on ignorance. It's not 16 P-cores vs 8P+8E cores but 10 P-cores vs 8P+8E cores. Hybrid brings us more performance as stuff just doesn't scale linearly.

If software wasn't an issue you'd want a CPU that is entirely made up of E-cores as they're just more efficient in die space so you always get more performance. Unfortunately a lot of software just doesn't scale well so there's a benefit to use big cores but they're just not a cost effective use of your silicon. That's also why all upcoming known Intel architectures keep 8 P-cores and scale up the E-cores. The software that P-cores are designed for don't really use more than 8 cores anyway atm.

This discussion is just a repeat of the whole single core vs multicore CPUs. Back then we also sacrificied single core performance for the sake of having more cores. Conroe just had the benefit of being a massive increase in performance.

Also to adress your first sentence: In games and software with limited scaling disabling HT usually also leads to a performance increase. Or if you disable a CCD on a 5950x you generally also gain performance in software that doesn't scale, like games.

This used to be particularly true with the Zen 2 design, but it is not as much of a problem on the 5950X as it was on the 3950X, as Zen 3 has a single CCX per CCD and full access to the processor's resources at any given moment, I can't think of any given case where restricting applications to a single CCX actually mattered, if anything any potential performance increase would be from higher power allowance and more aggressive clock speeds attained by keeping one of the dies mostly or completely unloaded and that's in situations where threads interleaving between both dies wouldn't benefit to begin with.

I sympathize in general with qubit's line of thought (and why I plan on buying the 3D upgrade for my processor), but I also see the appeal in Alder Lake's hybrid design and where Intel wants to go with it... I just think the Windows ecosystem is not exactly ready for such advanced technology yet. A few friends of mine have upgraded to the i7-12700K and seem to be very pleased with the result, they do pull their own weight for gaming, although something that struck me as odd is that there are games that answer better to being run on the big cores and others that run better on the little cores, meaning that it isn't the big cores that always invariably win the race. That leads to some inconsistency, and to be really frank, my 5950X is plenty fast as it is, I'm only upgrading because of some accounting magic, I give this to my brother, upgrade, sell the 3900XT I left with him last year, 2 people upgrading for less than 1 CPU's full price... everyone wins, that's the idea at least. Hopefully the pricing will be sensible, or even that will not be worth it.
 
Joined
Aug 2, 2012
Messages
940 (0.28/day)
Location
Netherlands
System Name TheDeeGee's PC
Processor Intel Core i7 11700
Motherboard ASRock Z590 Steel Legend
Cooling Noctua NH-D15
Memory Crucial Ballistix RGB 3200/C16 32GB
Video Card(s) MSI GTX 1070 AERO OC
Storage Crucial P5 1TB, Crucial BX500 1TB, Crucial BX500 2TB
Display(s) EIZO CX240
Case Fractal Design Define 7
Audio Device(s) Creative SoundBlaster ZxR
Power Supply Seasonic Fanless TX-700
Mouse Logitech G500s
Keyboard Logitech G710+
Software Windows 10 Pro 64-Bit
Benchmark Scores None, don't care. As long as my games runs smooth.
So far i didn't regret not waiting and getting a 11700 instead.

While the future of CPUs is looking good, it's clear this new architecture has to mature first.
 
Joined
May 17, 2016
Messages
389 (0.19/day)
Location
Bedroom
System Name DUX
Processor Ryzen 9 5900X
Motherboard MSI X570 Gaming Plus
Cooling Fortron Windale 6 blue LED
Memory Crucial Balistix Sport 3800MHz CL16,19,19,39
Video Card(s) RTX 2080Ti
Storage ADATA 512GB M.2
Case Zalman Z1 NEO
Audio Device(s) Kingston HyperX Cloud II
Power Supply Corsair TX850 Gold
Pretty pointless application? You can get a PS3 off Ebay for about £50 and run anything on the native hardware, saving yourself the huge upgrade cost to Alder Lake for this purpose!
I actually bought PS3 Super Slim 500GB this summer. Never had a PS3 before. Some games clearly look amazing. Like Resistance 3 and Killzone 3. However, low resolution prevents them from shining. I played Legend of Zelda:BOTW on CEMU in 4K/60fps and it's a game changer.
 
Joined
Sep 17, 2014
Messages
15,703 (5.95/day)
Location
The Washing Machine
Processor i7 8700k 4.6Ghz @ 1.24V
Motherboard AsRock Fatal1ty K6 Z370
Cooling beQuiet! Dark Rock Pro 3
Memory 16GB Corsair Vengeance LPX 3200/C16
Video Card(s) MSI GTX 1080 Gaming X @ 2100/5500
Storage Samsung 850 EVO 1TB + Samsung 830 256GB + Crucial BX100 250GB + Toshiba 1TB HDD
Display(s) Gigabyte G34QWC (3440x1440)
Case Fractal Design Define C TG
Audio Device(s) Situational :)
Power Supply EVGA G2 750W
Mouse Logitech G502 Protheus Spectrum
Keyboard Lenovo Thinkpad Trackpoint II (Best K/B ever... <3)
Software W10 x64
Im not touching big little until Windows 12, that's for sure.

Probably won't take longer than a year or two, knowing MS and its strategic outlook :D

Pretty pointless application? You can get a PS3 off Ebay for about £50 and run anything on the native hardware, saving yourself the huge upgrade cost to Alder Lake for this purpose!
Sure, but you can't emulate on a PS3, its slow as shit for some games (Heavenly Sword could easily run sub 20 FPS on a ps3, and its no exception), hot and loud for others, you have storage media limitations or a failing BR lens, PS Network is no real added use anymore, should I go on?

And let's not begin about the content itself. Its not like they get released any longer.

Even emulating PS2 on a PC is 10x better than the OG console. Even if only just for save states.
 
Joined
Jan 31, 2011
Messages
2,015 (0.51/day)
System Name Ultima
Processor AMD Ryzen 7 5800X
Motherboard MSI Mag B550M Mortar
Cooling Arctic Liquid Freezer II 240 rev4 w/ Ryzen offset mount
Memory G.SKill Ripjaws V 2x16GB DDR4 3600
Video Card(s) Palit GeForce GTX 1070 Super JetStream 2Ghz
Storage Samsung 970 Evo Plus 500GB nvme, 1TB Crucial MX500 SSD sata, 1TB Seagate Barracuda 7200rpm HDD sata
Display(s) LG 23MP68VQ-P IPS 75HZ 23" 1080p MONITOR
Case DarkFlash DLM21 Mesh
Audio Device(s) Onboard Realtek ALC1200 Audio/Nvidia HD Audio
Power Supply Corsair RM650
Mouse Logitech G304 | Wacom Intuos CTH-480
Keyboard A4Tech B314 Keyboard
Software Windows 10 Pro
I was hoping more on emulation architecture discussion but instead became more of a cpu discussion >_>
 
Joined
Dec 16, 2017
Messages
2,199 (1.52/day)
Location
Buenos Aires, Argentina
System Name System V
Processor AMD Ryzen 5 3600
Motherboard Asus Prime X570-P
Cooling AMD Wraith Stealth // a bunch of 120 mm Xigmatek 1500 RPM fans (2 ins, 3 outs)
Memory 2x8GB Ballistix Sport LT 3200 MHz (BLS8G4D32AESCK.M8FE) (CL16-18-18-36)
Video Card(s) Gigabyte AORUS Radeon RX 580 8 GB
Storage SHFS37A240G / DT01ACA200 / WD20EZRX / MKNSSDTR256GB-3DL / LG BH16NS40 / ST10000VN0008
Display(s) LG 22MP55 IPS Display
Case NZXT Source 210
Audio Device(s) Logitech G430 Headset
Power Supply Corsair CX650M
Mouse Microsoft Trackball Optical 1.0
Keyboard HP Vectra VE keyboard (Part # D4950-63004)
Software Whatever build of Windows 11 is being served in Dev channel at the time.
Benchmark Scores Corona 1.3: 3120620 r/s Cinebench R20: 3355 FireStrike: 12490 TimeSpy: 4624
Just stick to one build then for a test, or maybe a common build for benchmark only like what Dolphin did
There can be multiple builds in one day. And any one you pick won't necessarily be stable on all hardware.

Also, RPCS3 is alpha-level software. You don't want to benchmark with that.

Pretty pointless application? You can get a PS3 off Ebay for about £50 and run anything on the native hardware, saving yourself the huge upgrade cost to Alder Lake for this purpose!
There's a limited number of PS3 consoles in the world. And they're dwindling every day as they fail or break. Never mind that their current owners might not be willing to let go of them for some time yet.
 
Joined
Dec 25, 2020
Messages
107 (0.31/day)
Location
São Paulo, Brazil
System Name Unova
Processor AMD Ryzen 9 5950X 16-Core Processor
Motherboard ASUS ROG STRIX B550-E Gaming
Cooling id-cooling Frostflow X 360 + Thermal Grizzly Aeronaut
Memory 64 GB (4x 16) Corsair Dominator Platinum @ DDR4-3600 16-17-16-34-1 1.375V
Video Card(s) ASUS TUF Gaming GeForce RTX 3090 24 GB GDDR6X OC Edition (Non-LHR)
Storage XPG SPECTRIX S40G 512 GB
Display(s) Sony XBR-55X905F
Case Lian Li PC-O11 Air
Audio Device(s) EVGA Nu Audio (classic)
Power Supply EVGA SuperNOVA 1300 G2 1300W 80+ Gold
Mouse Corsair M55 RGB Pro
Keyboard Logitech G213 Prodigy
VR HMD OG HTC Vive
Software Windows 11 Pro for Workstations
Benchmark Scores Imagine imagining imagination :D
I was hoping more on emulation architecture discussion but instead became more of a cpu discussion >_>

I mean, these are all x86-64 processors and end of the day an emulator's job is to translate native machine code into something that your processor can execute, something that can be attained through an interpreter (which tends to be slow, but more accurate and universally predictable) or dynamic JIT (just-in-time compilation, or batched translation of bytecode at runtime, allowing for target architecture specific optimizations) among other methods, those two being the most common... This will be the case regardless of whether you run Sandy Bridge or Zen 3, or if you're running an NES emulator or a PlayStation 3 emulator.

RPCS3 was always particularly Intel-friendly, but that's because the emulator's sensitive to a few things that Intel chips currently do better and that their CPUs have historically had a bit of an advantage as far as instruction sets go. This emulator in particular has pioneered use of all of these instructions, like TSX, 256-bit and 512-bit AVX, etc, I wouldn't fault any of its developers for preferring Intel for their development machines. I applaud their use of pioneering instruction sets, even if unsupported by many modern CPU microarchitectures.

The lukewarm reaction in this thread is probably expected, RPCS3 is hardly representative of any real-world or meaningful advantage of the Intel architecture vs. the AMD one, as it has always traditionally been Intel-biased. It's not a bad thing, there are other places where Ryzen will shine particularly bright, as well. :)
 
Joined
Oct 12, 2005
Messages
290 (0.05/day)
This used to be particularly true with the Zen 2 design, but it is not as much of a problem on the 5950X as it was on the 3950X, as Zen 3 has a single CCX per CCD and full access to the processor's resources at any given moment, I can't think of any given case where restricting applications to a single CCX actually mattered, if anything any potential performance increase would be from higher power allowance and more aggressive clock speeds attained by keeping one of the dies mostly or completely unloaded and that's in situations where threads interleaving between both dies wouldn't benefit to begin with.

The thing is on Zen 2, communication between CCX had to go thru the I/O die. The infinity fabric could become saturated by all those access and it had to compete with memory and i/o access too. And this round trip to the I/O die was costly on latency and power usage.

On Zen 3, all core within the CCD can communicate directly with each other but still have to go thru the I/O die via infinity fabrics and this have a latency impact. There are application that are faster on the 5800x than on the 5900x because they are affected by that latency. By example

1637682812498.png


But those are rare and generally, the higher frequency compensate the latency problem. It's true that the OS should just use the 5950x as a Single CCD but it's harder to implement in real life than in theory. It's more up to the application to establish that.
 
Joined
Dec 25, 2020
Messages
107 (0.31/day)
Location
São Paulo, Brazil
System Name Unova
Processor AMD Ryzen 9 5950X 16-Core Processor
Motherboard ASUS ROG STRIX B550-E Gaming
Cooling id-cooling Frostflow X 360 + Thermal Grizzly Aeronaut
Memory 64 GB (4x 16) Corsair Dominator Platinum @ DDR4-3600 16-17-16-34-1 1.375V
Video Card(s) ASUS TUF Gaming GeForce RTX 3090 24 GB GDDR6X OC Edition (Non-LHR)
Storage XPG SPECTRIX S40G 512 GB
Display(s) Sony XBR-55X905F
Case Lian Li PC-O11 Air
Audio Device(s) EVGA Nu Audio (classic)
Power Supply EVGA SuperNOVA 1300 G2 1300W 80+ Gold
Mouse Corsair M55 RGB Pro
Keyboard Logitech G213 Prodigy
VR HMD OG HTC Vive
Software Windows 11 Pro for Workstations
Benchmark Scores Imagine imagining imagination :D
The thing is on Zen 2, communication between CCX had to go thru the I/O die. The infinity fabric could become saturated by all those access and it had to compete with memory and i/o access too. And this round trip to the I/O die was costly on latency and power usage.

On Zen 3, all core within the CCD can communicate directly with each other but still have to go thru the I/O die via infinity fabrics and this have a latency impact. There are application that are faster on the 5800x than on the 5900x because they are affected by that latency. By example

But those are rare and generally, the higher frequency compensate the latency problem. It's true that the OS should just use the 5950x as a Single CCD but it's harder to implement in real life than in theory. It's more up to the application to establish that.

I mean, with the 5950X, you have two complete dies inside the processor, so if you run into this very specific fringe scenario you've mentioned, you can disable one of the CCDs and by all intents have a fully functional 5800X in there. The latency impact is more than likely one of the things AMD hopes to mitigate with the 3D cache, and one of the reasons I believe that it will work. I wonder how would chiplet technology end up affecting a GPU (for graphics rendering purposes), I really do. Aldebaran's more of a compute processor than anything.
 
Joined
Oct 12, 2005
Messages
290 (0.05/day)
I mean, with the 5950X, you have two complete dies inside the processor, so if you run into this very specific fringe scenario you've mentioned, you can disable one of the CCDs and by all intents have a fully functional 5800X in there. The latency impact is more than likely one of the things AMD hopes to mitigate with the 3D cache, and one of the reasons I believe that it will work. I wonder how would chiplet technology end up affecting a GPU (for graphics rendering purposes), I really do. Aldebaran's more of a compute processor than anything.
A larger Cache won't help to reduce latency as a data that got processed by another CCD will remain there, they will just be able to hold more.

CCD to CCD isn't much faster than memory access so it won't really help there. What AMD could do with a larger interposer is to add Infinity fabrics link between CCD. This should cut the CCD to CCD latency by half at least.

As for GPU, again it will depend of the kind of code it will run. if it has to do a lot of sync, that will not be beneficial to have 2 CPU on the same die instead of 1 big. If all data is very contain and have a very high level of parallelism, it won't matter much. (like Zen 2 still do very great on video encoding, 3d rendering etc.)
 
Joined
Feb 1, 2019
Messages
328 (0.32/day)
Location
UK, Leicester
System Name Main PC
Processor 9900k@4.8ghz
Motherboard Asrock Fatality K6 Z370
Cooling Noctua NH-D15S
Memory 32 Gig 3200CL14
Video Card(s) 3080 RTX FE
Storage 970 EVO 1TB
Display(s) LG 27GL850
Case Fractal Define R4
Audio Device(s) Asus Xonar D2X
Power Supply Antec HCG 750
Pretty pointless application? You can get a PS3 off Ebay for about £50 and run anything on the native hardware, saving yourself the huge upgrade cost to Alder Lake for this purpose!

Post processing AA
Save states
Massively less cable clutter, and room required to house console.
Ability to memory hack games.

Definite advantage,
 
Joined
Nov 18, 2010
Messages
5,630 (1.40/day)
Location
Rīga, Latvia
System Name HELLSTAR
Processor AMD RYZEN 5950X
Motherboard ASUS Strix X570-E
Cooling Custom Loop. Two 360ies + 280 rad. 8x Nidec Servo Gentle Typhoons. EK-Quantum Momentum monoblock.
Memory 4x8GB Corsair Vengeance LPX 3000MHz 15-15-15-36 CR1[16-18-18-32-50;TRFC560@3200MHz]]
Video Card(s) ASUS 1080 Ti FE + water block
Storage Optane 900P + Samsung PM981 NVMe 1TB + 750 EVO 500GB
Display(s) Philips PHL BDM3270 + Acer XV242Y
Case Phanteks Enthoo Evolv ATX Tempered Glass
Audio Device(s) Sound Blaster ZxR
Power Supply Fractal Design Newton R3 1000W
Mouse Razer Basilisk
Keyboard Razer BlackWidow V3 - Yellow Switch
Software Windows 11 insider
More cases like this will not appear.

Where else you would need to mimic instruction sets of a super complex CELL CPU? Also a lot of contributes the raw over 5GHz single core boost. Not only the AVX512. The added performance number corelates more with the added frequency gap.

Actually the emulator is usable, I have played Metal Gear 4 on it. Occasional freezing is more an issue than the lack of CPU power. It is 30FPS limited ingame either way, so what's the fuss?

The LLVM needs a lot of job... and it still has poor multithreading, they experiment a lot in certain way, but it lacks the desired result often.
 
Top