Friday, September 15th 2023

Apple A17 Pro SoC Within Reach of Intel i9-13900K in Single-Core Performance

An Apple "iPhone16,1" was put through the Geekbench 6.2 gauntlet earlier this week—according to database info this pre-retail sample was running a build of iOS 17.0 (currently in preview) and its logic board goes under the "D83AP" moniker. It is interesting to see a unit hitting the test phase only a day after the unveiling of Apple's iPhone 15 Pro and Max models—the freshly benched candidate seems to house an A17 Pro system-on-chip. The American tech giant has set lofty goals for said flagship SoC, since it is "the industry's first 3-nanometer chip. Continuing Apple's leadership in smartphone silicon, A17 Pro brings improvements to the entire chip, including the biggest GPU redesign in Apple's history. The new CPU is up to 10 percent faster with microarchitectural and design improvements, and the Neural Engine is now up to 2x faster."

Tech news sites have pored over the leaked unit's Geekbench 6.2 scores—its A17 Pro chipset (TSMC N3) surpasses the previous generation A16 Bionic (TSMC N4) by 10% in single-core stakes. Apple revealed this performance uplift during this week's iPhone "Wonderlust" event, so the result is not at all surprising. The multi-score improvement is a mere ~3%, suggesting that only minor tweaks have been made to the underlying microarchitecture. The A17 Pro beats Qualcomm's Snapdragon 8 Gen 2 in both categories—2914 vs. 2050 (SC) and 7199 vs. 5405 (MC) respectively. Spring time leaks indicated that the "A17 Bionic" was able to keep up with high-end Intel and AMD desktop CPUs in terms of single-core performance—the latest Geekbench 6.2 entry semi-confirms those claims. The A17 Pro's single-threaded performance is within 10% of Intel Core i9-13900K and Ryzen 9 7950X processors. Naturally, Apple's plucky mobile chip cannot put up a fight in the multi-core arena, additionally Tom's Hardware notes another catch: "A17 Pro operates at 3.75 GHz, according to the benchmark, whereas its mighty competitors work at about 5.80 GHz and 6.0 GHz, respectively."
Sources: Tom's Hardware, Geekbench, Techspot, Wccftech
Add your own comment

46 Comments on Apple A17 Pro SoC Within Reach of Intel i9-13900K in Single-Core Performance

#26
progste
All this discussion about the advantages of soldered RAM makes me wonder how desktop CPUs will evolve in the future.
We're already in a time where basically 2 sizes of RAM makes sense for DDR5 (32 or 64 GB), at least for consumer desktops.
Now imagine if intel and AMD moved to an "embedded" CPU design where you buy the whole CPU+RAM block to be slotted into the motherboard, kinda like GPUs are made where you have a reference design and board partners make their own version with the chips sourced by AMD/intel.
The RAM slots could be replaced by SSDs, secondary expansion memory or other similar stuff.
Posted on Reply
#27
Ferrum Master
silentbogoThe only question is why... I don't really give a crap about computational photography and AI/ML in my pocket. Running compute-intensive apps on the phone as-is is also stupid(from usability perspective). Having full-blown AAA games on the go, running natively, isn't that impressive these days either. All it needs is a proper dock with a proper desktop mode, at minimum like DEX. Then we'll talk.
Their camera is basically full of it... also the denoise engine relies on it. They have build their all API around it. Imagine an instagram wh*r5 that actually is a large user base for those flagship iphones, and allow them to be worse than the Android competition? They have to hold the bar high and make is as dumb proof as they can. AI is the only method to evolve phone camera quality further. Nothing else can be done due to basic physics. The size is just too small.

Same applies to the workstation CPU's they have. All productivity related software incorporate some sort of computing these days... let it be AI denoise etc... they don't have AMD or NVIDA cards to have compute cores now... they predicted it right to incorporate into the CPU clusters and actually the Neural Engines ain't that bad.

Imagine that backlash that your new shiny macs could do properly Adobe tasks that use compute in timid fashion? It would be a disaster.
Posted on Reply
#28
Dave65
WirkoYeah, Intel really should try that.
:laugh::toast:

Well said.
Posted on Reply
#30
trparky
What people are failing to understand here is that if the mobile chip, the A17 Pro, can achieve these kinds of results, imagine what the performance numbers would be if Apple took the A17 Pro chip and took the gloves off by making it a future M3 chip. I have a feeling that the M3 chip would wipe the floor with a 13900K.

I'm by no means an Apple fanatic, I will always buy a PC. However, you have to hand it to their engineers... they're world class.
Posted on Reply
#31
Jism
progsteAll this discussion about the advantages of soldered RAM makes me wonder how desktop CPUs will evolve in the future.
We're already in a time where basically 2 sizes of RAM makes sense for DDR5 (32 or 64 GB), at least for consumer desktops.
Now imagine if intel and AMD moved to an "embedded" CPU design where you buy the whole CPU+RAM block to be slotted into the motherboard, kinda like GPUs are made where you have a reference design and board partners make their own version with the chips sourced by AMD/intel.
The RAM slots could be replaced by SSDs, secondary expansion memory or other similar stuff.
Because of this, hardware would be half more expensive then it already is.

You forget that apple charge a certain tax - and if you buy a phone, it's usually a 100$ more for just 64GB / 128GB of storage extra.

There's equipment already having memory on the same chip, such as the PS3 GPU.

Apple makes CPU's quite efficient - and to be honest, i really was suprised after having to replace the Macbook 13 inch battery - to find a board the size of a shoe:



That is packing everything - from CPU to Memory and Graphics. And fits in between the screen and battery.

It was a tough job to actually accomplish since it was my first mac's battery replacement. But geezus the super tiny screws at the size of avg flees n stuff.

Apple has some very skilled engineers.
Posted on Reply
#32
Denver
It's very interesting, just before Apple launches a new Soc, this benchmark receives an update that boosts the numbers a little compared to the old one.

Anyway, I would say that this software is at least inadequate... :rolleyes:
Posted on Reply
#33
user556
AnotherReaderIn that case, the A17 would win by a landslide in single threaded performance. Still, the clock speed is a part of the design and equalizing clocks doesn't make sense for two very different designs.
Comparing power consumption would make a lot of sense.
Posted on Reply
#34
mohammed2006
waw 3nn chip is more efficient then 10nn who could have known
Posted on Reply
#35
RandallFlagg
Nuckles56The big thing that everyone seems to be forgetting is the additional width in the decode block the A17 likely has for throughput vs Intel and AMD. I remember when Anandtech did the deep dive on the A14 in 2020, it had an eight wide decode block vs intel and AMD having 4 max. Even if the x86 chips can't change, I fully expect that apple has made their CPU design have a 12 or 16 wide decode block now to keep going with their big and wide over max frequency strategy.
trparkyWhat people are failing to understand here is that if the mobile chip, the A17 Pro, can achieve these kinds of results, imagine what the performance numbers would be if Apple took the A17 Pro chip and took the gloves off by making it a future M3 chip. I have a feeling that the M3 chip would wipe the floor with a 13900K.

I'm by no means an Apple fanatic, I will always buy a PC. However, you have to hand it to their engineers... they're world class.
Nuckles answered that question.

Broadly speaking, chip pipelines are either wide and shallow or narrow and deep. Within the same transistor budget, a wider decode with shorter pipelines means lower clocks while a narrower decode with longer pipelines means higher clocks.

Both AMD and Intel and x86 in general use narrow and deep.

AMD and Intel design for higher clocks, they are 4 (Zen) and 1+3 (Intel) wide decode pipelines. Apple was at 7 wide with the A11/A12 and 8 wide back with the A14, not sure where they are now. Samsung was at 6 a couple of years ago and generic ARM Cortex was 6.

This is why clock-for-clock comparisons are meaningless. These are high level design choices the engineers made for their use cases. Apple is designing for lower clocks and higher efficiency, AMD and Intel are mostly going for raw performance hence higher clocks.
Posted on Reply
#36
zlobby
Yeah, another year, another ARM vs x86-64 BS from apple!
Posted on Reply
#37
Vayra86
Minus InfinityPretty pathetic that tech sites still refer to the utterly useless Geekbench which has virtually zero relevance to real world applications.
But here Apple wins! So this is what matters, we need to keep the Apple does it all better fairy tale alive right
Posted on Reply
#38
qcmadness
Apple paid double transistor count per "performance" core. Are you really willing to pay double price for that?
Posted on Reply
#39
Luminescent
OMG, i can finally ditch my middle tower PC and edit/work in Adobe on the apple A17 BIONIC, at even has RAY-TRACING, oh wait, i cannot, it's all benchmarks and numbers, i still need that ugly, big and huge middle tower.
Posted on Reply
#40
FoulOnWhite
LuminescentOMG, i can finally ditch my middle tower PC and edit/work in Adobe on the apple A17 BIONIC, at even has RAY-TRACING, oh wait, i cannot, it's all benchmarks and numbers, i still need that ugly, big and huge middle tower.
The phone needs the power. imagine all the time you save when the phone or msg apps open 0.34 seconds quicker than the previous model.
Posted on Reply
#41
zlobby
FoulOnWhiteThe phone needs the power. imagine all the time you save when the phone or msg apps open 0.34 seconds quicker than the previous model.
0.34s is actually noticeable. But I agree with the general statement.
Posted on Reply
#42
FoulOnWhite
zlobby0.34s is actually noticeable. But I agree with the general statement.
i made up 0.34 i was being sarcastic sorry
Posted on Reply
#43
claes
LuminescentOMG, i can finally ditch my middle tower PC and edit/work in Adobe on the apple A17 BIONIC, at even has RAY-TRACING, oh wait, i cannot, it's all benchmarks and numbers, i still need that ugly, big and huge middle tower.
I mean, if adobe is your workload, you can buy an iPad or a Studio and get better performance
Posted on Reply
#44
Aquinus
Resident Wat-man
claesI mean, if adobe is your workload, you can buy an iPad or a Studio and get better performance
Adobe's acceleration on most Macs is pretty good. The 5600M in my MBP accelerates workloads in Photoshop and Lightroom pretty well to be completely honest.
Posted on Reply
#45
Scrizz
JismApple makes CPU's quite efficient - and to be honest, i really was suprised after having to replace the Macbook 13 inch battery - to find a board the size of a shoe:

That is packing everything - from CPU to Memory and Graphics. And fits in between the screen and battery.

It was a tough job to actually accomplish since it was my first mac's battery replacement. But geezus the super tiny screws at the size of avg flees n stuff.

Apple has some very skilled engineers.
I guess you need to take apart more ultrabooks :laugh: small boards like that are pretty common.
Also, since we're talking ARM... your cellphone also packs the CPU, memory and graphics in a much smaller board. ;) :D
Posted on Reply
#46
varase
Nuckles56The big thing that everyone seems to be forgetting is the additional width in the decode block the A17 likely has for throughput vs Intel and AMD. I remember when Anandtech did the deep dive on the A14 in 2020, it had an eight wide decode block vs intel and AMD having 4 max. Even if the x86 chips can't change, I fully expect that apple has made their CPU design have a 12 or 16 wide decode block now to keep going with their big and wide over max frequency strategy.
CPU width has diminishing returns - I'll bet the Apple Silicon team did a shot whenever they got the out of order execution unit to execute eight instructions simultaneously.

Maybe they've added more arithmetic units so the pipeline is more symmetrical, but I can't see how going much wider will trigger that much of performance improvement.
Posted on Reply
Add your own comment
Jun 4th, 2024 09:20 EDT change timezone

New Forum Posts

Popular Reviews

Controversial News Posts