• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

Intel "Cannon Lake" Could Bring AVX-512 Instruction-set to the Mainstream

btarunr

Editor & Senior Moderator
Staff member
Joined
Oct 9, 2007
Messages
46,355 (7.68/day)
Location
Hyderabad, India
System Name RBMK-1000
Processor AMD Ryzen 7 5700G
Motherboard ASUS ROG Strix B450-E Gaming
Cooling DeepCool Gammax L240 V2
Memory 2x 8GB G.Skill Sniper X
Video Card(s) Palit GeForce RTX 2080 SUPER GameRock
Storage Western Digital Black NVMe 512GB
Display(s) BenQ 1440p 60 Hz 27-inch
Case Corsair Carbide 100R
Audio Device(s) ASUS SupremeFX S1220A
Power Supply Cooler Master MWE Gold 650W
Mouse ASUS ROG Strix Impact
Keyboard Gamdias Hermes E2
Software Windows 11 Pro
Intel's next-generation "Cannon Lake" CPU micro-architecture could see the introduction of the AVX-512 instruction-set to the mainstream segments (MSDT or mainstream-desktop, and mobile). It is currently available on the company's Core X "Skylake-X" HEDT processors, and on the company's Xeon "Skylake-W," Xeon Scalable "Skylake-SP," and in a limited form on the Xeon Phi Knights Landing and Knights Mill scalar compute chips.

The upcoming "Cannon Lake" mainstream silicon will feature AVX512F, AVX512CD, AVX512DQ, AVX512BW, and AVX512VL instructions, and will support AVX512_IFMA and AVX512_VBMI commands, making it a slightly broader implementation of AVX-512 than the "Skylake-SP" silicon. The new AVX-512 will vastly improve performance of compute-intensive applications that take advantage of it. It will also be a key component of future security standards.



View at TechPowerUp Main Site
 

2901BitSlice

New Member
Joined
Sep 7, 2017
Messages
11 (0.00/day)
The more salient question is WHEN will Cannon Lake bring us the AVX512 Instruction Set ?
 
Joined
Jun 12, 2017
Messages
136 (0.05/day)
From an architectural perspective, that makes pretty much sense. A skylake(client) core has two 256-bits FMA units (on port0 and port1), which can be fused into one 512-bits FMA unit, as they have already done with Skylake server variant (one of two 512-bits FMAs is fused from 2*256 in original client core variant, and another 512 is implemented by attaching an additional AVX section to the core).

But the problem is, if they have already implemented this on Skylake-SP, why not coffee lake or even kaby lake? And why did they change their mind with Cannonlake? From a timeline perspective, doubtful.
 
Joined
Sep 26, 2012
Messages
860 (0.20/day)
Location
Australia
System Name ATHENA
Processor AMD 7950X
Motherboard ASUS Crosshair X670E Extreme
Cooling Noctua NH-D15S, 7 x Noctua NF-A14 industrialPPC IP67 2000RPM
Memory 2x32GB Trident Z RGB 6000Mhz CL30
Video Card(s) ASUS 4090 Strix
Storage 3 x Kingston Fury 4TB, 4 x Samsung 870 QVO
Display(s) Alienware AW3821DW, Wacom Cintiq Pro 15
Case Fractal Design Torrent
Audio Device(s) Topping A90/D90 MQA, Fluid FPX7 Fader Pro, Beyerdynamic T1 G2, Beyerdynamic MMX300
Power Supply ASUS THOR 1600T
Mouse Xtrfy MZ1 - Zy' Rail, Logitech MX Vertical, Logitech MX Master 3
Keyboard Logitech G915 TKL
VR HMD Oculus Quest 2
Software Windows 11 + OpenSUSE MicroOS
AVX512 - where either the vector unit runs, or your CPU runs (as thermally, the vector unit throttles the shit out of the CPU).

Intel needs to solve that before I'll get excited about AVX512 (as lets be honest, its only generally useful for the 1% of stuff I can't send to the GPU in the first place).
 
Joined
Sep 7, 2017
Messages
3,244 (1.34/day)
System Name Grunt
Processor Ryzen 5800x
Motherboard Gigabyte x570 Gaming X
Cooling Noctua NH-U12A
Memory Corsair LPX 3600 4x8GB
Video Card(s) Gigabyte 6800 XT (reference)
Storage Samsung 980 Pro 2TB
Display(s) Samsung CFG70, Samsung NU8000 TV
Case Corsair C70
Power Supply Corsair HX750
Software Win 10 Pro
Joined
Aug 6, 2017
Messages
7,412 (3.03/day)
Location
Poland
System Name Purple rain
Processor 10.5 thousand 4.2G 1.1v
Motherboard Zee 490 Aorus Elite
Cooling Noctua D15S
Memory 16GB 4133 CL16-16-16-31 Viper Steel
Video Card(s) RTX 2070 Super Gaming X Trio
Storage SU900 128,8200Pro 1TB,850 Pro 512+256+256,860 Evo 500,XPG950 480, Skyhawk 2TB
Display(s) Acer XB241YU+Dell S2716DG
Case P600S Silent w. Alpenfohn wing boost 3 ARGBT+ fans
Audio Device(s) K612 Pro w. FiiO E10k DAC,W830BT wireless
Power Supply Superflower Leadex Gold 850W
Mouse G903 lightspeed+powerplay,G403 wireless + Steelseries DeX + Roccat rest
Keyboard HyperX Alloy SilverSpeed (w.HyperX wrist rest),Razer Deathstalker
Software Windows 10
Benchmark Scores A LOT
I've been hearing about avx since haswell, could someone explain it to a simpleton ?
 
Joined
Feb 3, 2017
Messages
3,481 (1.32/day)
Processor R5 5600X
Motherboard ASUS ROG STRIX B550-I GAMING
Cooling Alpenföhn Black Ridge
Memory 2*16GB DDR4-2666 VLP @3800
Video Card(s) EVGA Geforce RTX 3080 XC3
Storage 1TB Samsung 970 Pro, 2TB Intel 660p
Display(s) ASUS PG279Q, Eizo EV2736W
Case Dan Cases A4-SFX
Power Supply Corsair SF600
Mouse Corsair Ironclaw Wireless RGB
Keyboard Corsair K60
VR HMD HTC Vive
I've been hearing about avx since haswell, could someone explain it to a simpleton ?
Additional set of instructions/operations that processor can perform, also with larger numbers. There are additional bonuses, like being way more efficient when doing the exact same operation on a number of operands.
While the main x86 operations work with 64-bit numbers, SSE does 128-bit, AVX does 256-bit and AVS-512 does 512-bit numbers. This increases both range for integer calculations and precision for floating point calculations.
Usefulness of these extensions relies heavily on both compilers and software being aware of these and using these. On the other hand, these operations are useful for some types of software (productivity software usually, lots of calculations, especially requiring high precision) and less useful for others.

SSE is from Pentium 3 era.
AVX has been there since Sandy Bridge (and Bulldozer).
AVX2 since Haswell (and Excavator).

Wiki is actually pretty good on the topic:
https://en.wikipedia.org/wiki/Advanced_Vector_Extensions
https://en.wikipedia.org/wiki/Streaming_SIMD_Extensions
 
Joined
Aug 6, 2017
Messages
7,412 (3.03/day)
Location
Poland
System Name Purple rain
Processor 10.5 thousand 4.2G 1.1v
Motherboard Zee 490 Aorus Elite
Cooling Noctua D15S
Memory 16GB 4133 CL16-16-16-31 Viper Steel
Video Card(s) RTX 2070 Super Gaming X Trio
Storage SU900 128,8200Pro 1TB,850 Pro 512+256+256,860 Evo 500,XPG950 480, Skyhawk 2TB
Display(s) Acer XB241YU+Dell S2716DG
Case P600S Silent w. Alpenfohn wing boost 3 ARGBT+ fans
Audio Device(s) K612 Pro w. FiiO E10k DAC,W830BT wireless
Power Supply Superflower Leadex Gold 850W
Mouse G903 lightspeed+powerplay,G403 wireless + Steelseries DeX + Roccat rest
Keyboard HyperX Alloy SilverSpeed (w.HyperX wrist rest),Razer Deathstalker
Software Windows 10
Benchmark Scores A LOT
what software can benefit from avx ? rendering ? games ? or just number crunching ?
 
Joined
Feb 3, 2017
Messages
3,481 (1.32/day)
Processor R5 5600X
Motherboard ASUS ROG STRIX B550-I GAMING
Cooling Alpenföhn Black Ridge
Memory 2*16GB DDR4-2666 VLP @3800
Video Card(s) EVGA Geforce RTX 3080 XC3
Storage 1TB Samsung 970 Pro, 2TB Intel 660p
Display(s) ASUS PG279Q, Eizo EV2736W
Case Dan Cases A4-SFX
Power Supply Corsair SF600
Mouse Corsair Ironclaw Wireless RGB
Keyboard Corsair K60
VR HMD HTC Vive
what software can benefit from avx ? rendering ? games ? or just number crunching ?
Good candidate is anything that does parallelization or vectorization, basically running same operations on a lot of data. Image or video processing (Adobe stuff), encoding/decoding (ffmpeg), compression (7zip, WinRAR) plus obviously anything that does a lot of math (Excel).
 
Joined
Apr 12, 2013
Messages
6,743 (1.68/day)
So, in a nutshell, don't buy Kaby Lake now, wait for Cannon Lake next year...
It's supposed to be on track for release later this year, proper paper launch it ought to be. Though CNL is not going to be in Intel's full product stack i.e. desktop, server et al, only a handful of low power chips.
 
Joined
Feb 3, 2017
Messages
3,481 (1.32/day)
Processor R5 5600X
Motherboard ASUS ROG STRIX B550-I GAMING
Cooling Alpenföhn Black Ridge
Memory 2*16GB DDR4-2666 VLP @3800
Video Card(s) EVGA Geforce RTX 3080 XC3
Storage 1TB Samsung 970 Pro, 2TB Intel 660p
Display(s) ASUS PG279Q, Eizo EV2736W
Case Dan Cases A4-SFX
Power Supply Corsair SF600
Mouse Corsair Ironclaw Wireless RGB
Keyboard Corsair K60
VR HMD HTC Vive
What makes it a paper launch?
If I look at local shops, I could go and buy i7 8700 or i5 8400 right now for what seem to be MSRP prices. i7 8700K is not in stock but that is not very surprising.
 
Joined
Oct 2, 2004
Messages
13,791 (1.93/day)
So, it's like GeForce GTX 750 and Radeon R9 285. A forgettable chip with all the latest tech not seen even in their most expensive top of the line products. Well, that's garbage then. Don't buy either and wait 2 more years then for it to get into the "mainstream". I see it as important enough to be worthy waiting unless you have a really prehistoric system that needs replacing like NOW.
 

bug

Joined
May 22, 2015
Messages
13,210 (4.06/day)
Processor Intel i5-12600k
Motherboard Asus H670 TUF
Cooling Arctic Freezer 34
Memory 2x16GB DDR4 3600 G.Skill Ripjaws V
Video Card(s) EVGA GTX 1060 SC
Storage 500GB Samsung 970 EVO, 500GB Samsung 850 EVO, 1TB Crucial MX300 and 2TB Crucial MX500
Display(s) Dell U3219Q + HP ZR24w
Case Raijintek Thetis
Audio Device(s) Audioquest Dragonfly Red :D
Power Supply Seasonic 620W M12
Mouse Logitech G502 Proteus Core
Keyboard G.Skill KM780R
Software Arch Linux + Win10
what software can benefit from avx ? rendering ? games ? or just number crunching ?
It's primarily about number crunching. But if you use that number crunching to improve AI (for example), this can also mean more challenging/fun games.
 
Joined
Dec 28, 2012
Messages
3,475 (0.84/day)
System Name Skunkworks
Processor 5800x3d
Motherboard x570 unify
Cooling Noctua NH-U12A
Memory 32GB 3600 mhz
Video Card(s) asrock 6800xt challenger D
Storage Sabarent rocket 4.0 2TB, MX 500 2TB
Display(s) Asus 1440p144 27"
Case Old arse cooler master 932
Power Supply Corsair 1200w platinum
Mouse *squeak*
Keyboard Some old office thing
Software openSUSE tumbleweed/Mint 21.2
Good candidate is anything that does parallelization or vectorization, basically running same operations on a lot of data. Image or video processing (Adobe stuff), encoding/decoding (ffmpeg), compression (7zip, WinRAR) plus obviously anything that does a lot of math (Excel).
Also going to add- emulation. Emulating more complex game consoles depends on high FPU calculations. PCSX2 got a nice boost from AVX.

Things like AVX-512 will probably be a near requirement for x360/PS3 emulation, if we ever get there.
 
Joined
Jun 10, 2014
Messages
2,900 (0.81/day)
Processor AMD Ryzen 9 5900X ||| Intel Core i7-3930K
Motherboard ASUS ProArt B550-CREATOR ||| Asus P9X79 WS
Cooling Noctua NH-U14S ||| Be Quiet Pure Rock
Memory Crucial 2 x 16 GB 3200 MHz ||| Corsair 8 x 8 GB 1333 MHz
Video Card(s) MSI GTX 1060 3GB ||| MSI GTX 680 4GB
Storage Samsung 970 PRO 512 GB + 1 TB ||| Intel 545s 512 GB + 256 GB
Display(s) Asus ROG Swift PG278QR 27" ||| Eizo EV2416W 24"
Case Fractal Design Define 7 XL x 2
Audio Device(s) Cambridge Audio DacMagic Plus
Power Supply Seasonic Focus PX-850 x 2
Mouse Razer Abyssus
Keyboard CM Storm QuickFire XT
Software Ubuntu
AVX512 - where either the vector unit runs, or your CPU runs (as thermally, the vector unit throttles the shit out of the CPU).
Sure, the CPU clocks down when running AVX instructions, but the efficiency gains of AVX is so massive it will still outperform pure ALU/FPU operations by a large factor.

Intel needs to solve that before I'll get excited about AVX512 (as lets be honest, its only generally useful for the 1% of stuff I can't send to the GPU in the first place).
AVX-512 is massive, it can't reach high clocks.
The real problem is it will take years before consumer software will utilize it.
As with other CPU instructions the software has to be compiled to use this feature. In some cases compilers can automatically vectorize certain structures (requires compiler flags), but usually the programmer has to apply specific intrinsics, which basically are macros mapping almost directly to assembly.

I've been hearing about avx since haswell, could someone explain it to a simpleton ?
A vector unit is able to process multiple pieces of data at once, e.g. a AVX-512 unit can process up to 1x512-bit, 2x256-bit, 4x128-bit, 8x64-bit, 16x32-bit, etc. operations per clock. Each CPU core may contain multiple AVX and FMA units on different execution ports, some only do e.g. multiplication.

So, in a nutshell, don't buy Kaby Lake now, wait for Cannon Lake next year...
If you need AVX-512…
Also, Ice Lake will be the next desktop archtecture.
 
Last edited:
Joined
Oct 2, 2015
Messages
2,991 (0.96/day)
Location
Argentina
System Name Ciel
Processor AMD Ryzen R5 5600X
Motherboard Asus Tuf Gaming B550 Plus
Cooling ID-Cooling 224-XT Basic
Memory 2x 16GB Kingston Fury 3600MHz@3933MHz
Video Card(s) Gainward Ghost 3060 Ti 8GB + Sapphire Pulse RX 6600 8GB
Storage NVMe Kingston KC3000 2TB + NVMe Toshiba KBG40ZNT256G + HDD WD 4TB
Display(s) AOC Q27G3XMN + Samsung S22F350
Case Cougar MX410 Mesh-G
Audio Device(s) Kingston HyperX Cloud Stinger Core 7.1 Wireless PC
Power Supply Aerocool KCAS-500W
Mouse EVGA X15
Keyboard VSG Alnilam
Software Windows 11
I bet Celeron and Pentium won't get it as always.
 
Joined
Aug 13, 2009
Messages
3,188 (0.59/day)
Location
Czech republic
Processor Ryzen 5800X
Motherboard Asus TUF-Gaming B550-Plus
Cooling Noctua NH-U14S
Memory 32GB G.Skill Trident Z Neo F4-3600C16D-32GTZNC
Video Card(s) Sapphire Radeon Rx 580 Nitro+ 8GB
Storage HP EX950 512GB + Samsung 970 PRO 1TB
Display(s) HP Z Display Z24i G2
Case Fractal Design Define R6 Black
Audio Device(s) Creative Sound Blaster AE-5
Power Supply Seasonic PRIME Ultra 650W Gold
Mouse Roccat Kone AIMO Remastered
Software Windows 10 x64
Did someone say Intel will do a paper launch of the next generation of CPUs this year? Coffee Lake has JUST been released.
 

2901BitSlice

New Member
Joined
Sep 7, 2017
Messages
11 (0.00/day)
Here is a screen shot of a leaked table of potential Intel CPUs. This came out of China and there are spelling errors. 'cores/treads' the H got lost.

upload_2017-10-20_14-39-9.png
 
Joined
Sep 15, 2011
Messages
6,465 (1.41/day)
Processor Intel® Core™ i7-13700K
Motherboard Gigabyte Z790 Aorus Elite AX
Cooling Noctua NH-D15
Memory 32GB(2x16) DDR5@6600MHz G-Skill Trident Z5
Video Card(s) ZOTAC GAMING GeForce RTX 3080 AMP Holo
Storage 2TB SK Platinum P41 SSD + 4TB SanDisk Ultra SSD + 500GB Samsung 840 EVO SSD
Display(s) Acer Predator X34 3440x1440@100Hz G-Sync
Case NZXT PHANTOM410-BK
Audio Device(s) Creative X-Fi Titanium PCIe
Power Supply Corsair 850W
Mouse Logitech Hero G502 SE
Software Windows 11 Pro - 64bit
Benchmark Scores 30FPS in NFS:Rivals
So, in a nutshell, don't buy Kaby Lake now, wait for Cannon Lake next year...
That's what I'm keep saying. Especially that we'll get 8 Cores and another new Mobo...
 
Joined
Dec 31, 2009
Messages
19,366 (3.71/day)
Benchmark Scores Faster than yours... I'd bet on it. :)
Theres aways something better around the corner. Take this advice and nobody will ever buy.
 
Top