AMD Confirms CDNA-Based Radeon Instinct MI100 Coming to HPC Workloads in 2H2020

Raevenlord · Jun 17, 2020

Mark Papermaster, chief technology officer and executive vice president of Technology and Engineering at AMD, today confirmed that CDNA is on track for release in 2H2020 for HPC workloads. The confirmation was (fittingly) given during Dell's EMC High-Performance Computing Online event. This confirms that AMD is looking at a busy second half of the year, with the Zen 3, RDNA 2, and CDNA product lines all being pushed to market.

CDNA is AMD's next push into the highly lucrative HPC market, and will see the company split its GPU architectures by target market. CDNA will see raster graphics hardware, display and multimedia engines, and other associated components removed from the chip design in a bid to recoup die area for both more processing units and fixed-function tensor compute hardware. The CDNA-based Radeon Instinct MI100 will be fabricated on TSMC's 7 nm node, and will be the first AMD architecture featuring shared memory pools between CPUs and GPUs via 2nd-gen Infinity Fabric, which should bring both throughput and power consumption improvements to the platform.


cucker tarlson · Jun 17, 2020

isn't cdna gcn based ?

Fouquin · Jun 17, 2020

cucker tarlson said:
isn't cdna gcn based ?

Should be. Distilled down and rebuilt to amplify FP compute.

Aldain · Jun 17, 2020

cucker tarlson said:
isn't cdna gcn based ?

no

Deleted member 50521 · Jun 17, 2020

I pity whoever is gonna write software for these. OpenCL? Vulkan Compute?

ARF · Jun 17, 2020

xkm1948 said:
I pity whoever is gonna write software for these. OpenCL? Vulkan Compute?

If Arcturus MI100 turns out to be a beast, I guess developers will fight each other over who gets to code for it...

Specs please ?

Deleted member 50521 · Jun 17, 2020

ARF said:
If Arcturus MI100 turns out to be a beast, I guess developers will fight each other over who gets to code for it...

Specs please ?

Doesn’t work like that. You need a full ecosystem of hardware and software for GPU-accelerated computing. Very few software developers and end users will use it if it requires deep investment in close-to-the-metal programming. CUDA is successful because Nvidia puts huge effort into polishing the low-level software foundation, making it effortless for developers to work on without being crippled by weird driver bugs.

OpenCL is pretty broken so far with ROCm. Not sure about Vulkan compute.

Hopefully they find some good use for these GPUs

Aquinus · Jun 17, 2020

xkm1948 said:
CUDA is successful because Nvidia puts huge effort into polishing the low-level software foundation, making it effortless for developers to work on without being crippled by weird driver bugs.

Would you say that's mostly due to poor documentation on AMD's part?

Deleted member 50521 · Jun 18, 2020

Aquinus said:
Would you say that's mostly due to poor documentation on AMD's part?

Generally, a lack of investment on the software side.

ValenOne · Jun 18, 2020

cucker tarlson said:
isn't cdna gcn based ?

GCN has inferior branch and instruction retirement latency performance when compared to RDNA.

Cheeseball · Jun 18, 2020

Aquinus said:
Would you say that's mostly due to poor documentation on AMD's part?

It's more a lack of investment, but I believe that's because they're a smaller company. For example, while AMD does send sales engineers over to promote their products (which we use in the PDL and HCII), we don't get as much support from them compared to NVIDIA, who does send channel reps (basically NVIDIA's developers) to assist with some projects.

AMD needs to invest more time and money into supporting ROCm (and OpenCL).

cucker tarlson · Jun 18, 2020

rvalencia said:
GCN has inferior branch and instruction retirement latency performance when compared to RDNA.

but better compute numbers

1d10t · Jun 18, 2020

IMO, in a workstation environment, Radeon Pro can hold its own. Same as its desktop counterpart: it doesn't give the highest performance, but it offers the best bang for the buck. There's also a whole lot of community support out there providing patches or workarounds. AMD needs to put in a major effort, beyond just a framework and community support, for CDNA to really take off.

ValenOne · Jun 23, 2020

cucker tarlson said:
but better compute numbers

Per CU count, FLOPS is the same. In software ray tracing via compute, such as Crytek's ray tracing demo, Navi 10 beats the Radeon VII.

Both RDNA and GCN execute wave64 compute.
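To put a number on the "same FLOPS per CU" point, here is a minimal sketch. It assumes the commonly documented layouts of a GCN CU as 4 x SIMD16 and an RDNA CU as 2 x SIMD32, and counts one fused multiply-add as two floating-point operations. The function names and the 1.0 GHz clock are illustrative only, not the spec of any real card.

```python
# Sketch: peak FP32 throughput per compute unit (CU), GCN vs RDNA.
# Assumed layout: GCN CU = 4 x SIMD16, RDNA CU = 2 x SIMD32.
# The clock used below is an illustrative value, not a real card's spec.

FLOPS_PER_FMA = 2  # one fused multiply-add counts as two floating-point ops

CU_LAYOUT = {
    "GCN": {"simds": 4, "lanes_per_simd": 16},
    "RDNA": {"simds": 2, "lanes_per_simd": 32},
}


def lanes_per_cu(arch):
    """Number of FP32 lanes in one CU of the given architecture."""
    layout = CU_LAYOUT[arch]
    return layout["simds"] * layout["lanes_per_simd"]


def peak_fp32_gflops(arch, compute_units, clock_ghz):
    """Theoretical peak FP32 GFLOPS: CUs x lanes x 2 (FMA) x clock in GHz."""
    return compute_units * lanes_per_cu(arch) * FLOPS_PER_FMA * clock_ghz


if __name__ == "__main__":
    for arch in CU_LAYOUT:
        # Same CU count and clock for both, so only the layout differs.
        print(arch, lanes_per_cu(arch), peak_fp32_gflops(arch, 1, 1.0))
```

Both layouts come out to 64 lanes per CU, so at equal CU count and clock the theoretical peak is identical. Any real-world difference comes from latency and utilization, not from the peak figure.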

Read

https://www.reddit.com/r/Amd/comments/ctfbem

Figure 3 (bottom of page 5) shows 4 lines of shader instructions being executed in GCN, vs RDNA in Wave32 or “backwards compatible” Wave64.
Vega takes 12 cycles to complete the instruction on a GCN SIMD. Navi in Wave32 (optimized code) completes it in 7 cycles.
In backwards compatible (optimized for GCN Wave64) mode, Navi completes it in 8 cycles.
So even on code optimized for GCN, Navi is faster, but more performance can be extracted by optimizing for Navi.
Lower latency, and no wasted clock cycles.
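
The cycle counts quoted above can be turned into relative speedups with a few lines of arithmetic. This is only a sketch for that one four-instruction example. The dictionary keys and function names are made up for illustration, and the result says nothing about whole-workload performance.

```python
# Cycle counts quoted above for the four-instruction shader example.
CYCLES = {
    "GCN wave64": 12,   # Vega, one GCN SIMD
    "RDNA wave32": 7,   # Navi, code optimized for wave32
    "RDNA wave64": 8,   # Navi, backwards-compatible wave64 mode
}


def speedup(baseline, candidate):
    """How many times faster `candidate` finishes than `baseline`."""
    return CYCLES[baseline] / CYCLES[candidate]


def cycles_saved(baseline, candidate):
    """Absolute number of cycles saved by `candidate` over `baseline`."""
    return CYCLES[baseline] - CYCLES[candidate]


if __name__ == "__main__":
    for mode in ("RDNA wave64", "RDNA wave32"):
        print(mode,
              round(speedup("GCN wave64", mode), 2),
              cycles_saved("GCN wave64", mode))
```

For this example, Navi in wave64 mode finishes 1.5 times faster than Vega (4 cycles saved), and in wave32 mode roughly 1.71 times faster (5 cycles saved).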

GCN, such as "Vega 20", supports 64-bit FP.

RDNA still executes the GCN instruction set, with lower latency.

