
AMD Confirms CDNA-Based Radeon Instinct MI100 Coming to HPC Workloads in 2H2020

Raevenlord

News Editor
Mark Papermaster, chief technology officer and executive vice president of Technology and Engineering at AMD, today confirmed that CDNA is on track for release in 2H2020 for HPC workloads. The confirmation was (fittingly) given during Dell EMC's High-Performance Computing Online event. It means AMD is looking at a busy second half of the year, with the Zen 3, RDNA 2 and CDNA product lines all being pushed to market.

CDNA is AMD's next push into the highly lucrative HPC market, and will see the company splitting its GPU architectures by target market. CDNA removes raster graphics hardware, display and multimedia engines, and other associated components from the chip design in a bid to recoup die area for both more processing units and fixed-function tensor compute hardware. The CDNA-based Radeon Instinct MI100 will be fabricated on TSMC's 7 nm node, and will be the first AMD architecture featuring shared memory pools between CPUs and GPUs via the 2nd-gen Infinity Fabric, which should bring both throughput and power consumption improvements to the platform.



 
Isn't CDNA GCN-based?
 
I pity whoever is gonna write software for these. OpenCL? Vulkan Compute?
 
:confused:

If Arcturus MI100 turns out to be a beast, I guess developers will fight each other over who gets to code for it...

Specs, please?
 


Doesn’t work like that. You need a full ecosystem of hardware and software for this kind of GPU-accelerated computing. Very few software developers and end users will use it if it requires deep investment in close-to-the-metal programming. CUDA is successful because Nvidia puts huge effort into polishing the low-level software foundation, making it effortless for developers to work on without being crippled by weird driver bugs.

OpenCL is pretty broken so far with ROCm. Not sure about Vulkan compute.

Hopefully they find some good use for these GPUs
 
Would you say that's mostly due to poor documentation on AMD's part?
 

It's more a lack of investment, but I believe that's because they're a smaller company. For example, while AMD does send sales engineers over to promote their products (which we use in the PDL and HCII), we don't get as much support from them as from NVIDIA, who does send channel reps (basically NVIDIA's developers) to assist with some projects.

AMD needs to invest more time and money into supporting ROCm (and OpenCL).
 
IMO, in a workstation environment Radeon Pro can hold its own. Same as its desktop counterpart: it doesn't give the highest performance, but it offers the best bang for the buck. There's also a whole lot of community support out there providing patches or just workarounds. AMD should put in a major effort, more than just a framework and community support, for CDNA to really take off.
 
but better compute numbers
Per CU, FLOPS are the same. In software ray tracing via compute, like Crytek's ray tracing demo, Navi 10 beats the Radeon VII.
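
To put a rough number on the per-CU point: theoretical peak FP32 throughput is usually estimated as CUs x 64 lanes x 2 FLOPs (one fused multiply-add) x clock, and both GCN and RDNA have 64 lanes per CU. A minimal sketch, assuming an illustrative 1.8 GHz clock; the function name and the clock are made up for the example, not measured specs:

```python
def peak_fp32_tflops(compute_units: int, clock_ghz: float, lanes_per_cu: int = 64) -> float:
    """Theoretical peak FP32 TFLOPS, counting one FMA (2 FLOPs) per lane per clock."""
    return compute_units * lanes_per_cu * 2 * clock_ghz / 1000.0


# Assumed clock, for illustration only; real boost clocks differ per card.
ASSUMED_CLOCK_GHZ = 1.8

# One CU gives the same theoretical figure on either architecture at equal clock.
per_cu = peak_fp32_tflops(1, ASSUMED_CLOCK_GHZ)
print(f"per CU at {ASSUMED_CLOCK_GHZ} GHz: {per_cu:.4f} TFLOPS")
```

This is only the paper number. It says nothing about how well either architecture keeps those lanes busy, which is where the latency discussion comes in.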

Both RDNA and GCN execute wave64 compute.

Figure 3 (bottom of page 5) shows four lines of shader instructions being executed on GCN versus RDNA in Wave32 or “backwards-compatible” Wave64 mode.
Vega takes 12 cycles to complete the sequence on a GCN SIMD. Navi in Wave32 (optimized code) completes it in 7 cycles.
In backwards-compatible mode (code optimized for GCN Wave64), Navi completes it in 8 cycles.
So even on code optimized for GCN, Navi is faster, but more performance can be extracted by optimizing for Navi.
Lower latency, and no wasted clock cycles.
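
For a sense of scale, here is a quick sketch of the ratios implied by those cycle counts. It only compares latency for that one four-instruction sequence and assumes equal clocks, so it is not an overall performance figure; the dictionary labels and function name are made up for the example:

```python
# Cycle counts quoted above for the same four-instruction shader sequence.
cycles = {
    "GCN Wave64 (Vega)": 12,
    "RDNA Wave32": 7,
    "RDNA Wave64 (compat)": 8,
}


def speedup_over(baseline: str, other: str) -> float:
    """How many times fewer cycles `other` needs than `baseline`."""
    return cycles[baseline] / cycles[other]


for name in ("RDNA Wave32", "RDNA Wave64 (compat)"):
    print(f"{name}: {speedup_over('GCN Wave64 (Vega)', name):.2f}x")
```

That works out to roughly 1.71x for Wave32 and 1.50x for backwards-compatible Wave64 on this particular sequence.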


GCN parts such as "Vega 20" support 64-bit FP.

RDNA still executes the GCN instruction set, with lower latency.
 