
AMD Scores Another EPYC Win in Exascale Computing With DOE's "El Capitan" Two-Exaflop Supercomputer

But can it multi-virtual machine run crysis?
With raytracing!

Let's see what resources are put into ROCm now that AMD has some income to fund development. Nvidia has many years' (a decade's) lead with its better fleshed-out ecosystem. With NN/AI, DNN/DL ops will feature heavily in upcoming IHV releases.
Today that's not the case. There are GPGPU APIs that can do the same and have an expansive feature set and ecosystem. Heck, before yesterday I didn't even know that OpenMP had already implemented GPU offloading (the last time I tinkered with it was 5-6 years ago).
The main reason CUDA has ruled HPC and GPGPU compute in general is that it's fast. The other aspects are just a consequence of that.
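Speaking of OpenMP offloading, here's a minimal sketch of what it looks like nowadays (the loop and sizes are made up; it needs a compiler built with offload support, e.g. clang++ -fopenmp -fopenmp-targets=nvptx64 or nvc++ -mp=gpu):

```cpp
#include <vector>
#include <cstdio>

int main() {
    const int n = 1 << 20;
    std::vector<float> x(n, 1.0f), y(n, 2.0f);
    const float a = 3.0f;

    float* xp = x.data();
    float* yp = y.data();

    // Map the arrays to the device, run the loop there, copy y back.
    #pragma omp target teams distribute parallel for \
        map(to: xp[0:n]) map(tofrom: yp[0:n])
    for (int i = 0; i < n; ++i)
        yp[i] = a * xp[i] + yp[i];

    std::printf("y[0] = %f\n", y[0]);  // expect 5.0
    return 0;
}
```

If no device is available the target region just falls back to running on the host, which is part of the appeal.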
 
They'll have so many of those new EPYCs, surely they won't notice one is missing, right? Cause I need it... ;)
 
CUDA is more for companies like mine, where we have 10 people and make biomedical imaging devices. CUDA helps us speed up the image reconstruction on the GPU versus the CPU. We are too small to make our own APIs. Giant supercomputer projects have custom, tailor-made software.
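For a rough idea of what that looks like in practice, here's a minimal sketch (not our real code; sizes and data are made up): a 2D FFT, a common step in filtered back-projection style reconstruction, run through NVIDIA's cuFFT library instead of on the CPU.

```cpp
#include <cufft.h>
#include <cuda_runtime.h>
#include <vector>
#include <cstdio>

int main() {
    const int nx = 512, ny = 512;                  // hypothetical image size
    const cufftComplex one{1.0f, 0.0f};
    std::vector<cufftComplex> img(nx * ny, one);

    cufftComplex* d_img = nullptr;
    cudaMalloc(&d_img, sizeof(cufftComplex) * nx * ny);
    cudaMemcpy(d_img, img.data(), sizeof(cufftComplex) * nx * ny,
               cudaMemcpyHostToDevice);

    // One in-place 2D FFT of the whole image, executed on the GPU.
    cufftHandle plan;
    cufftPlan2d(&plan, nx, ny, CUFFT_C2C);
    cufftExecC2C(plan, d_img, d_img, CUFFT_FORWARD);

    cudaMemcpy(img.data(), d_img, sizeof(cufftComplex) * nx * ny,
               cudaMemcpyDeviceToHost);
    std::printf("DC term: %f\n", img[0].x);        // expect nx*ny = 262144

    cufftDestroy(plan);
    cudaFree(d_img);
    return 0;
}
```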

Completely agree. Highly specialized software for these large-scale computations is probably optimized down to the lowest available level, like PTX for Nvidia and assembly for AMD. The truth is, not a whole lot of the critical software paths there are actually going to be written in CUDA or OpenCL.

Why do you keep saying CUDA is a locked-in ecosystem? You can run CUDA code on other hardware (even on x86 and ARM, if you're desperate) using HIP through ROCm, but you need to translate it (an automated conversion, not a manual one) and avoid any NVIDIA-specific extensions. This is currently a lot more efficient than what can be done in OpenCL 2.1.
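To make the translation point concrete, here's a minimal sketch of a trivial SAXPY kernel in HIP, roughly what hipify-perl spits out from the equivalent CUDA source (kernel and sizes are made up; it builds with hipcc, and on NVIDIA hardware the HIP calls just map back onto CUDA):

```cpp
#include <hip/hip_runtime.h>
#include <vector>
#include <cstdio>

// Same shape as the CUDA original: cudaMalloc -> hipMalloc, <<<>>> launch
// -> hipLaunchKernelGGL, the kernel body itself is unchanged.
__global__ void saxpy(int n, float a, const float* x, float* y) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) y[i] = a * x[i] + y[i];
}

int main() {
    const int n = 1 << 20;
    std::vector<float> x(n, 1.0f), y(n, 2.0f);

    float *dx, *dy;
    hipMalloc(&dx, n * sizeof(float));
    hipMalloc(&dy, n * sizeof(float));
    hipMemcpy(dx, x.data(), n * sizeof(float), hipMemcpyHostToDevice);
    hipMemcpy(dy, y.data(), n * sizeof(float), hipMemcpyHostToDevice);

    hipLaunchKernelGGL(saxpy, dim3((n + 255) / 256), dim3(256), 0, 0,
                       n, 3.0f, dx, dy);

    hipMemcpy(y.data(), dy, n * sizeof(float), hipMemcpyDeviceToHost);
    std::printf("y[0] = %f\n", y[0]);  // expect 5.0

    hipFree(dx);
    hipFree(dy);
    return 0;
}
```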

CUDA really is a locked ecosystem, even for customers of Nvidia hardware. For example, their ISA isn't open to the public, and there are instances where, no matter what you write in CUDA or directly in PTX, it will never be as fast as the hardware is capable of. Nvidia reserves the highest level of optimization for itself, so to get the most out of the hardware you purchased you either have to use a library that was hand-optimized by Nvidia, or, if there isn't one for the sort of thing you need to do, tough luck. If that's not a locked ecosystem then I don't know what is.
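Which is also why so much "CUDA code" in practice is just calls into NVIDIA's hand-tuned libraries rather than kernels you write yourself. A rough sketch of the usual pattern (sizes made up), using cuBLAS for a matrix multiply that typically beats what most of us can write by hand in plain CUDA or PTX:

```cpp
#include <cublas_v2.h>
#include <cuda_runtime.h>
#include <vector>
#include <cstdio>

int main() {
    const int n = 1024;                        // hypothetical square matrices
    std::vector<float> a(n * n, 1.0f), b(n * n, 2.0f), c(n * n, 0.0f);

    float *da, *db, *dc;
    cudaMalloc(&da, n * n * sizeof(float));
    cudaMalloc(&db, n * n * sizeof(float));
    cudaMalloc(&dc, n * n * sizeof(float));
    cudaMemcpy(da, a.data(), n * n * sizeof(float), cudaMemcpyHostToDevice);
    cudaMemcpy(db, b.data(), n * n * sizeof(float), cudaMemcpyHostToDevice);

    // C = alpha*A*B + beta*C, executed by NVIDIA's hand-optimized kernels.
    cublasHandle_t handle;
    cublasCreate(&handle);
    const float alpha = 1.0f, beta = 0.0f;
    cublasSgemm(handle, CUBLAS_OP_N, CUBLAS_OP_N, n, n, n,
                &alpha, da, n, db, n, &beta, dc, n);

    cudaMemcpy(c.data(), dc, n * n * sizeof(float), cudaMemcpyDeviceToHost);
    std::printf("c[0] = %f\n", c[0]);          // expect 2.0 * n = 2048

    cublasDestroy(handle);
    cudaFree(da); cudaFree(db); cudaFree(dc);
    return 0;
}
```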
 
They picked the cheapest-but-good-enough options, not the absolute highest-performing ones.
As if they could use Xeons on a DOE supercomputer; nearly double the power use on a DOE system would go down really well.
Fujitsu's A64FX chip seems like a contender, but not Intel.
As for the GPU choice, perhaps they see something in the next generation of chips that we have not yet seen. They are not comparing chips that are already out, are they? No, it's chips yet to be made.
 
Great, now we can figure out how to obliterate everyone on the planet even faster & moar better than before...get ready, 'cause the end times are now upon us !

$600 million is fairly massive

Not by government spending standards, seein' as how they're spending OUR money, not theirs :(
 
You're right about that. Corporations create these supercomputers with a major goal in mind, so they would need custom APIs to get to that goal efficiently. But what @xkm1948 is getting at is that CUDA can scale from the basic enthusiast all the way to the [big] corporations that don't have the time (or need) to have a custom API developed for them.

If anything, those same corporations would employ researchers from these universities. :laugh:



Why do you keep saying CUDA is a locked-in ecosystem? You can run CUDA code on other hardware (even on x86 and ARM, if you're desperate) using HIP through ROCm, but you need to translate it (an automated conversion, not a manual one) and avoid any NVIDIA-specific extensions. This is currently a lot more efficient than what can be done in OpenCL 2.1.

The investment in ROCm is an advantage for everyone since all compute APIs will use this. Thank AMD for pulling this off.



They still use Apple because of deals (think 60%+ hardware and support discounts) offered by Apple. Also hardware deployment of Mac minis and Pros depends on department use cases.

Vulkan is aimed at rendering (which is why any GPGPU code using Vulkan goes through the graphics pipeline), and that's why it succeeds OpenGL. OpenCL is meant for GPGPU use.

Oh I know Apple gives universities crazy prices. It's a great way to keep up demand once students become workers.

I'm so deep in studying Latin and writing papers on Greek and Roman epics that my brain is melting. I really should focus; my posts are suffering because of that.

It's amazing how this stuff can get so muddied when you are trying to ram different stuff into it.
 
This is the first such exascale contract where AMD is the sole purveyor of both CPUs and GPUs, with AMD's other design win with EPYC in the Cray Shasta being paired with NVIDIA graphics cards.
@Raevenlord This is the 2nd AMD win for exascale computing where both the CPU and GPU are from AMD. The 1st one was called Frontier.
 
@Raevenlord This is the 2nd AMD win for exascale computing where both the CPU and GPU are from AMD. The 1st one was called Frontier.
That system uses 40 MW at 1.5 exaflops, i.e. about 26.7 MW per exaflop. The FastForward 2 project aims at 20 MW at 1 exaflop, so that's roughly 33% higher power per exaflop than the target.
 
Great, now we can figure out how to obliterate everyone on the planet even faster & moar better than before...get ready, 'cause the end times are now upon us !
If the new supercomputer were built with 5 GHz Xeons and GTX 480s, then the Govt. could have obliterated us just by turning the computer 'On'.
 