
AMD Radeon Instinct MI100 "Arcturus" Hits the Radar, We Have its BIOS

ppn

Joined
Aug 18, 2015
Messages
614 (0.37/day)
700mm2 on 7nm EUV... damn.
Imagine this thing on 14nm, you'd have 3 working dies per wafer :laugh:
ASML
"...new maximum die size of 429 mm². Say goodbye to the massive dies we got used to from Intel and Nvidia. ...",

Vega 20: Transistors 13,230 million, Density 40.0M / mm², 7 nm, Die Size 331 mm²

MI100: Transistors 29,000 million, Density 41.4M / mm², 7 nm+, Die Size 700 mm²

Same density as first-gen 7 nm. Real EUV should be 18% denser, so a 505 mm² Navi 21 would shrink to almost exactly 429 mm².
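For anyone who wants to check those numbers, here is a quick Python sketch. It only uses the transistor counts and die sizes quoted above (the MI100 figures are speculated database values, not confirmed specs) and treats "18% denser" as a plain 1.18x density factor, which is an assumption on my part.

```python
# Figures quoted in the post above; the MI100 entries are speculated, not confirmed.
CHIPS = {
    "Vega 20": {"transistors_m": 13_230, "die_mm2": 331},
    "MI100": {"transistors_m": 29_000, "die_mm2": 700},
}


def density(transistors_m: float, die_mm2: float) -> float:
    """Millions of transistors per mm^2."""
    return transistors_m / die_mm2


def shrunk_area(die_mm2: float, density_gain: float) -> float:
    """Area of the same design if density improves by density_gain (0.18 = 18%)."""
    return die_mm2 / (1.0 + density_gain)


for name, chip in CHIPS.items():
    print(f"{name}: {density(**chip):.1f} M/mm^2")

print(f"505 mm^2 at +18% density: {shrunk_area(505, 0.18):.0f} mm^2")
```

With a strict 1.18x factor the 505 mm² die comes out at about 428 mm², a hair under the 429 mm² limit.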
 

1idd0kun

New Member
Joined
Feb 10, 2020
Messages
2 (0.12/day)
In real life, Nvidia dominated GPU computing before they added Tensors - all thanks to a better ecosystem and support. And nothing changed here.
Even if Mi100 temporarily pulls ahead in performance, it won't be enough.
Well, I think it's a matter of whether their software stack gets some interest from developers. Having key supercomputers (like Frontier) using their hardware will help in that department.

Also, AMD already has a MI200 in the works. MI100 looks like it's gonna be pretty powerful, but MI200 will be even better, and is probably what they're gonna deploy in Frontier.
 
Joined
Nov 15, 2016
Messages
338 (0.28/day)
System Name Sillicon Nightmares
Processor Intel i5 4690K (4.7GHZ 1.372vcore, 4.0GHZ 1.192vring, VCCIN 1.84v)
Motherboard Asrock Fatal1ty Z97X Killer BIOS P2.40
Cooling DEEPCOOL Gamer Storm CAPTAIN 360
Memory 2x4GB G.Skill Ripjaws V 2400mhz 10-14-13-32 2T, vDIMM 1.775v
Video Card(s) ASUS GTX 1060 Strix 6GB OC, Core: 2190mhz, Vcore: 1.075v, Mem: 4909mhz (Sillicon Lottery Jackpot)
Storage Samsung 840 EVO 1TB SSD, WD Blue 1TB
Display(s) BenQ XL2430 1080p 144HZ + (2) Samsung SyncMaster 913v 1280x1024 75HZ + A Shitty TV For Movies
Case Deepcool Genome ROG Edition
Audio Device(s) Bunta Sniff Speakers From The Tip Edition
Power Supply Corsair CX430M + CableMod Cables
Mouse Logitech G602
Keyboard Shitty Dell Office Keyboard
Software Windows 7 Ultimate 64 Bit
Benchmark Scores 12 101 Firestrike (3rd for my hardware)
only amd can make a gpu so bad at being a gpu that it has no graphics processing capability
 
Joined
Jan 8, 2017
Messages
4,751 (4.15/day)
System Name Good enough
Processor AMD Ryzen R7 1700X - 4.0 Ghz / 1.350V
Motherboard ASRock B450M Pro4
Cooling Scythe Katana 4 - 3x 120mm case fans
Memory 16GB - Corsair Vengeance LPX
Video Card(s) OEM Dell GTX 1080
Storage 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) 4K Samsung TV
Case Zalman R1
Power Supply 500W
I looked at NVIDIA Quadro RTX 8000 specs and it says over 200 TOPS at INT8. Is there really such a huge difference between Radeon and RTX, or does Nvidia count differently?
They use different ratios of execution units. Everything is a trade-off: MI60 has a lot of FP64 units, Turing doesn't, and Volta does but has no RT cores. The thing is, though, INT8/INT4/FP16 aren't that critical.

Out of all of those, FP64 units have become indispensable. They used to be very expensive in power and die area, which is why GPUs of the past skimped on them, but any real compute accelerator nowadays needs strong FP64 performance. 64-bit floating point is the de facto precision for simulations and that sort of work; in a data center environment you can do without tensor cores or INT8/INT4/FP16, but not without FP64. That's why there have been no large Turing-based Teslas: no one would have wanted them because of their poor FP64 performance.
 
Last edited by a moderator:
Joined
Jun 28, 2016
Messages
3,268 (2.44/day)
Hehe, they seem to follow the motto: "Bad product is still better than no product"
They've signed a few large contracts and they have to deliver a GPGPU accelerator. It doesn't have to be the best available. It only has to match the agreed specification.
 

ARF

Joined
Jan 28, 2020
Messages
79 (2.72/day)
That's not AMD's spec page. It's Techpowerup's speculated spec list.
When a user with the Arcturus card ran GPU-Z and submitted the BIOS to the database? Well, I don't think so.
I think there is a way for the software to read the specifications and put them into the database.

I smell $1000 Gaming GPU's this year by AMD...
No way, I smell much lower prices, after all, AMD has to regain some mindshare and lost positions.

ASML
"...new maximum die size of 429 mm². Say goodbye to the massive dies we got used to from Intel and Nvidia. ...",

Vega 20: Transistors 13,230 million, Density 40.0M / mm², 7 nm, Die Size 331 mm²

MI100: Transistors 29,000 million, Density 41.4M / mm², 7 nm+, Die Size 700 mm²

Same density as first-gen 7 nm. Real EUV should be 18% denser, so a 505 mm² Navi 21 would shrink to almost exactly 429 mm².
Probably a fake limit of only 429 mm². That would mean bye-bye to enthusiast video cards.
 

T4C Fantasy

CPU & GPU DB Maintainer
Staff member
Joined
May 7, 2012
Messages
2,373 (0.83/day)
Location
Rhode Island
System Name Whaaaat Kiiiiiiid!
Processor Intel Core i9-9900K @ 5.0GHz
Motherboard Gigabyte Z390 AORUS Ultra
Cooling Corsair H150i AIO Cooler
Memory Corsair Dominator Platinum 16GB DDR4-3200
Video Card(s) Zotac GeForce RTX 2080 Ti Triple Fan @ 2040MHz
Storage Samsung 970 PRO 512GB + Crucial MX300 512GB + OCZ Vertex 4 256GB
Display(s) 27" LG 27MU67-B 4K, + 27" Acer Predator XB271HU 1440P
Case Thermaltake Core X9 Snow
Audio Device(s) Logitech G933 Headset
Power Supply SeaSonic Platinum 1050W Snow Silent
Mouse Logitech G900
Keyboard Logitech G910
Software Windows 10 Pro
Benchmark Scores FFXV: 19329
ASML
"...new maximum die size of 429 mm². Say goodbye to the massive dies we got used to from Intel and Nvidia. ...",

Vega 20: Transistors 13,230 million, Density 40.0M / mm², 7 nm, Die Size 331 mm²

MI100: Transistors 29,000 million, Density 41.4M / mm², 7 nm+, Die Size 700 mm²

Same density as first-gen 7 nm. Real EUV should be 18% denser, so a 505 mm² Navi 21 would shrink to almost exactly 429 mm².
So this would be first-gen 7 nm reaching 500+ mm²? Or is having 8192 cores made possible by losing the raster engines?

When a user with the Arcturus card ran GPU-Z and submitted the BIOS to the database? Well, I don't think so.
I think there is a way for the software to read the specifications and put them into the database.

No way, I smell much lower prices, after all, AMD has to regain some mindshare and lost positions.

Probably a fake limit of only 429 mm². That would mean bye-bye to enthusiast video cards.
The BIOS doesn't show any die information. All of this is speculation by me and will be updated when new info comes out.
 
Joined
Nov 24, 2017
Messages
633 (0.77/day)
Location
Asia
Processor Intel Core i5 4590
Motherboard Gigabyte Z97x Gaming 3
Cooling Intel Stock Cooler
Memory 8GB(2x4GB) DDR3-800MHz [1600MT/s]
Video Card(s) XFX RX 560 4GB
Storage Transcend SSD370S 128GB; Toshiba DT01ACA100 1TB
Display(s) Samsung S20D300 20" 768p TN
Case Delux DLC-MV888
Audio Device(s) Realtek ALC1150
Power Supply Corsair VS450
Mouse A4Tech N-70FX
Software Windows 10 Pro
Benchmark Scores BaseMark GPU : 250 Point

evolucian911

New Member
Joined
Mar 31, 2019
Messages
11 (0.03/day)
Theoretically, and based on V7 pro performance, a version of this card for pros would be insane. To call it a monster would be an understatement. On top of that, since it's Vega, they could add the ability to use M.2 as extra RAM, like with the last Vega-based card. I'm certain this card is coming. They just need to focus and market it for that segment and not for gaming like before. Call it the FirePro X. This specific version, though, is obviously just for compute, and even so it is beyond anything on the market by far.
 
Joined
Dec 10, 2014
Messages
502 (0.26/day)
Location
Nowy Warsaw
System Name SYBARIS
Processor AMD Ryzen™ 5 3600
Motherboard MSI Arsenal Gaming B450 Tomahawk
Cooling Cryorig H7 Quad Lumi
Memory Team T-Force Delta RGB 2x8GB 3200CL16
Video Card(s) Colorful GeForce RTX 2060 6G V2
Storage WD Black WD1003FZEX 1TB + Crucial MX500 500GB
Display(s) LG 22MP68VQ-P 22" 75hz IPS
Case In Win Mana 136
Audio Device(s) HyperX Cloud X | iVOOMi iVO-169SUFBT 2.1
Power Supply Cooler Master G550M
Mouse Logitech G102 Prodigy | Logitech G402 Hyperion Fury
Keyboard Fantech MK871 RGB TKL Outemu Blue mechanical keyboard
Software Windows 10 Education 1909 x64
Let me set this straight. It's like taking the passenger/driver cabin and trunk space out of a Tesla and filling it with battery and a more powerful motor (delete the raster capability, basically the little doo-dah that shows stuff on your monitor, and fill the space with CUs). Even the steering system goes, because this Tesla will just go straight in a highly specialized environment (only AI/ML acceleration, unlike GPUs, which are usually jacks of all trades).
Except the extra battery and motor wouldn't be used for faster 0-60 and top speed but for hauling more stuff, so think a Tesla Semi or pickup (not for gaming but for workstations only).

Some things you get after reading this thread:
It's not a gaming card at all. It's a workstation card through and through. But people would like a gaming card with this spec.
People don't understand that with the raster stuff added back in, this thing would become MASSIVE.
AMD also has brain-dead haters for their products who don't even understand what they're hating.

My personal take is that this is a good development. I've long maintained that AMD should have different architectures for different markets instead of "jack of all trades, master of none". Now that the CPU business is making them money, I hope they spend it cleverly on R&D for RTG.
 

evolucian911

New Member
Joined
Mar 31, 2019
Messages
11 (0.03/day)
This is why I said a "version" of this card for pros, which would obviously mean raster engines, among other things, and inevitably a higher TDP.
 
Joined
Nov 4, 2005
Messages
10,499 (2.01/day)
System Name MoFo 2
Processor AMD PhenomII 1100T @ 4.2Ghz
Motherboard Asus Crosshair IV
Cooling Swiftec 655 pump, Apogee GT,, MCR360mm Rad, 1/2 loop.
Memory 8GB DDR3-2133 @ 1900 8.9.9.24 1T
Video Card(s) HD7970 1250/1750
Storage Agility 3 SSD 6TB RAID 0 on RAID Card
Display(s) 46" 1080P Toshiba LCD
Case Rosewill R6A34-BK modded (thanks to MKmods)
Audio Device(s) ATI HDMI
Power Supply 750W PC Power & Cooling modded (thanks to MKmods)
Software A lot.
Benchmark Scores Its fast. Enough.
Hehe, they seem to follow the motto: "Bad product is still better than no product"
Your poor attempt at trolling and fanboyism is honestly the coolest thing ever, ohhh baby you make me so excited for sucking Nvidia off with you......

A 700 mm² die at 1 GHz plus, and an interposer to hold it and the HBM? I imagine yield loss is huge, but maybe they have perfected it so it's actually profitable. Now when do we get to see actual performance numbers?
 
Joined
Jun 28, 2016
Messages
3,268 (2.44/day)
Or they have no choice - they need to make such a chip (PoC).
 
Joined
Apr 30, 2011
Messages
1,525 (0.47/day)
Location
Greece
Processor AMD FX-8350 4GHz@1.3V
Motherboard Gigabyte GA-970A UD3 Rev3.0
Cooling Zalman CNPS9X Optima
Memory 4*4GB DDR3 1600MHz CL9
Video Card(s) Sapphire Radeon RX 5700 Pulse 8GB
Storage Sandisk SSD 120GB, Samsung F1 1TB, Hitachi HUS724040ALE640 4TB
Display(s) LG IPS235
Case Zalman Neo Z9 Black
Audio Device(s) Via 7.1 onboard
Power Supply Be Quiet Pure Power 11 600W
Mouse Sharkoon SHARK Force Black
Keyboard Trust GXT280
Software Win 7 sp1 64bit/Win 10 pro 64bit
Benchmark Scores CB R15 64bit: single core 99p, multicore 647p WPrime 1.55 (8 cores): 9.0 secs
It's a shock for many that it has 128 CUs, as they couldn't accept that big Navi would have even 80 CUs. But AMD is on track now: only their marketing is inferior, while their products are top level. Even their latest GPU drivers for the Navi arch are improved (the biggest customer problem for the last 5-6 months). An impressive feat nevertheless, especially for 200 W. Big Navi can now easily become the fastest GPU by some distance while using under 300 W. It seems that 7nm+ helps a lot with efficiency.
 
Joined
Sep 15, 2007
Messages
3,765 (0.83/day)
Location
Police/Nanny State of America
System Name More hardware than I use :|
Processor 4.7 8350 - 4.2 4560K - 4.4 4690K
Motherboard Sabertooth R2.0 - Gigabyte Z87X-UD4H-CF - AsRock Z97M KIller
Cooling Mugen 2 rev B push/pull - Hyper 212+ push/pull - Hyper 212+
Memory 16GB Gskill - 8GB Gskill - 16GB Ballistix 1.35v
Video Card(s) Xfire OCed 7950s - Powercolor 290x - Oced Zotac 980Ti AMP! (also have two 7870s)
Storage Crucial 250GB SSD, Kingston 3K 120GB, Sammy 1TB, various WDs, 13TB (actual capactity) NAS with WDs
Display(s) X-star 27" 1440 - Auria 27" 1440 - BenQ 24" 1080 - Acer 23" 1080
Case Lian Li open bench - Fractal Design ARC - Thermaltake Cube (still have HAF 932 and more ARCs)
Audio Device(s) Titanium HD - Onkyo HT-RC360 Receiver - BIC America custom 5.1 set up (and extra Klipsch sub)
Power Supply Corsair 850W V2 - EVGA 1000 G2 - Seasonic 500 and 600W units (dead 750W needs RMA lol)
Mouse Logitech G5 - Sentey Revolution Pro - Sentey Lumenata Pro - multiple wireless logitechs
Keyboard Logitech G11s - Thermaltake Challenger
Software I wish I could kill myself instead of using windows (OSX can suck it too).
Your poor attempt at trolling and fanboyism is honestly the coolest thing ever, ohhh baby you make me so excited for sucking Nvidia off with you......

A 700 mm² die at 1 GHz plus, and an interposer to hold it and the HBM? I imagine yield loss is huge, but maybe they have perfected it so it's actually profitable. Now when do we get to see actual performance numbers?
No doubt cut-down dies will be sold, so I'm sure there's plenty of profit even if only 50% of the dies are fully functional.
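To put a rough number on the yield talk, here is a sketch using the simple Poisson yield model (yield = exp(-area x defect density)). The 0.1 defects/cm² value is purely an assumption for illustration, not a published figure for this process, and the 700 mm² die size is itself speculation from this thread.

```python
import math


def poisson_yield(die_mm2: float, defects_per_cm2: float) -> float:
    """Fraction of defect-free dies under the Poisson model: exp(-area * D0)."""
    area_cm2 = die_mm2 / 100.0  # 100 mm^2 per cm^2
    return math.exp(-area_cm2 * defects_per_cm2)


ASSUMED_D0 = 0.1  # defects per cm^2; hypothetical value, illustration only

for die_mm2 in (331, 700):
    fully_working = poisson_yield(die_mm2, ASSUMED_D0)
    print(f"{die_mm2} mm^2: {fully_working:.0%} of dies fully working")
```

With that assumed defect density, roughly half of the 700 mm² dies come out fully working, and the rest are candidates for cut-down parts. A different defect density would move that figure a lot.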
 
Joined
Feb 19, 2019
Messages
299 (0.80/day)
It's a shock for many that it has 128 CUs, as they couldn't accept that big Navi would have even 80 CUs. But AMD is on track now: only their marketing is inferior, while their products are top level. Even their latest GPU drivers for the Navi arch are improved (the biggest customer problem for the last 5-6 months). An impressive feat nevertheless, especially for 200 W. Big Navi can now easily become the fastest GPU by some distance while using under 300 W. It seems that 7nm+ helps a lot with efficiency.
Maybe this is related to what they have done with the efficiency of the Vega cores in Ryzen 4000.
 

ARF

Joined
Jan 28, 2020
Messages
79 (2.72/day)
The estimate changed; it's not a 700 mm² die any longer.
Radeon VII is already 331 mm², it is physically impossible to fit 2X the shaders with only 6 billion transistors more, and in 420 mm².

420 mm² is impossible with these specs.

Have you got any source that they can manufacture dies larger than 429 mm² on N7+, and any source that this particular chip is on N7+ and not on N7?
 
Joined
Jun 28, 2016
Messages
3,268 (2.44/day)
The comments here, jesus. This GPU can't even render graphics. Do people even realize that? It's a purely AI GPU, which explains the low wattage. It's similar to the 75W Tesla T4, which is purely AI as well while having RTX 2070S specs.
Absolutely NOT.

Obviously, this card can render. It's built around a normal GPU. It just can't provide a video signal - there are no outputs and no logic dedicated for this task.
It can be used in any scenario that can utilize GPGPU (including AI, obviously).

This is NOT similar to Tesla T4.
In the green camp you have the V100, which is an almighty, all-round, dual-slot accelerator. Mi100 (like Mi60 now) will compete in this segment.
Nvidia also makes the Tesla T4, which has half of the V100's Tensor (AI) potential, but just 5% of its double-precision performance. The T4 is single-slot, 1/4th of the V100's price and uses 75W (V100 is up to 300W).

Which means that if you need V100's double-precision (all-round) performance, you buy a V100. You can't go wrong with this card.
But if you don't need it, you buy 2x Tesla T4s: you get pretty much the same performance in stuff like Deep Learning (e.g. image recognition) for half the money and half the power.
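The trade-off can be tallied with the ratios given above. This is only a sketch: everything is normalised to one V100 = 1.0 except power, which is in watts, and it assumes two cards scale perfectly, which real workloads may not.

```python
# Relative figures from the post above, normalised to one V100 = 1.0.
# Power is in watts (V100 up to 300 W, T4 75 W).
V100 = {"tensor": 1.0, "fp64": 1.0, "price": 1.0, "power_w": 300}
T4 = {"tensor": 0.5, "fp64": 0.05, "price": 0.25, "power_w": 75}


def scale(card: dict, count: int) -> dict:
    """Naively sum each metric over `count` identical cards (assumes perfect scaling)."""
    return {key: value * count for key, value in card.items()}


two_t4 = scale(T4, 2)
for key in V100:
    print(f"{key}: 2x T4 = {two_t4[key]:g}, V100 = {V100[key]:g}")
```

Two T4s match the V100 on Tensor throughput at half the price and half the power, but reach only a tenth of its double-precision performance.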
 
Joined
Jun 2, 2017
Messages
2,108 (2.11/day)
System Name Best AMD Computer
Processor AMD TR4 1920X
Motherboard MSI X399 SLI Plus
Cooling Alphacool Eisbaer 420 x2 Noctua XPX Pro TR4 block
Memory Gskill RIpjaws 4 3000MHZ 48GB
Video Card(s) Sapphire Vega 64 Nitro, Gigabyte Vega 64 Gaming OC
Storage 6 x NVME 480 GB, 2 x SSD 2TB, 5TB HDD, 2 TB HDD, 2x 2TB SSHD
Display(s) Acer 49BQ0k 4K monitor
Case Thermaltake Core X9
Audio Device(s) Corsair Void Pro, Logitch Z523 5.1
Power Supply Corsair HX1200!
Mouse Logitech g7 gaming mouse
Keyboard Logitech G510
Software Windows 10 Pro 64 Steam. GOG, Uplay, Origin
Benchmark Scores Firestrike: 24955 Time Spy: 13500
That would be another massive failure just like the Crap-eon 7, but then again, AMD just doesn't seem to tire of failures... :D
Why was the Radeon 7 a failure in your eyes?
 

ppn

Joined
Aug 18, 2015
Messages
614 (0.37/day)
Radeon VII is already 331 mm², it is physically impossible to fit 2X the shaders with only 6 billion transistors more, and in 420 mm².
In the MI60, 4096 shading units take about 160 mm². Double that while the memory controllers remain unchanged, and you get 2X the shaders in a roughly 50% bigger die. Shrink to 7nm+ with 18% better density and it looks like a perfect prediction by the DB maintainer.
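Spelled out as arithmetic, using only the numbers above. The 160 mm² shader area is my own estimate, and "18% better density" is taken as a plain 1.18x factor, which is an assumption.

```python
VEGA20_DIE_MM2 = 331    # Radeon VII / MI60 die size quoted in the thread
SHADER_BLOCK_MM2 = 160  # area attributed above to 4096 shading units (estimate)
DENSITY_GAIN = 0.18     # "18% better density" for 7nm+, as claimed above


def doubled_shader_die(base_mm2: float, shader_mm2: float) -> float:
    """Add a second shader block; everything else (memory controllers etc.) stays."""
    return base_mm2 + shader_mm2


def shrink(area_mm2: float, density_gain: float) -> float:
    """Area of the same design after a density improvement."""
    return area_mm2 / (1.0 + density_gain)


n7_area = doubled_shader_die(VEGA20_DIE_MM2, SHADER_BLOCK_MM2)
growth = n7_area / VEGA20_DIE_MM2 - 1.0
n7p_area = shrink(n7_area, DENSITY_GAIN)

print(f"7nm die with 2X shaders: {n7_area} mm^2 (+{growth:.0%})")
print(f"After the 7nm+ shrink: {n7p_area:.0f} mm^2")
```

That gives about 491 mm² before the shrink and about 416 mm² after, which is close to the 420 mm² in the updated estimate.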
 