• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.

AMD Radeon Instinct MI100 "Arcturus" Hits the Radar, We Have its BIOS

ppn

Joined
Aug 18, 2015
Messages
1,231 (0.39/day)
700mm2 on 7nm EUV... damn.
Imagine this thing on 14nm, you'd have 3 working dies per wafer :laugh:

ASML
"...new maximum die size of 429 mm². Say goodbye to the massive dies we got used to from Intel and Nvidia. ...",

Vega 20 Transistors 13,230 million Density 40.0M / mm² 7 nm Die Size 331 mm²

MI100 Transistors29,000 million Density 41.4M / mm² 7 nm+ Die Size 700 mm²

Same Density as first gen 7nm. The real EUV should be 18% denser or 505 mm² NAVi21 gets shrinked to exactly 429mm²
 

1idd0kun

New Member
Joined
Feb 10, 2020
Messages
2 (0.00/day)
In real life, Nvidia dominated GPU computing before they added Tensors - all thanks to a better ecosystem and support. And nothing changed here.
Even if Mi100 temporarily pulls ahead in performance, it won't be enough.

Well, I think it's a matter of whether their software stack get some interest from developers. Having key super computers (like Frontier) using their hardware will help in that department.

Also, AMD already has a MI200 in the works. MI100 looks like it's gonna be pretty powerful, but MI200 will be even better, and is probably what they're gonna deploy in Frontier.
 
Joined
Nov 15, 2016
Messages
454 (0.17/day)
System Name Sillicon Nightmares
Processor Intel i7 9700KF 5ghz (5.1ghz 4 core load, no avx offset), 4.7ghz ring, 1.412vcore 1.3vcio 1.264vcsa
Motherboard Asus Z390 Strix F
Cooling DEEPCOOL Gamer Storm CAPTAIN 360
Memory 2x8GB G.Skill Trident Z RGB (B-Die) 3600 14-14-14-28 1t, tRFC 220 tREFI 65535, tFAW 16, 1.545vddq
Video Card(s) ASUS GTX 1060 Strix 6GB XOC, Core: 2202-2240, Vcore: 1.075v, Mem: 9818mhz (Sillicon Lottery Jackpot)
Storage Samsung 840 EVO 1TB SSD, WD Blue 1TB, Seagate 3TB, Samsung 970 Evo Plus 512GB
Display(s) BenQ XL2430 1080p 144HZ + (2) Samsung SyncMaster 913v 1280x1024 75HZ + A Shitty TV For Movies
Case Deepcool Genome ROG Edition
Audio Device(s) Bunta Sniff Speakers From The Tip Edition With Extra Kenwoods
Power Supply Corsair AX860i/Cable Mod Cables
Mouse Logitech G602 Spilled Beer Edition
Keyboard Dell KB4021
Software Windows 10 x64
Benchmark Scores 13543 Firestrike (3dmark.com/fs/22336777) 601 points CPU-Z ST 37.4ns AIDA Memory
only amd can make a gpu so bad at being a gpu that it has no graphics processing capability
 
Joined
Jan 8, 2017
Messages
8,860 (3.36/day)
System Name Good enough
Processor AMD Ryzen R9 7900 - Alphacool Eisblock XPX Aurora Edge
Motherboard ASRock B650 Pro RS
Cooling 2x 360mm NexXxoS ST30 X-Flow, 1x 360mm NexXxoS ST30, 1x 240mm NexXxoS ST30
Memory 32GB - FURY Beast RGB 5600 Mhz
Video Card(s) Sapphire RX 7900 XT - Alphacool Eisblock Aurora
Storage 1x Kingston KC3000 1TB 1x Kingston A2000 1TB, 1x Samsung 850 EVO 250GB , 1x Samsung 860 EVO 500GB
Display(s) LG UltraGear 32GN650-B + 4K Samsung TV
Case Phanteks NV7
Power Supply GPS-750C
I looked at NVIDIA Quadro RTX 8000 specs and it says over 200 TOPS at INT8. Is there really so huge difference between Radeon and RTX, or does Nvidia counts differently?

They use different ratios of execution units. Everything is a trade-off, MI60 has a lot of FP64 units, Turing doesn't, Volta does but it doesn't have any RT cores. The thing is though INT8/INT4/FP16 aren't that critical.

Out of all of those, FP64 units have become indispensable. It used to be that they were very expensive power and size wise and that's why GPUs of the past skimped on that but any real compute accelerator nowadays needs to have strong FP64 performance. 64 bit floating point is usually the de facto precision for simulations and that sort of stuff, you can do without tensor cores or INT8/INT4/FP16 but not without FP64 in a data center environment. That's why there have been no large Turing based Tesla's, because no one would have wanted them due to their poor FP64 performance.
 
Last edited by a moderator:
Joined
Jun 28, 2016
Messages
3,595 (1.27/day)
Hehe, they seem to follow the motto: "Bad product is still better than no product"
They've signed a few large contracts and they have to deliver a GPGPU accelerator. It doesn't have to be the best available. It only has to match the agreed specification.
 

ARF

Joined
Jan 28, 2020
Messages
3,892 (2.56/day)
Location
Ex-usa
That's not AMD's spec page. It's Techpowerup's speculated spec list.

When the user with the Arcturus card run GPU-Z and submitted the BIOS to the database, well, I don't think so.
I think there is a way for the software to read the specifications and put them into the database.

I smell $1000 Gaming GPU's this year by AMD...

No way, I smell much lower prices, after all, AMD has to regain some mindshare and lost positions.

ASML
"...new maximum die size of 429 mm². Say goodbye to the massive dies we got used to from Intel and Nvidia. ...",

Vega 20 Transistors 13,230 million Density 40.0M / mm² 7 nm Die Size 331 mm²

MI100 Transistors29,000 million Density 41.4M / mm² 7 nm+ Die Size 700 mm²

Same Density as first gen 7nm. The real EUV should be 18% denser or 505 mm² NAVi21 gets shrinked to exactly 429mm²

Probably fake limit of only 429 mm². That would mean bye-bye enthusiasts videocards.
 

T4C Fantasy

CPU & GPU DB Maintainer
Staff member
Joined
May 7, 2012
Messages
2,562 (0.59/day)
Location
Rhode Island
System Name Whaaaat Kiiiiiiid!
Processor Intel Core i9-12900K @ Default
Motherboard Gigabyte Z690 AORUS Elite AX
Cooling Corsair H150i AIO Cooler
Memory Corsair Dominator Platinum 32GB DDR4-3200
Video Card(s) EVGA GeForce RTX 3080 FTW3 ULTRA @ Default
Storage Samsung 970 PRO 512GB + Crucial MX500 2TB x3 + Crucial MX500 4TB + Samsung 980 PRO 1TB
Display(s) 27" LG 27MU67-B 4K, + 27" Acer Predator XB271HU 1440P
Case Thermaltake Core X9 Snow
Audio Device(s) Logitech G935 Headset
Power Supply SeaSonic Platinum 1050W Snow Silent
Mouse Logitech G903 Lightspeed
Keyboard Logitech G915
Software Windows 11 Pro
Benchmark Scores FFXV: 19329
ASML
"...new maximum die size of 429 mm². Say goodbye to the massive dies we got used to from Intel and Nvidia. ...",

Vega 20 Transistors 13,230 million Density 40.0M / mm² 7 nm Die Size 331 mm²

MI100 Transistors29,000 million Density 41.4M / mm² 7 nm+ Die Size 700 mm²

Same Density as first gen 7nm. The real EUV should be 18% denser or 505 mm² NAVi21 gets shrinked to exactly 429mm²
So this will be first gen 7nm to reach 500+mm2? Or for it to have 8192 cores it is made possible by losing the raster engines?

When the user with the Arcturus card run GPU-Z and submitted the BIOS to the database, well, I don't think so.
I think there is a way for the software to read the specifications and put them into the database.



No way, I smell much lower prices, after all, AMD has to regain some mindshare and lost positions.



Probably fake limit of only 429 mm². That would mean bye-bye enthusiasts videocards.
The bios doesn't show any die information, all of this is speculation by me and will be updated when new info comes out.
 
Joined
Nov 24, 2017
Messages
853 (0.37/day)
Location
Asia
Processor Intel Core i5 4590
Motherboard Gigabyte Z97x Gaming 3
Cooling Intel Stock Cooler
Memory 8GiB(2x4GiB) DDR3-1600 [800MHz]
Video Card(s) XFX RX 560D 4GiB
Storage Transcend SSD370S 128GB; Toshiba DT01ACA100 1TB HDD
Display(s) Samsung S20D300 20" 768p TN
Case Cooler Master MasterBox E501L
Audio Device(s) Realtek ALC1150
Power Supply Corsair VS450
Mouse A4Tech N-70FX
Software Windows 10 Pro
Benchmark Scores BaseMark GPU : 250 Point in HD 4600

evolucian911

New Member
Joined
Mar 31, 2019
Messages
11 (0.01/day)
Theoretically, and based on V7 pro performance. A version of this card for Pros would be insane. To call it a monster would be an understatement. On top of that, since its Vega they can add the ability like with last Vega based card to use m.2 as extra ram. I'm certain this card is coming. They just need to focus and market it for that segment and not gaming like before. Call it the FirePro X. This specific version though is obviously just for compute and even still it is beyond anything on the market by Far.
 
Joined
Dec 10, 2014
Messages
1,325 (0.39/day)
Location
Nowy Warsaw
System Name SYBARIS
Processor AMD Ryzen 5 3600
Motherboard MSI Arsenal Gaming B450 Tomahawk
Cooling Cryorig H7 Quad Lumi
Memory Team T-Force Delta RGB 2x8GB 3200CL16
Video Card(s) Colorful GeForce RTX 2060 6GV2
Storage Crucial MX500 500GB | WD Black WD1003FZEX 1TB | Seagate ST1000LM024 1TB | WD My Passport Slim 1TB
Display(s) AOC 24G2 24" 144hz IPS
Case Montech Air ARGB
Audio Device(s) Massdrop + Sennheiser PC37X | QKZ x HBB
Power Supply Corsair CX650-F
Mouse Razer Viper Mini | Cooler Master MM711 | Logitech G102 | Logitech G402
Keyboard Drop + The Lord of the Rings Dwarvish
Software Windows 10 Education 22H2 x64
Let me set this straight. It's like taking out passenger/driver cabin and trunk space off a Tesla and fill it with battery and more powerful motor (delete raster capability, basically the little doo-dah that shows stuff on your monitor and fill the space with CUs). Even steering system because this Tesla will just go straight in highly specialized environment (only for AI-ML acceleration unlike GPUs that are usually jack of all trades).
Except extra battery and motor wouldn't be used for faster 0-60 and top speed rather hauling more stuff so a Tesla semi or pickup (not for gaming but workstation only).

Some things you get after reading this thread:
It's not a gaming card at all. It's workstation card through and through. But people will like a gaming card with this spec.
People don't understand that with raster stuff added back in this thing will become MASSIVE.
AMD also has brain-dead haters for their products who don't even understand what they're hating.

My personal take is that this is a good development. I've long prescribed that AMD should have different architectures for different markets instead of "jack of all trades, master of none". Now that CPU business is making them money, hope they spend clever money on R&D for RTG.
 

evolucian911

New Member
Joined
Mar 31, 2019
Messages
11 (0.01/day)
Let me set this straight. It's like taking out passenger/driver cabin and trunk space off a Tesla and fill it with battery and more powerful motor (delete raster capability, basically the little doo-dah that shows stuff on your monitor and fill the space with CUs). Even steering system because this Tesla will just go straight in highly specialized environment (only for AI-ML acceleration unlike GPUs that are usually jack of all trades).
Except extra battery and motor wouldn't be used for faster 0-60 and top speed rather hauling more stuff so a Tesla semi or pickup (not for gaming but workstation only).

Some things you get after reading this thread:
It's not a gaming card at all. It's workstation card through and through. But people will like a gaming card with this spec.
People don't understand that with raster stuff added back in this thing will become MASSIVE.
AMD also has brain-dead haters for their products who don't even understand what they're hating.

My personal take is that this is a good development. I've long prescribed that AMD should have different architectures for different markets instead of "jack of all trades, master of none". Now that CPU business is making them money, hope they spend clever money on R&D for RTG.
This is why I said a "version " of this card for pro. Which would obviously mean rasters in amongst other things and inevitably higher tdp.
 
Joined
Nov 4, 2005
Messages
11,654 (1.73/day)
System Name Compy 386
Processor 7800X3D
Motherboard Asus
Cooling Air for now.....
Memory 64 GB DDR5 6400Mhz
Video Card(s) 7900XTX 310 Merc
Storage Samsung 990 2TB, 2 SP 2TB SSDs and over 10TB spinning
Display(s) 56" Samsung 4K HDR
Audio Device(s) ATI HDMI
Mouse Logitech MX518
Keyboard Razer
Software A lot.
Benchmark Scores Its fast. Enough.
Hehe, they seem to follow the motto: "Bad product is still better than no product"

Your poor attempt at trolling and fanboyism is honestly the coolest thing ever, ohhh baby you make me so excited for sucking Nvidia off with you......

700mm die, at 1Ghz plus, and an interposer to hold it and HBM? I imagine yield loss is huge, but maybe they have perfected it so it's actually profitable. Now when do we get to see actual performance numbers?
 
Joined
Jun 28, 2016
Messages
3,595 (1.27/day)
700mm die, at 1Ghz plus, and an interposer to hold it and HBM? I imagine yield loss is huge, but maybe they have perfected it so it's actually profitable. Now when do we get to see actual performance numbers?
Or they have no choice - they need to make such a chip (PoC).
 
Joined
Apr 30, 2011
Messages
2,648 (0.56/day)
Location
Greece
Processor AMD Ryzen 5 5600@80W
Motherboard MSI B550 Tomahawk
Cooling ZALMAN CNPS9X OPTIMA
Memory 2*8GB PATRIOT PVS416G400C9K@3733MT_C16
Video Card(s) Sapphire Radeon RX 6750 XT Pulse 12GB
Storage Sandisk SSD 128GB, Kingston A2000 NVMe 1TB, Samsung F1 1TB, WD Black 10TB
Display(s) AOC 27G2U/BK IPS 144Hz
Case SHARKOON M25-W 7.1 BLACK
Audio Device(s) Realtek 7.1 onboard
Power Supply Seasonic Core GC 500W
Mouse Sharkoon SHARK Force Black
Keyboard Trust GXT280
Software Win 7 Ultimate 64bit/Win 10 pro 64bit/Manjaro Linux
A shock for many that is has 128CUs as they couldn't get that big Navi will have even 80CUs. But AMD is on rails now with only marketing being inferior but their products are on top level. Even their latest GPU drivers for Navi arch are improved (the biggest customer problem for the last 5-6 months). Impressive feat nevertheless, especially for 200W. Big Navi now can easily become the fastest GPU with some distance while using under 300W. It seems that 7nm+ helps muchly in efficiency.
 
Joined
Sep 15, 2007
Messages
3,944 (0.65/day)
Location
Police/Nanny State of America
Processor OCed 5800X3D
Motherboard Asucks C6H
Cooling Air
Memory 32GB
Video Card(s) OCed 6800XT
Storage NVMees
Display(s) 32" Dull curved 1440
Case Freebie glass idk
Audio Device(s) Sennheiser
Power Supply Don't even remember
Your poor attempt at trolling and fanboyism is honestly the coolest thing ever, ohhh baby you make me so excited for sucking Nvidia off with you......

700mm die, at 1Ghz plus, and an interposer to hold it and HBM? I imagine yield loss is huge, but maybe they have perfected it so it's actually profitable. Now when do we get to see actual performance numbers?

No doubt cut down dies will be sold, so I'm sure there's plenty of profit even if it's 50% fully functioning yield.
 
Joined
Feb 19, 2019
Messages
324 (0.17/day)
A shock for many that is has 128CUs as they couldn't get that big Navi will have even 80CUs. But AMD is on rails now with only marketing being inferior but their products are on top level. Even their latest GPU drivers for Navi arch are improved (the biggest customer problem for the last 5-6 months). Impressive feat nevertheless, especially for 200W. Big Navi now can easily become the fastest GPU with some distance while using under 300W. It seems that 7nm+ helps muchly in efficiency.
Maybe this is related to what they have done with Ryzen 4000 Vega cores efficiency.
 

ARF

Joined
Jan 28, 2020
Messages
3,892 (2.56/day)
Location
Ex-usa
The estimation changed, not a 700 die any loner.

Radeon VII is already 331 mm² , it is physically impossible to fit 2X the shaders with only 6 billion transistors more, and in 420 mm² .

420 mm² is impossible with these specs.

Have you got any source that they can manufacture larger than 429 mm² dies on N7+ and have you got any source that this particular chip is on N7+, and not in N7 ?
 
Joined
Jun 28, 2016
Messages
3,595 (1.27/day)
The coments here, jesus. This GPU can't even render graphics. Do people even realize that ? Its purely AI GPU which explains the low wattage. Its similar to 75W Tesla T4 which is purely AI as well while having RTX 2070S spec.
Absolutely NOT.

Obviously, this card can render. It's built around a normal GPU. It just can't provide a video signal - there are no outputs and no logic dedicated for this task.
It can be used in any scenario that can utilize GPGPU (including AI, obviously).

This is NOT similar to Tesla T4.
In green camp you have the V100, which is an all-mighty, all-round, dual-slot accelerator. Mi100 (like Mi60 now) will compete in this segment.
Nvidia also makes the Tesla T4, which has half of V100 Tensor (AI) potential, but just 5% of it's double-precision performance. T4 is single-slot, 1/4th of V100 price and uses 75W (V100 is up to 300W).

Which means that if you need V100's double-precision (all-round) performance, you buy a V100. You can't go wrong with this card.
But if you don't need it, you buy 2xTesla T4's - you get pretty much the same performance in stuff like Deep Learning (e.g. image recognition) for half the money and half the power.
 
  • Like
Reactions: ppn
Joined
Jun 2, 2017
Messages
7,788 (3.13/day)
System Name Best AMD Computer
Processor AMD 7900X3D
Motherboard Asus X670E E Strix
Cooling In Win SR36
Memory GSKILL DDR5 32GB 5200 30
Video Card(s) Sapphire Pulse 7900XT (Watercooled)
Storage Corsair MP 700, Seagate 530 2Tb, Adata SX8200 2TBx2, Kingston 2 TBx2, Micron 8 TB, WD AN 1500
Display(s) GIGABYTE FV43U
Case Corsair 7000D Airflow
Audio Device(s) Corsair Void Pro, Logitch Z523 5.1
Power Supply Deepcool 1000M
Mouse Logitech g7 gaming mouse
Keyboard Logitech G510
Software Windows 11 Pro 64 Steam. GOG, Uplay, Origin
Benchmark Scores Firestrike: 46183 Time Spy: 25121
That would be another massive failure just like the Crap-eon 7, but then again, AMD just doesn't seem to tire of failures... :D

Why was the Radeon 7 a failure in your eyes?
 

ppn

Joined
Aug 18, 2015
Messages
1,231 (0.39/day)
Radeon VII is already 331 mm² , it is physically impossible to fit 2X the shaders with only 6 billion transistors more, and in 420 mm² .

In Mi60 -4096 shading units = 160 mm2. double that, the memory controllers remain unchanged. SO therefore 50% bigger die 2X shaders. Shrink to 7nm+ with 18% better density. looks like perfect prediction by DB maintainer.
 
Top