• Welcome to TechPowerUp Forums, Guest! Please check out our forum guidelines for info related to our community.
  • The forums have been upgraded with support for dark mode. By default it will follow the setting on your system/browser. You may override it by scrolling to the end of the page and clicking the gears icon.

AMD's Export-Friendly Radeon AI PRO R9700 GPU Prepares for China Debut

AleksandarK

News Editor
Staff member
Joined
Aug 19, 2017
Messages
3,114 (1.09/day)
According to the latest rumor mill, AMD is preparing Radeon AI PRO R9700, a new GPU designed specifically for the Chinese market. The new Radeon AI PRO R9700 GPU will feature deliberately handicapped performance parameters that comply with the most recent US export regulations while still supporting the local industry's needs for tasks such as on-device inference and model fine-tuning. To adhere to Washington's restrictions, AMD has reduced the chip's compute throughput to remain below the critical threshold for controlled exports, yet it retains 32 GB of high-bandwidth frame buffer memory and full PCIe Gen 5 connectivity. These specifications are intended to ensure efficient data transfer, and when multiple R9700 cards are deployed together, they can deliver substantial processing capability despite the lower power per card.

This China-specific strategy follows a challenging period for AMD in that region, including an $800 million charge related to its MI308 accelerator after it was blocked under earlier rules. NVIDIA has also suffered, losing an estimated $5.5 billion in revenue when its H20 series was barred, and seeing its Chinese market share drop from roughly 90 percent in 2021 to about 50 percent today. These developments have created an opening for AMD to compete on both price and supply reliability. AMD's approach mirrors NVIDIA's release of a restricted-performance version of its Blackwell-architecture B20 GPU for China. Priced more competitively than previous China-compliant offerings, the Radeon AI PRO R9700 will arrive in the third quarter of 2025 alongside NVIDIA's B20. AMD intends to present its full AI silicon roadmap at the Advancing AI Summit on June 12, positioning the R9700 as a targeted solution for enterprises and research institutions rather than for hyperscale cloud environments.



View at TechPowerUp Main Site | Source
 
I would never imagine that such a card will need a cut down version for China. I doubt it's in the level of performance of a 5090D or even a 5080, so why does AMD needs a cut down version?
 
I would never imagine that such a card will need a cut down version for China. I doubt it's in the level of performance of a 5090D or even a 5080, so why does AMD needs a cut down version?
Because China does have an alternative, MTT S4000, which has no quantity limit for a single device.
 
The 9700 has about 1500 AI TOPS

I assume the new regulation for China compliant chips is lower than that
 
It’s almost as if they take whatever the top SKU from whatever US company and tell them to cut it down regardless of specs. The 9700 Pro is not even close to the 4090D.
 
It’s almost as if they take whatever the top SKU from whatever US company and tell them to cut it down regardless of specs. The 9700 Pro is not even close to the 4090D.

The 4090 doesn't support FP4 so the 9700 Pro is faster in that workload compared to the 4090 running FP8 (about 1300 AI TOPS)
 
It’s almost as if they take whatever the top SKU from whatever US company and tell them to cut it down regardless of specs. The 9700 Pro is not even close to the 4090D.

I mean, if true, it's a politically motivated move. But I don't see much or any circumstantial evidence regarding this. I doubt there will be a China-specific nerf because the R9700 is not powerful enough to be subject to export controls to begin with. I'm writing this one down in the rumor category.

The R9700 has roughly the same AI TOPS rating of the RX 9070 XT, 1531 FP4 and 766 FP8 TOPS, substantially weaker compared to Nvidia's offerings on the upper gaming/pro-viz and data center segments, so it's really a 9070 XT with twice the memory and access to pro drivers. Speaking of pro drivers, they were released alongside the GPU 9 days ago, so it's way past the "preparing" stage. It's a real product already in the market. However, Windows drivers are not currently available, and reference design GPUs will, at least for the time being, not be sold. It's an OEM-only product release.

1748610176738.png
 
I mean, if true, it's a politically motivated move. But I don't see much or any circumstantial evidence regarding this. I doubt there will be a China-specific nerf because the R9700 is not powerful enough to be subject to export controls to begin with. I'm writing this one down in the rumor category.

The R9700 has roughly the same AI TOPS rating of the RX 9070 XT, 1531 FP4 and 766 FP8 TOPS, substantially weaker compared to Nvidia's offerings on the upper gaming/pro-viz and data center segments, so it's really a 9070 XT with twice the memory and access to pro drivers. Speaking of pro drivers, they were released alongside the GPU 9 days ago, so it's way past the "preparing" stage. It's a real product already in the market. However, Windows drivers are not currently available, and reference design GPUs will, at least for the time being, not be sold. It's an OEM-only product release.

View attachment 401856
Any chance you get
 
Any chance you get

The horror, I actually linked to AMD's website and pointed out to their official product specifications page! Unforgivable, a personal vendetta.
 
It's an OEM-only product release
AIB partner only, right? (like the RX 9070s were?)

So far we've seen, Powercolor, Sapphire, Asrock, and Gigabyte (Aorus Gaming)

-with XFX, Asus, and Yeston models remaining to be seen.
The final graphics card introduced during today's keynote is the AMD Radeon AI PRO R9700. This card will be available starting in July and board partners like Sapphire, XFX, PowerColor, Gigabyte, ASRock, Asus and Yeston are lined up to release versions of it.
 
AIB partner only, right? (like the RX 9070s were?)

So far we've seen, Powercolor, Sapphire, Asrock, and Gigabyte


-with XFX, Asus, and Yeston models remaining to be seen.

AMD's site states that no reference design board is being provided and that the imaging are artistic renders only, but I believe it will be launched on the retail channel sometime in July, that's what their presentation at Computex said anyway. There's no specific date or confirmed pricing yet, though with drivers, official specifications and related documentation provided, to me at least, is enough to count a product as "launched". If it's priced at around the RTX 5080 (up to $1,2k IMHO, maybe even 1,5k for enthusiasts and AMD diehards - as a former Vega Frontier owner, miss me with getting bit with that again), this will be an amazing card to have, even for gamers. Anything more and you should just buy an RTX 5090 which is twice as powerful at compute/AI and 90% faster at gaming.

I'd be surprised if XFX, Yeston and ASUS actually release AIB versions of Pro cards, it'd be a first (and a quite welcome first so far). AsRock and Gigabyte had AI market products from the last generation (like the AsRock Creator with 12VHPWR connector and the W7900 AI TOP) and I think PowerColor is just entering that market, makes sense they'd use a reference-ish board if they are targeting exclusively the AI/edge market.
 
The 9700 has about 1500 AI TOPS

I assume the new regulation for China compliant chips is lower than that
The 4090 doesn't support FP4 so the 9700 Pro is faster in that workload compared to the 4090 running FP8 (about 1300 AI TOPS)
Theoretically, given that no one has managed to achieve that raw perf number given the poor software stack.

Also, TOPs as a metric is usually meant for INT4, which the 4090 does support and manages 2600 TOPs with sparsity. Don't conflict that with TFLOPs.

As per this:
The performance metric used is known as Total Processing Power (TPP) which is calculated by the maximum compute for a given bit-length using TFLOPs or TOPS multiplied by the number of bits. The maximum threshold allowed by the U.S. export regulations is 4,800. It just so happened that the RTX 4090's performance level in this benchmark is 10% higher (5,286) than the regulation limit, which is how the 4090 ended up on the ban list.
This is about the dense numbers, which the 4090 does at 1300 TOPs. 1300*4 = 5200, which is above the 4800 limit.
We can then assume that 4090D should have less than 1200 INT4 dense TOPs.

The 9070xt (and by extension the Pro 9700 since they're pretty much the same) achieves 1557 INT4 TOPs with sparsity, so ~780 TOPs dense, that's way less than the 4090D and way bellow the sanction limits, so I don't get why they had to reduce it further.

Could it be not about the ban restriction, but about that limit where companies need to notify such exports instead?

Couldn't be related to memory bandwidth either, given that this GPU has a pretty puny bandwidth and AMD did not change it for this new China-exclusive.
 
Theoretically, given that no one has managed to achieve that raw perf number given the poor software stack.

Also, TOPs as a metric is usually meant for INT4, which the 4090 does support and manages 2600 TOPs with sparsity. Don't conflict that with TFLOPs.

As per this:

This is about the dense numbers, which the 4090 does at 1300 TOPs. 1300*4 = 5200, which is above the 4800 limit.
We can then assume that 4090D should have less than 1200 INT4 dense TOPs.

The 9070xt (and by extension the Pro 9700 since they're pretty much the same) achieves 1557 INT4 TOPs with sparsity, so ~780 TOPs dense, that's way less than the 4090D and way bellow the sanction limits, so I don't get why they had to reduce it further.

Could it be not about the ban restriction, but about that limit where companies need to notify such exports instead?

Couldn't be related to memory bandwidth either, given that this GPU has a pretty puny bandwidth and AMD did not change it for this new China-exclusive.

Yup. Compute-wise, it has around the same performance of the RTX 4080 non-Super, that card's performance rating being:

1748666358842.png


No way it's getting sanctioned on a performance basis, but... let me paraphrase my fan club:

Any chance you get

Any chance I get, indeed.
 
Yup. Compute-wise, it has around the same performance of the RTX 4080 non-Super. No way it's getting sanctioned on a performance basis, but...
In theory. In practice it's still behind a 3090 with the current software stack (to the surprise of no one), I verified this with a colleague on a simple mamf-finder run.

No way it's getting sanctioned on a performance basis
Yeah, I don't think this is related to the actual blocks, but rather them being able to avoid the necessity of any kind of notification whatsoever instead, as I had said before.
But that's just me guessing, I'm not really keeping up with whatever rules the US has been throwing around every other week.

Any chance I get, indeed.
Just stop being a fan boy, ez :p
 

Best article I've found, and it's still rather convoluted, but seems hardware at this performance class doesn't need to be issued a special authorization for export, while pretty much everything faster does.

On that note, I actually just installed LM Studio, getting 171 tokens/sec* on deepseek-r1-0528-qwen3-8b, not sure if that is good or bad, but it generates the text much faster than I can read it, and the output is surprisingly coherent. My RTX 5090 whirrs and purrs with a pretty unique coil whine while it's running, I found it adorable lol. Gonna try downloading some other models to play with

*lol literally just realized I had it at 66% power limit when I went to bench Wukong
 
Last edited:
Back
Top