News Posts matching #A100


NVIDIA A800 China-Tailored GPU Performance within 70% of A100

The recent growth in demand for training Large Language Models (LLMs) like Generative Pre-trained Transformer (GPT) has prompted many companies to invest in the GPU solutions used to train these models. However, countries like China are subject to US sanctions, and NVIDIA has had to create custom models that meet US export regulations. The two resulting GPUs, the H800 and A800, are cut-down versions of the original H100 and A100, respectively. We previously reported on the H800; however, it has remained as mysterious as the A800 we are covering today. Thanks to MyDrivers, we now have information indicating that the A800's performance is within 70% of the regular A100.

The regular A100 GPU manages 9.7 TeraFLOPS of FP64, 19.5 TeraFLOPS of FP64 Tensor, and up to 624 BF16/FP16 TeraFLOPS with sparsity. Rough napkin math suggests that 70% of the original performance (a 30% cut) works out to 6.8 TeraFLOPS of FP64, 13.7 TeraFLOPS of FP64 Tensor, and 437 BF16/FP16 TeraFLOPS with sparsity. MyDrivers notes that the A800 can be had for 100,000 Yuan, translating to about 14,462 USD at the time of writing. This is not the most capable GPU that Chinese companies can acquire, as the H800 exists; however, we don't have any information about its performance for now.
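As a quick sanity check of that napkin math, here is a minimal Python sketch; the flat 70% scaling factor is an assumption taken from the MyDrivers claim, not an official NVIDIA specification:

a100_tflops = {"FP64": 9.7, "FP64 Tensor": 19.5, "BF16/FP16 with sparsity": 624}
scale = 0.70  # assumed flat 70%-of-A100 scaling, per the MyDrivers claim
for metric, value in a100_tflops.items():
    print(f"Estimated A800 {metric}: ~{value * scale:.1f} TeraFLOPS")
# Prints ~6.8, ~13.7 and ~436.8 TeraFLOPS, matching the figures above.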

NVIDIA H100 Compared to A100 for Training GPT Large Language Models

NVIDIA's H100 has recently become available via Cloud Service Providers (CSPs), and it was only a matter of time before someone benchmarked its performance and compared it to the previous-generation A100 GPU. Today, thanks to benchmarks from MosaicML, a startup led by Naveen Rao, the former CEO of Nervana and GM of Artificial Intelligence (AI) at Intel, we have a comparison of these two GPUs with a fascinating insight into the cost factor. MosaicML took Generative Pre-trained Transformer (GPT) models of various sizes and trained them using the bfloat16 and FP8 floating-point precision formats. All training occurred on CoreWeave cloud GPU instances.

Regarding performance, the NVIDIA H100 GPU achieved anywhere from a 2.2x to 3.3x speedup. However, an interesting finding emerges when comparing the cost of running these GPUs in the cloud. CoreWeave prices the H100 SXM GPUs at $4.76/hr/GPU, while the A100 80 GB SXM goes for $2.21/hr/GPU. While the H100 is about 2.2x more expensive per hour, the performance makes up for it, resulting in less time to train a model and a lower total price for the training run. This makes the H100 more attractive for researchers and companies wanting to train Large Language Models (LLMs), and makes choosing the newer GPU more viable despite the increased cost. Below, you can see tables comparing the two GPUs in training time, speedup, and cost of training.
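The cost argument can be illustrated with a rough back-of-the-envelope sketch. The hourly rates are the CoreWeave prices quoted above, while the 3x speedup and the 1,000 GPU-hours are illustrative assumptions, so the exact ratio will vary with the actual workload:

h100_rate, a100_rate = 4.76, 2.21  # CoreWeave $/hr/GPU, as quoted above
speedup = 3.0                      # assumed H100-vs-A100 training speedup (within the 2.2x-3.3x range)
a100_gpu_hours = 1000.0            # hypothetical GPU-hours to train a given model on A100
h100_gpu_hours = a100_gpu_hours / speedup
print(f"A100 training cost: ${a100_gpu_hours * a100_rate:,.0f}")
print(f"H100 training cost: ${h100_gpu_hours * h100_rate:,.0f}")
# At a 3x speedup, the H100 run costs roughly 28% less despite the higher hourly rate.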

Alphacool Expands Enterprise Solutions with Water Blocks for A100 80 GB PCIe, RTX A4000, and RTX 6000 Ada 48 GB SKUs

Alphacool expands the portfolio of its Enterprise Solutions series of GPU water coolers and presents the new ES NV A100 80 GB PCIe, the ES RTX A4000 with backplate, and the ES RTX 6000 Ada 48 GB.

To best dissipate the enormous waste heat of this GPU generation, the cooler sits very close to the components it is meant to cool. The fin structure has been adapted to allow very good water flow while increasing the cooling surface. The modified jetplate with an improved inflow engine ensures optimal distribution of water across the cooling fins. The fully chromed copper base is resistant to acids, scratches, and damage. The matte carbon finish gives the cooler an elegant appearance, which also makes it interesting for private users who prefer to go without aRGB lighting.

Chinese GPU Maker Biren Technology Loses its Co-Founder, Only Months After Revealing New GPUs

Golf Jiao, a co-founder and general manager of Biren Technology, left the company late last month, according to insider sources in China. No official statement has been issued by the executive team at Biren Tech, and Jiao has not provided any details regarding his departure from the fabless semiconductor design company. The Shanghai-based firm is a relatively young startup - it was founded in 2019 by several former NVIDIA, Qualcomm, and Alibaba veterans. Biren Tech received $726.6 million in funding for its debut range of general-purpose graphics processing units (GPGPUs), also described as high-performance computing graphics processing units (HPC GPUs).

The company revealed its ambitions to take on NVIDIA's Ampere A100 and Hopper H100 compute platforms, and last August announced two HPC GPUs in the form of the BR100 and BR104. The specifications and performance charts demonstrated impressive figures, but Biren Tech had to roll back its numbers when it was hit by U.S. Government sanctions in October 2022. The fabless company had contracted TSMC to produce its Biren range, and the new set of rules resulted in shipments from the Taiwanese foundry being halted. Biren Tech cut its workforce by a third soon after losing its supply chain with TSMC, and the engineering team had to reassess how the BR100 and BR104 would perform on a process node larger than the original 7 nm design. It was decided that a downgrade in transfer rates would appease the legal teams and get the newly redesigned Biren silicon back onto the assembly line.

NVIDIA Prepares H800 Adaptation of H100 GPU for the Chinese Market

NVIDIA's H100 accelerator is one of the most powerful solutions for powering AI workloads, and, of course, every company and government wants to use it for theirs. However, in countries like China, shipping US-made goods is challenging. With export regulations in place, NVIDIA had to get creative and make a specific version of its H100 GPU for the Chinese market, labeled the H800. Late last year, NVIDIA also created a China-specific version of the A100 called the A800, with the only difference being the chip-to-chip interconnect bandwidth, which was dropped from 600 GB/s to 400 GB/s.

This year's H800 SKU features similar restrictions, and the company appears to have made similar sacrifices to ship its chips to China. From the 600 GB/s bandwidth of the regular H100 PCIe model, the H800 is cut down to only 300 GB/s of bi-directional chip-to-chip interconnect bandwidth. While we have no data on whether the CUDA or Tensor core counts have been adjusted, the sacrifice in bandwidth made to comply with export regulations will have consequences: as the communication speed is reduced, training large models will incur higher latency and run slower than on the regular H100 chip, because of the massive amount of data that needs to travel from one chip to another. According to Reuters, an NVIDIA spokesperson declined to discuss other differences, stating that "our 800 series products are fully compliant with export control regulations."

Supermicro Expands GPU Solutions Portfolio with Deskside Liquid-Cooled AI Development Platform, Powered by NVIDIA

Supermicro, Inc., a Total IT Solution Provider for Cloud, AI/ML, Storage, and 5G/Edge, is announcing the first in a line of powerful yet quiet and power-efficient NVIDIA-accelerated AI development platforms, which give information professionals and developers the most powerful technology available today at their deskside. The new AI development platform, the SYS-751GE-TNRT-NV1, is an application-optimized system that excels at developing and running AI-based software. This innovative system gives developers and users a complete HPC and AI resource for departmental workloads. In addition, this powerful system can support a small team of users running training, inference, and analytics workloads simultaneously.

The self-contained liquid-cooling feature addresses the thermal design power needs of the four NVIDIA A100 Tensor Core GPUs and the two 4th Gen Intel Xeon Scalable CPUs, enabling full performance while improving the overall system's efficiency and allowing quiet (approximately 30 dB) operation in an office environment. In addition, this system is designed to accommodate high-performing CPUs and GPUs, making it ideal for AI/DL/ML and HPC applications. The system can reside in an office environment or be rack-mounted when installed in a data center, simplifying IT management.

ASUS Announces NVIDIA-Certified Servers and ProArt Studiobook Pro 16 OLED at GTC

ASUS today announced its participation in NVIDIA GTC, a developer conference for the era of AI and the metaverse. ASUS will offer comprehensive NVIDIA-certified server solutions that support the latest NVIDIA L4 Tensor Core GPU, which accelerates real-time video AI and generative AI, as well as the NVIDIA BlueField-3 DPU, igniting unprecedented innovation for supercomputing infrastructure. ASUS will also launch the new ProArt Studiobook Pro 16 OLED laptop with the NVIDIA RTX 3000 Ada Generation Laptop GPU for mobile creative professionals.

Purpose-built GPU servers for generative AI
Generative AI applications enable businesses to develop better products and services, and deliver original content tailored to the unique needs of customers and audiences. ASUS ESC8000 and ESC4000 are fully certified NVIDIA servers that support up to eight NVIDIA L4 Tensor Core GPUs, which deliver universal acceleration and energy efficiency for AI with up to 2.7x more generative-AI performance than the previous GPU generation. ASUS ESC and RS series servers are engineered for HPC workloads, with support for the NVIDIA BlueField-3 DPU to transform data center infrastructure, as well as NVIDIA AI Enterprise applications for streamlined AI workflows and deployment.

NVIDIA Hopper GPUs Expand Reach as Demand for AI Grows

NVIDIA and key partners today announced the availability of new products and services featuring the NVIDIA H100 Tensor Core GPU—the world's most powerful GPU for AI—to address rapidly growing demand for generative AI training and inference. Oracle Cloud Infrastructure (OCI) announced the limited availability of new OCI Compute bare-metal GPU instances featuring H100 GPUs. Additionally, Amazon Web Services announced its forthcoming EC2 UltraClusters of Amazon EC2 P5 instances, which can scale in size up to 20,000 interconnected H100 GPUs. This follows Microsoft Azure's private preview announcement last week for its H100 virtual machine, ND H100 v5.

Additionally, Meta has now deployed its H100-powered Grand Teton AI supercomputer internally for its AI production and research teams. NVIDIA founder and CEO Jensen Huang announced during his GTC keynote today that NVIDIA DGX H100 AI supercomputers are in full production and will be coming soon to enterprises worldwide.

NVIDIA to Lose Two Major HPC Partners in China, Focuses on Complying with Export Control Rules

NVIDIA's presence in high-performance computing has steadily increased, with various workloads benefiting from the company's AI and HPC accelerator GPUs. One of the important markets for the company is China, and export regulations are about to complicate NVIDIA's business dealings with the country. NVIDIA's major partners in the Asia-Pacific region are Inspur and Huawei, which make servers powered by A100 and H100 GPU solutions. Amid the latest complications from the Biden Administration, the US is considering further limits on exports of US-designed goods to Chinese entities. Back in 2019, the US blacklisted Huawei and restricted sales of the latest GPU hardware to the company. Last week, the Biden Administration also blacklisted Inspur, the world's third-largest server maker.

At the Morgan Stanley conference, NVIDIA's Chief Financial Officer Colette Kress noted: "Inspur is a partner for us, when we indicate a partner, they are helping us stand up computing for the end customers. As we work forward, we will probably be working with other partners, for them to stand-up compute within the Asia-Pac region or even other parts of the world. But again, our most important focus is focusing on the law and making sure that we follow export controls very closely. So in this case, we will look in terms of other partners to help us." This indicates that NVIDIA will lose millions of dollars in revenue due to the inability to sell its GPUs to partners like Inspur. As the company stated, complying with export regulations is its most crucial focus.

Shipments of AI Servers Will Climb at CAGR of 10.8% from 2022 to 2026

According to TrendForce's latest survey of the server market, many cloud service providers (CSPs) have begun large-scale investments in the kinds of equipment that support artificial intelligence (AI) technologies. This development is in response to the emergence of new applications such as self-driving cars, artificial intelligence of things (AIoT), and edge computing since 2018. TrendForce estimates that in 2022, AI servers equipped with general-purpose GPUs (GPGPUs) accounted for almost 1% of annual global server shipments. Moving into 2023, shipments of AI servers are projected to grow by 8% YoY thanks to chatbots and similar applications generating demand across AI-related fields. Furthermore, shipments of AI servers are forecast to increase at a CAGR of 10.8% from 2022 to 2026.
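To put the 10.8% CAGR into perspective, here is a short sketch that projects relative AI-server shipment volumes through 2026; the 2022 baseline is normalized to 1.0 because TrendForce does not publish absolute unit counts in this summary:

base_year, cagr = 2022, 0.108  # TrendForce's forecast CAGR for AI-server shipments
volume = 1.0                   # 2022 shipment volume, normalized to 1.0
for year in range(base_year, 2027):
    print(f"{year}: {volume:.2f}x of 2022 shipments")
    volume *= 1 + cagr
# By 2026 the projection works out to roughly 1.51x the 2022 shipment volume.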

EK Introduces the EK-Pro NVIDIA A100 80 GB Rack GPU Water Block

EK, the leading computer cooling solutions provider, is now offering an enterprise-grade GPU water block for PNY NVIDIA A100 80 GB PCIe data center GPUs. The EK-Pro GPU WB A100 80 GB Rack - Nickel + Inox is a high-performance water block specifically engineered to make the entire GPU and water block assembly as thin as possible, effectively allowing it to consume only a single PCIe slot width-wise. The water block is equipped with a rack-style terminal, considerably reducing the assembly height and increasing the chassis compatibility.

By spanning the entire PCB, the water block directly cools the GPU, HBM VRAM, and the VRM (voltage regulation module) as the cooling liquid is channeled directly over these critical areas.

ORNL's Exaflop Machine Frontier Keeps Top Spot, New Competitor Leonardo Breaks the Top10 List

The 60th edition of the TOP500 reveals that the Frontier system is still the only true exascale machine on the list.

With an HPL score of 1.102 EFlop/s, the Frontier machine at Oak Ridge National Laboratory (ORNL) did not improve upon the score it reached on the June 2022 list. That said, Frontier's near-tripling of the HPL score of the second-place system is still a major victory for computer science. On top of that, Frontier demonstrated a score of 7.94 EFlop/s on the HPL-MxP benchmark, which measures performance in mixed-precision calculations. Frontier is based on the HPE Cray EX235a architecture and relies on AMD EPYC 64C 2 GHz processors. The system has 8,730,112 cores, a power-efficiency rating of 52.23 gigaflops/watt, and uses HPE's Slingshot-11 interconnect for data transfer.
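Those two TOP500 figures also let us estimate Frontier's power draw during the HPL run; this is our own derivation from the published score and efficiency rating, not a number taken from the list itself:

hpl_score_eflops = 1.102            # Frontier's HPL score in EFlop/s
efficiency_gflops_per_watt = 52.23  # published power-efficiency rating
# Convert EFlop/s to GFlop/s (1 EFlop/s = 1e9 GFlop/s), then divide by efficiency to get watts.
power_watts = hpl_score_eflops * 1e9 / efficiency_gflops_per_watt
print(f"Estimated HPL power draw: {power_watts / 1e6:.1f} MW")  # roughly 21 MW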

NEC Selects Supermicro GPU Systems for One of Japan's Largest Supercomputers for Advanced AI Research

Supermicro, a Total IT Solution Provider for Cloud, AI/ML, Storage, and 5G/Edge, is announcing that NEC Corporation has selected over 116 Supermicro GPU servers, each containing dual-socket 3rd Gen Intel Xeon Scalable processors and eight NVIDIA A100 80 GB GPUs. The Supermicro GPU server line thus combines the latest and most powerful Intel Xeon Scalable processors with the most advanced AI GPUs from NVIDIA.

"Supermicro is thrilled to deliver an additional 580 PFLOPS of AI training power to its worldwide AI installations," said Charles Liang, president, and CEO, Supermicro. "Supermicro GPU servers have been installed at NEC Corporation and are used to conduct state-of-the-art AI research. Our servers are designed for the most demanding AI workloads using the highest-performing CPUs and GPUs. We continue to work with leading customers worldwide to achieve their business objectives faster and more efficiently with our advanced rack-scale server solutions."

Inventec's Rhyperior Is the Powerhouse GPU Accelerator System Every Business in the AI And ML World Needs

Taiwan-based server manufacturer Inventec's powerhouse GPU accelerator system, Rhyperior, is everything a modern-day business needs in the digital era, especially one relying heavily on Artificial Intelligence (AI) and Machine Learning (ML). A unique combination of GPUs and CPUs, this 4U GPU accelerator system is based on the NVIDIA A100 Tensor Core GPU and 3rd Gen Intel Xeon processors (Whitley platform). Rhyperior also incorporates NVIDIA NVSwitch to dramatically enhance performance, making it an effective tool for modern workloads.

In a world where technology is disrupting our lives as we know them, GPU acceleration is critical, essentially speeding up processes that would otherwise take much longer. Acceleration boosts execution of complex computational problems that can be broken down into similar, parallel operations. In other words, an excellent accelerator can be a game changer for industries like gaming and healthcare, which increasingly rely on the latest technologies like AI and ML to deliver better, more robust solutions for consumers.

ASUS Servers Announce AI Developments at NVIDIA GTC

ASUS, the leading IT company in server systems, server motherboards and workstations, today announced its presence at NVIDIA GTC - a developer conference for the era of AI and the metaverse. ASUS will focus on three demonstrations outlining its strategic developments in AI, including: the methodology behind ASUS MLPerf Training v2.0 results that achieved multiple breakthrough records; a success story exploring the building of an academic AI data center at King Abdullah University of Science and Technology (KAUST) in Saudi Arabia; and a research AI data center created in conjunction with the National Health Research Institute in Taiwan.

MLPerf benchmark results help advance machine-learning performance and efficiency, allowing researchers to evaluate the efficacy of AI training and inference based on specific server configurations. Since joining MLCommons in 2021, ASUS has achieved multiple breakthrough records in the data-center closed division across six AI-benchmark tasks in AI training and inferencing, most recently in MLPerf Training v2.0. At the ASUS GTC session, senior ASUS software engineers will share the methodology for achieving these world-class results, as well as the company's efforts to deliver more efficient AI workflows through machine learning.

NVIDIA Rush-Orders A100 and H100 AI-GPUs with TSMC Before US Sanctions Hit

Early this month, the US Government banned American companies from exporting AI-acceleration GPUs to China and Russia, but these restrictions don't take effect before March 2023. This gives NVIDIA time to take rush orders from Chinese companies for its AI accelerators before the sanctions hit. The company has placed rush orders for a large quantity of A100 "Ampere" and H100 "Hopper" chips with TSMC so that they can be delivered to firms in China before March 2023, according to a report by Chinese business news publication UDN. The rush orders for high-margin products such as AI GPUs could come as a shot in the arm for NVIDIA, which is facing a sudden loss in gaming GPU revenue, as those chips are no longer in demand from cryptocurrency miners.

Supermicro Adds New 8U Universal GPU Server for AI Training, NVIDIA Omniverse, and Meta

Super Micro Computer, Inc. (SMCI), a global leader in enterprise computing, storage, networking solutions, and green computing technology, is announcing its most advanced GPU server, incorporating eight NVIDIA H100 Tensor Core GPUs. Due to its advanced airflow design, the new high-end GPU system allows increased inlet temperatures, reducing a data center's overall Power Usage Effectiveness (PUE) while maintaining the absolute highest performance profile. In addition, Supermicro is expanding its GPU server lineup, already the largest in the industry, with this new Universal GPU server. Supermicro now offers three distinct Universal GPU systems: the 4U, 5U, and new 8U 8-GPU servers. The Universal GPU platforms support both current and future Intel and AMD CPUs, up to 400 W, 350 W, and higher.

"Supermicro is leading the industry with an extremely flexible and high-performance GPU server, which features the powerful NVIDIA A100 and H100 GPU," said Charles Liang, president, and CEO, of Supermicro. "This new server will support the next generation of CPUs and GPUs and is designed with maximum cooling capacity using the same chassis. We constantly look for innovative ways to deliver total IT Solutions to our growing customer base."

U.S. Government Restricts Export of AI Compute GPUs to China and Russia (Affects NVIDIA, AMD, and Others)

The U.S. Government has imposed restrictions on the export of AI compute GPUs to China and Russia without government authorization in the form of a waiver or a license. This impacts sales of products such as the NVIDIA A100 and H100; the AMD Instinct MI100 and MI200; and the upcoming Intel "Ponte Vecchio," among others. The restrictions came to light when NVIDIA disclosed on Wednesday that it had received a Government notification about licensing requirements for the export of its AI compute GPUs to Russia and China.

The notification doesn't specify the A100 and H100 by name, but defines AI inference performance thresholds that trigger the licensing requirements. The Government wouldn't single out NVIDIA, so competing products such as the AMD MI200 and the upcoming Intel Xe-HPC "Ponte Vecchio" fall within these restrictions as well. For NVIDIA, this impacts $400 million in TAM, unless the Government licenses specific Russian and Chinese customers to purchase these GPUs from NVIDIA. Such trade restrictions usually come with riders to prevent resale or transshipment by companies outside the restricted region (e.g., a distributor in a third, waived country importing these chips in bulk and reselling them to these countries).

Biren Technology Unveils BR100 7 nm HPC GPU with 77 Billion Transistors

Chinese company Biren Technology recently unveiled the Biren BR100 HPC GPU during its Biren Explore Summit 2022 event. The Biren BR100 features an in-house chiplet architecture with 77 billion transistors and is manufactured on a 7 nm process using TSMC's 2.5D CoWoS packaging technology. The card is equipped with 300 MB of onboard cache alongside 64 GB of HBM2E memory delivering 2.3 TB/s of bandwidth. This combination delivers performance above that of the NVIDIA Ampere A100, achieving 1,024 TFLOPS in 16-bit floating-point operations.

The company also announced the BR104, which features a monolithic design and should offer approximately half the performance of the BR100 at a TDP of 300 W. The Biren BR104 will be available as a standard PCIe card, while the BR100 will come in the form of an OAM-compatible board with a custom tower cooler. Pricing and availability information for these cards is currently unknown.

NVIDIA H100 SXM Hopper GPU Pictured Up Close

ServeTheHome, a tech media outlet focused on everything server/enterprise, posted an exclusive set of photos of NVIDIA's latest H100 "Hopper" accelerator. The fastest GPU NVIDIA has ever created, the H100 is made on TSMC's 4 nm manufacturing process and features over 80 billion transistors on an 814 mm² die, packaged using TSMC's CoWoS technology. Complementing the massive die is 80 GB of HBM3 memory that sits close to it. Pictured below is an SXM5 H100 module packed with VRMs for power regulation. Given that the rated TDP for this GPU is 700 Watts, power delivery is a serious concern, and NVIDIA has managed to keep it in check.

On the back of the card, we see one short and one longer mezzanine connector that act as power delivery connectors, a layout different from the previous A100 GPU. This board is labeled PG520 and is very close to the official renders that NVIDIA supplied on launch day.

NVIDIA H100 is a Compute Monster with 80 Billion Transistors, New Compute Units and HBM3 Memory

During the GTC 2022 keynote, NVIDIA announced the newest addition to its accelerator card family. Called the NVIDIA H100 accelerator, it is the company's most powerful creation ever. Built on TSMC's 4N 4 nm process with 80 billion transistors, the H100 can deliver some insane performance, according to NVIDIA. Featuring a new fourth-generation Tensor Core design, it delivers a six-fold performance increase compared to A100 Tensor Cores and a two-fold MMA (Matrix Multiply Accumulate) improvement. Additionally, new DPX instructions accelerate dynamic-programming algorithms by up to seven times over the previous A100 accelerator. Thanks to the new Hopper architecture, the Streaming Multiprocessor (SM) structure has been optimized for better transfer of large data blocks.

The full GH100 chip implementation features 144 SMs with 128 FP32 CUDA cores per SM, resulting in 18,432 CUDA cores at maximum configuration. The NVIDIA H100 GPU in the SXM5 board form factor features 132 SMs, totaling 16,896 CUDA cores, while the PCIe 5.0 add-in card has 114 SMs, totaling 14,592 CUDA cores. As much as 80 GB of HBM3 memory surrounds the GPU at 3 TB/s of bandwidth. Interestingly, the SXM5 variant features a very large TDP of 700 Watts, while the PCIe card is limited to 350 Watts; this is the result of the better cooling solutions offered for the SXM form factor. As far as performance figures are concerned, the SXM and PCIe versions provide two distinct sets of numbers. You can check out the performance estimates in various precision modes below, and you can read more about the Hopper architecture and what makes it special in the whitepaper published by NVIDIA.
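The CUDA core counts follow directly from the SM counts; a minimal sketch using only the figures above:

fp32_cores_per_sm = 128
variants = {"Full GH100": 144, "H100 SXM5": 132, "H100 PCIe": 114}  # SM counts from the article
for name, sms in variants.items():
    print(f"{name}: {sms} SMs x {fp32_cores_per_sm} = {sms * fp32_cores_per_sm:,} FP32 CUDA cores")
# Yields 18,432, 16,896 and 14,592 CUDA cores, respectively.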

NREL Acquires Next-Generation High Performance Computing System Based on NVIDIA Next-Generation GPU

The National Renewable Energy Laboratory (NREL) has selected Hewlett Packard Enterprise (HPE) to build its third-generation, high performance computing (HPC) system, called Kestrel. Named for a falcon with keen eyesight and intelligence, Kestrel's moniker is apropos for its mission—to rapidly advance the U.S. Department of Energy's (DOE's) energy research and development (R&D) efforts to deliver transformative energy solutions to the entire United States.

Installation of the new system will begin in the fall of 2022 in NREL's Energy Systems Integration Facility (ESIF) data center. Kestrel will complement the laboratory's current supercomputer, Eagle, during the transition. When completed—in early 2023—Kestrel will accelerate energy efficiency and renewable energy research at a pace and scale more than five times greater than Eagle, with approximately 44 petaflops of computing power.

NVIDIA CMP 170HX Mining Card Tested, Based on GA100 GPU SKU

NVIDIA's Cryptocurrency Mining Processor (CMP) series of graphics cards is made for only one purpose: mining cryptocurrency coins. Hence, their functionality is somewhat limited, and they cannot be used for gaming as regular GPUs can. Today, Linus Tech Tips got hold of NVIDIA's CMP 170HX mining card, which is not listed on the company website. According to the source, the card runs on NVIDIA's GA100-105F GPU, a version based on the regular GA100 SXM design used in data-center applications. Unlike its bigger brother, the GA100-105F SKU is a cut-down design with 4,480 CUDA cores and 8 GB of HBM2E memory. The complete design has 6,912 cores and 40/80 GB HBM2E memory configurations.

As for the choice of 8 GB of HBM2E memory, the Ethereum DAG file is under 5 GB, so the 8 GB buffer is sufficient for mining any coin out there. The card is powered by an 8-pin CPU power connector and draws about 250 Watts, and it can be adjusted down to 200 Watts while retaining its 165 MH/s Ethereum hash rate. This reference design is manufactured by NVIDIA and has no active cooling, as it is meant to be cooled in high-density server racks; only a colossal passive heatsink is attached, so airflow has to be provided by the server chassis. As far as pricing is concerned, Linus managed to get this card for $5,000, making it a costly mining option.
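Based on the numbers Linus reported, here is a short sketch comparing mining efficiency at the stock and reduced power limits; the hash rate is assumed to hold at 165 MH/s in both cases, as stated above:

hash_rate_mhs = 165  # reported Ethereum hash rate, held constant per the article
for label, watts in {"stock (250 W)": 250, "power-limited (200 W)": 200}.items():
    print(f"{label}: {hash_rate_mhs / watts:.2f} MH/s per watt")
# Efficiency improves from about 0.66 to 0.83 MH/s per watt at the reduced power limit.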

TOP500 Update Shows No Exascale Yet, Japanese Fugaku Supercomputer Still at the Top

The 58th edition of the TOP500 saw little change in the Top10. The Microsoft Azure system called Voyager-EUS2 was the only machine to shake up the top spots, claiming No. 10. Based on 48-core AMD EPYC processors running at 2.45 GHz working together with NVIDIA A100 80 GB GPUs, Voyager-EUS2 also utilizes Mellanox HDR InfiniBand for data transfer.

While there were no other changes to the positions of the systems in the Top10, Perlmutter at NERSC improved its performance to 70.9 Pflop/s. Housed at the Lawrence Berkeley National Laboratory, Perlmutter's increased performance couldn't move it from its previously held No. 5 spot.

KIOXIA Announces Production Availability of Native Ethernet Flash-Based SSDs

KIOXIA America, Inc. today announced the production availability of its EM6 Series Enterprise NVMe-oF solid state drives (SSDs) for Ethernet Bunch of Flash (EBOF) systems. Using the Marvell 88SN2400 NVMe-oF SSD converter controller that converts an NVMe SSD into a dual-ported 25Gb NVMe-oF SSD, KIOXIA EM6 Series drives expose the entire SSD bandwidth to the network.

Due to their ability to scale the performance of NVMe SSDs, native NVMe-oF architectures are well suited to applications such as artificial intelligence (AI)/machine learning (ML), high-performance computing (HPC), and storage expansion. In the case of HPC, the Lustre file system, which is used to provide high-bandwidth, parallel access to compute clusters, benefits from NVMe-oF based storage such as EBOF systems with EM6 SSDs, which enable high-availability (HA) configurations. An example HPC HA configuration consists of multiple, redundant network connections between a compute host and an EBOF with 88SN2400-connected NVMe SSDs, delivering scalable throughput based on the number of SSDs.