Monday, September 11th 2023

Intel Shows Strong AI Inference Performance

Press Release
Today, MLCommons published results of its MLPerf Inference v3.1 performance benchmark for GPT-J, the 6 billion parameter large language model, as well as computer vision and natural language processing models. Intel submitted results for Habana Gaudi 2 accelerators, 4th Gen Intel Xeon Scalable processors, and Intel Xeon CPU Max Series. The results show Intel's competitive performance for AI inference and reinforce the company's commitment to making artificial intelligence more accessible at scale across the continuum of AI workloads - from client and edge to the network and cloud.

"As demonstrated through the recent MLCommons results, we have a strong, competitive AI product portfolio, designed to meet our customers' needs for high-performance, high-efficiency deep learning inference and training, for the complete spectrum of AI models - from the smallest to the largest - with leading price/performance." -Sandra Rivera, Intel executive vice president and general manager of the Data Center and AI Group
Building on the MLCommons AI training update from June and the Hugging Face performance benchmarks that validate that Gaudi2 can outperform Nvidia's H100 on a state-of-the-art vision language model, today's results further reinforce that Intel offers the only viable alternative to Nvidia's H100 and A100 for AI compute needs. Every customer has unique considerations, and Intel is bringing AI everywhere with products that can address inference and training across the continuum of AI workloads. Intel's AI products give customers flexibility and choice when choosing an optimal AI solution based on their own respective performance, efficiency and cost targets, while helping them break from closed ecosystems.

The Habana Gaudi2 inference performance results for GPT-J provide strong validation of its competitive performance.With Gaudi2 software updates released every six to eight weeks, Intel expects to continue delivering performance advancements and expanded model coverage in MLPerf benchmarks.

Intel submitted all seven inference benchmarks, including GPT-J, on 4th Gen Intel Xeon Scalable processors. These results show great performance for general-purpose AI workloads, including vision, language processing, speech and audio translation models, as well as the much larger DLRM v2 recommendation and ChatGPT-J models. Additionally, Intel remains the only vendor to submit public CPU results with industry-standard deep learning ecosystem software.MLPerf, generally regarded as the most reputable benchmark for AI performance, enables fair and repeatable performance comparisons. Intel anticipates submitting new AI training performance results for the next MLPerf benchmark. The ongoing performance updates show Intel's commitment to support customers and address every node of the AI continuum: from low-cost AI processors to the highest-performing AI hardware accelerators and GPUs for the network, cloud and enterprise customers.
Show 0 Comments