IBM Cloud Integrates Intel Gaudi 3 AI Accelerators, Demonstrating Superior Performance Over NVIDIA H100 and H200 GPUs

IBM Cloud Integrates Intel Gaudi 3 AI Accelerators, Demonstrating Superior Performance Over NVIDIA H100 and H200 GPUs

​Intel’s Gaudi 3 AI accelerator is now available on IBM Cloud, offering enterprises a scalable and cost-effective solution for AI workloads. This integration positions IBM Cloud as the first cloud service provider to adopt Gaudi 3, providing clients with enhanced performance and flexibility for AI applications.​

Key Features of Gaudi 3 on IBM Cloud:

  • High-Performance AI Processing: Gaudi 3 accelerators are designed with AI-specific features, including matrix math engines, tensor processing cores, high-bandwidth memory, and built-in Ethernet ports. These components facilitate accelerated inferencing of deep neural networks, supporting the deployment of large language models (LLMs) and retrieval-augmented generation (RAG) applications.
  • Scalability and Flexibility: IBM Cloud enables customers to scale AI workloads from single-node configurations with eight accelerators, offering a throughput of 9.6 TB/s, to expansive 1,024-node clusters with 8,192 accelerators, achieving a throughput of 9.83 PB/s. This scalability ensures that enterprises can tailor their AI infrastructure to meet specific performance and budgetary requirements.
  • Cost-Effective Solutions: Deploying Gaudi 3 accelerators on IBM Cloud provides a more affordable alternative to existing AI accelerators. Preliminary tests have shown that Gaudi 3 delivers superior performance at a lower cost compared to NVIDIA’s H100 and H200 GPUs, making it an attractive option for enterprises seeking efficient AI solutions.

Performance Benchmarks:

Initial evaluations by Signal65 have demonstrated that Gaudi 3 outperforms NVIDIA’s H100 and H200 GPUs in several AI models. For instance, in tests involving models like IBM Granite (8B) and Meta Llama-3.1, Gaudi 3 achieved higher throughput rates, particularly as batch sizes increased. These results highlight Gaudi 3’s capability to handle complex AI workloads effectively.

Integration with IBM’s AI Platform:

IBM plans to integrate Gaudi 3 support into its watsonx AI and Data Platform, further enhancing AI infrastructure resources for clients. This integration aims to optimize model inference performance, providing a comprehensive environment for developing and deploying AI applications.

Strategic Implications:

The adoption of Gaudi 3 accelerators on IBM Cloud signifies a strategic move to diversify AI hardware options, reducing reliance on a single supplier and fostering a more competitive market. This collaboration between IBM and Intel reflects a commitment to offering clients innovative and secure compute solutions tailored to the evolving demands of AI workloads.

In summary, the availability of Intel’s Gaudi 3 AI accelerators on IBM Cloud provides enterprises with a high-performance, scalable, and cost-effective platform for AI applications. This development enhances IBM Cloud’s AI capabilities, offering clients advanced tools to meet the growing demands of AI workloads.

RSS
Follow by Email
0
Would love your thoughts, please comment.x
()
x