IBM Cloud Is First to Widely Implement Intel Gaudi 3 AI Chips

IBM is the first cloud customer for Intel’s Gaudi 3 AI accelerator chip, which it will make available in early 2025. The Gaudi 3 will be available for hybrid and on-site environments via the IBM Cloud, as part of Watsonx AI and on IBM data platforms. Gaudi 3, which began shipping in Q2 and is expected to go into mass production later this year, is IBM’s AI challenger to GPU accelerators from Nvidia and AMD, the latter having in January begun shipping its own HPC solution, the MI300X. Unlike that chip and Nvidia’s Hopper H100 and more recent Blackwell B200, the Gaudi 3 is not a GPU, but built on an architecture specifically for inference and deep learning. Continue reading IBM Cloud Is First to Widely Implement Intel Gaudi 3 AI Chips

GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs

Nvidia unveiled what it is calling the world’s most powerful AI processing system, the Blackwell GPU, purpose built to power real-time generative AI on trillion-parameter large language models at what the company says will be up to 25x less cost and energy consumption than its predecessors. Blackwell’s capabilities will usher in what the company promises will be a new era in generative AI computing. News from Nvidia’s GTC 2024 developer conference included the NIM software platform, purpose built to streamline the setup of custom and pre-trained AI models in a production environment, and the DGX SuperPOD server, powered by Blackwell. Continue reading GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs