IBM Cloud Is First to Widely Implement Intel Gaudi 3 AI Chips

IBM is the first cloud customer for Intel’s Gaudi 3 AI accelerator chip, which it will make available in early 2025. The Gaudi 3 will be available for hybrid and on-site environments via the IBM Cloud, as part of Watsonx AI and on IBM data platforms. Gaudi 3, which began shipping in Q2 and is expected to go into mass production later this year, is Intel’s AI challenger to GPU accelerators from Nvidia and AMD; AMD began shipping its own HPC solution, the MI300X, in January. Unlike that chip and Nvidia’s Hopper H100 and more recent Blackwell B200, the Gaudi 3 is not a GPU but is built on an architecture designed specifically for inference and deep learning.

AI Boom Continues to Drive Strong Nvidia Revenue and Profit

Nvidia has had another impressive quarter. Record revenue of $30 billion in Q2 was up 122 percent from a year ago, while data center revenue of $26.3 billion marked a 154 percent increase from the same period in 2023. The performance was seen by many as an assurance of AI’s staying power, although others raised concerns that if the AI companies buying chips do not start generating profits soon, the sugar high of the two-year AI boom could precede a crash. Nvidia took the occasion to tout its next-generation Blackwell chips, reassuring investors that a mid-production “tweak” would not delay release.

Intel’s Xeon 6 Coming to Data Centers and Lunar Lake to PCs

Intel launched new Xeon 6 processors built for high-density AI work in data centers. Intel CEO Pat Gelsinger emphasized performance and power efficiency as he introduced the next-gen Xeon, and said that the Gaudi 3 chips for AI model training and deployment that were released two months ago are less expensive than comparable silicon from Intel rivals. “Intel is one of the only companies in the world innovating across the full spectrum of the AI market opportunity, from semiconductor manufacturing to PC, network, edge and data center systems,” Gelsinger said, embracing open standards during his keynote at Computex.

Nvidia Reports Record Revenue, Profits as AI Demand Surges

Nvidia just wrapped a record quarter, with no sign of interest cooling for the GPUs that have become essential to powering the AI boom. Revenue for the company’s most recent quarter was a record $26 billion, up 262 percent year-over-year. Profit also hit a new high, up nearly sevenfold to $14.88 billion compared to the same period a year earlier. The performance drove the already buoyant stock price above $1,000 a share. Company founder and CEO Jensen Huang proclaimed, “the next industrial revolution has begun,” with Nvidia playing a pivotal role in transforming data centers into “AI factories.”

GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs

Nvidia unveiled what it is calling the world’s most powerful AI processing system, the Blackwell GPU, purpose-built to power real-time generative AI on trillion-parameter large language models at what the company says will be up to 25x lower cost and energy consumption than its predecessors. The company promises that Blackwell’s capabilities will usher in a new era in generative AI computing. News from Nvidia’s GTC 2024 developer conference included the NIM software platform, purpose-built to streamline the setup of custom and pre-trained AI models in a production environment, and the DGX SuperPOD server, powered by Blackwell.