By
Paula ParisiJune 4, 2024
Nvidia President and CEO Jensen Huang said the company will be upgrading its AI accelerators annually, with the Blackwell Ultra processor coming in 2025 and a next-generation platform called Rubin that is still in development planned for 2026. Rubin AI will utilize a type of high-bandwidth memory called HBM4 that addresses a bottleneck that has stifled the production of AI accelerators. Huang shared the news from Taiwan, where he delivered a keynote at the Computex trade show. Nvidia Inference Microservices were another focus, allowing AI applications to be deployed in minutes instead of weeks, Huang said. Continue reading Nvidia Teases Next-Gen AI Platform Rubin at Computex 2024
By
ETCentric StaffMarch 20, 2024
Nvidia unveiled what it is calling the world’s most powerful AI processing system, the Blackwell GPU, purpose built to power real-time generative AI on trillion-parameter large language models at what the company says will be up to 25x less cost and energy consumption than its predecessors. Blackwell’s capabilities will usher in what the company promises will be a new era in generative AI computing. News from Nvidia’s GTC 2024 developer conference included the NIM software platform, purpose built to streamline the setup of custom and pre-trained AI models in a production environment, and the DGX SuperPOD server, powered by Blackwell. Continue reading GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs