By
Paula ParisiSeptember 11, 2024
IBM is the first cloud customer for Intel’s Gaudi 3 AI accelerator chip, which it will make available in early 2025. The Gaudi 3 will be available for hybrid and on-site environments via the IBM Cloud, as part of Watsonx AI and on IBM data platforms. Gaudi 3, which began shipping in Q2 and is expected to go into mass production later this year, is IBM’s AI challenger to GPU accelerators from Nvidia and AMD, the latter having in January begun shipping its own HPC solution, the MI300X. Unlike that chip and Nvidia’s Hopper H100 and more recent Blackwell B200, the Gaudi 3 is not a GPU, but built on an architecture specifically for inference and deep learning. Continue reading IBM Cloud Is First to Widely Implement Intel Gaudi 3 AI Chips
By
Paula ParisiAugust 30, 2024
Nvidia has had another impressive quarter. Record revenue of $30 billion in Q2 was up 122 percent from a year ago, while data center revenue of $26.3 billion marked a 154 percent increase from the same period in 2023. The performance was seen by many as an assurance of AI’s staying power, although others raised concern that if the AI companies buying chips do not start generating profits soon, the sugar high of the two-year AI boom could precede a crash. Nvidia took the occasion to tout its next-generation Blackwell chips, reassuring investors that a mid-production “tweak” would not delay release. Continue reading AI Boom Continues to Drive Strong Nvidia Revenue and Profit
By
Paula ParisiAugust 21, 2024
California-based semiconductor manufacturer AMD is looking to take on Nvidia for a bigger share of business from the artificial intelligence boom. AMD plans to purchase data center equipment maker ZT Systems in a cash and stock deal that values the company at $4.9 billion. The deal, which is subject to regulatory approval, is part of AMD’s goal of offering a wider selection of chips, software and system designs to big data enterprise clients such as Microsoft, Google, Meta Platforms and Apple. Privately held ZT Systems, based in New Jersey, makes gear and server solutions for cloud computing and related infrastructure. Continue reading AMD Buying ZT Systems to Expand Data Center Capabilities
By
Paula ParisiJuly 25, 2024
In April, Meta Platforms revealed that it was working on an open-source AI model that performed as well as proprietary models from top AI companies such as OpenAI and Anthropic. Now, Meta CEO Mark Zuckerberg says that model has arrived in the form of Llama 3.1 405B, “the first frontier-level open-source AI model.” The company is also releasing “new and improved” Llama 3.1 70B and 8B models. In addition to general cost and performance benefits, the fact that the Llama 3.1 405B model is open source “will make it the best choice for fine-tuning and distilling smaller models,” according to Meta. Continue reading Meta Calls New Llama the First Open-Source Frontier Model
By
Paula ParisiJune 3, 2024
Big Tech players have joined forces to develop a new industry standard to advance high-speed and low latency communication among data centers by coordinating component development. AMD, Broadcom, Cisco, Google, Hewlett Packard Enterprise (HPE), Intel, Meta Platforms and Microsoft are backing the Ultra Accelerator Link (UALink) promoter group. The group plans to define and establish an open industry standard that will enable AI accelerators to communicate more effectively. The UALink aims to create a pathway for system OEMs, IT professionals and system integrators to connect and scale their AI-connected data centers. Continue reading Big Tech Forms a Group to Develop AI Connectivity Standard
By
Paula ParisiMay 28, 2024
Nvidia just wrapped a record quarter, with no sign of interest cooling for the GPUs that have become essential to powering the AI boom. Revenue for the company’s most recent quarter was a record $26 billion, up 262 percent year-over-year. Profit also hit a new high, up nearly sevenfold to $14.88 billion compared to the same period a year earlier. The performance drove the already buoyant stock price above $1,000 a share. Company founder and CEO Jensen Huang proclaimed, “the next industrial revolution has begun,” with Nvidia playing a pivotal role in transforming data centers into “AI factories.” Continue reading Nvidia Reports Record Revenue, Profits as AI Demand Surges
By
Paula ParisiMay 15, 2024
Masayoshi Son, CEO of Japan’s SoftBank, wants to transform the tech conglomerate’s Arm subsidiary into an AI powerhouse, and he is investing $64 billion (10 trillion yen) to implement the plan, which includes turning the UK-based unit into an AI chip supplier. Son announced that by spring 2025 Arm is expected to have its first prototype, followed by mass production by contract suppliers and commercial sales in the fall. Arm designs but does not manufacture circuitry, supplying what it calls “chip architecture” to customers including Nvidia and Qualcomm. Continue reading SoftBank’s Arm Plans to Supply AI Chips, Open Data Centers
By
Paula ParisiMay 15, 2024
France has been pursuing Big Tech and Microsoft and Amazon are among the first to express interest. Microsoft has committed $4.3 billion to expand cloud and AI infrastructure there, sharing plans to bring as many as 25,000 advanced GPUs to France by the close of 2025. The software giant will also train one million people for AI and data jobs while supporting 2,500 AI startups over the next three years. Meanwhile, Amazon announced that it would invest up to $1.3 billion to expand its existing footprint of 35 logistics facilities in the country. The deals were announced Monday during the Choose France summit hosted by French President Emmanuel Macron. Continue reading Microsoft, Amazon Commit to Expanding Operations in France
By
Paula ParisiMay 8, 2024
New iPad Pros with OLED displays and the thinnest design ever, powered by Apple’s new M4 chip topped the news out of Cupertino’s “Let Loose” launch event, where redesigned iPad Airs were unveiled in 11‑inch and 13‑inch M2 configurations. The switch to OLED is a major change for Apple, which used mini-LED on the most recent models. And it’s not just garden variety OLED, but tandem OLED, combining two OLED displays for very high contrast with deep blacks and 1,600 nit peak brightness. The Ultra Retina XDR visuals are enabled by the M4 and its new display engine. Continue reading Apple Brings Tandem OLEDs to New M4-Powered iPad Pros
By
ETCentric StaffApril 12, 2024
Meta’s next generation AI silicon is a 5nm chip designed to power the models that provide recommendations to those who use its social network platforms. The new MTIA inference accelerator is part of a “broader full-stack development program for custom, domain-specific silicon that addresses our unique workloads and systems,” Meta says. The next-gen MTIA more than doubles the compute and memory bandwidth of its predecessor, the 7nm MTIA v1 chip introduced in May 2023, resulting in 3x the performance, according to Meta, which says the new silicon is already live in 16 data centers. Continue reading Meta Deploys Gen 2 MTIA AI Accelerator Chip in Data Centers
By
ETCentric StaffMarch 28, 2024
Microsoft is making improvements to the way its Copilot AI assistant works in Microsoft Teams and is using artificial intelligence to further integrate hybrid meetings. As the company leans deeper into AI, it continues to push hardware manufacturers to build an AI-optimized PC, making sure to include a dedicated Microsoft Copilot key. Microsoft joins Intel, Qualcomm and AMD in championing purpose-built AI PCs. In the meantime, the tech giant continues to build out features for existing PCs. The company is adding new ways to tap into the Copilot tool for meetings, chats, summaries and more. Continue reading Microsoft Improves Meetings and Messaging with Copilot, AI
By
ETCentric StaffMarch 20, 2024
Nvidia unveiled what it is calling the world’s most powerful AI processing system, the Blackwell GPU, purpose built to power real-time generative AI on trillion-parameter large language models at what the company says will be up to 25x less cost and energy consumption than its predecessors. Blackwell’s capabilities will usher in what the company promises will be a new era in generative AI computing. News from Nvidia’s GTC 2024 developer conference included the NIM software platform, purpose built to streamline the setup of custom and pre-trained AI models in a production environment, and the DGX SuperPOD server, powered by Blackwell. Continue reading GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs
By
ETCentric StaffFebruary 27, 2024
Qualcomm raised the curtain on a variety of artificial intelligence, 5G, and Wi-Fi technologies at Mobile World Congress Barcelona, which runs through Thursday. The San Diego-based chip designer unveiled an AI Hub it says will help developers create voice-, text- and image-based applications using pre-optimized AI models. Qualcomm’s flagship AI chips — the mobile Snapdragon 8 Gen 3 processor and the PC-centric Snapdragon X Elite — were announced last year. With the first splash of products now heading to market the company is promising to push the boundaries of 5G and 6G. Continue reading MWC: Qualcomm Unveils AI Hub and Promotes 5G, 6G Tech
By
ETCentric StaffFebruary 23, 2024
Demand for artificial intelligence computer chips drove Nvidia income up 769 percent to nearly $12.3 billion for Q4, year-over-year, and 286 percent — to just over $29.7 billion — for the full-year fiscal 2024 frame that ended January 28. Revenue was $22.1 billion (+265 percent) and $60.9 billion (+126 percent) for the respective periods. Data center sales hit record highs of $18.4 billion for the quarter, up 409 percent from the previous year, $47.5 billion for the fiscal year, an increase of 217 percent. Gaming revenue was flat for Q4, at $2.9 billion, and up 115 percent for the year. Continue reading Nvidia Revenue and Profits Soar on Strength of AI Chip Sales
By
ETCentric StaffFebruary 8, 2024
Nvidia and Cisco Systems want to simplify the process of creating in-house AI computing infrastructure with a new joint service offering end-to-end artificial intelligence solutions that aim to allow any enterprise firm to host its own AI data center. Along with its own networking gear, Cisco will globally broker Nvidia AI software and GPU cloud products along with jointly configured “purpose-built Ethernet networking-based solutions.” European cloud services provider ClusterPower is an early customer, using the new offering “to help drive data center operations with innovative AI/ML solutions.” Continue reading Cisco and Nvidia Team to Offer Help Developing In-House AI