By Douglas Chan, January 8, 2025
Nvidia founder and CEO Jensen Huang kicked off CES 2025 with a keynote that was filled with new product announcements and visionary demonstrations of how the company plans to advance the field of AI. The first product that Huang unveiled was the GeForce RTX 50 series of consumer graphics processing units (GPUs). The series is also called RTX Blackwell because it is based on Nvidia’s latest Blackwell microarchitecture design for next generation data center and gaming applications. To showcase RTX Blackwell’s prowess, Huang played an impressively photorealistic video sequence of rich imagery under contrasting light ranges — all rendered in real time. Continue reading CES: Nvidia Unveils New GeForce RTX 50, AI Video Rendering
By Paula Parisi, December 13, 2024
Ayar Labs, which develops optical interconnect chips for large-scale AI workloads, has secured $155 million in financing, including from competing processor companies Nvidia, Intel and AMD. Founded in 2017, the Silicon Valley-based company is pursuing a different processing path, combining photonic elements with electronic circuits on each chip, an approach it says delivers faster, more efficient processing for artificial intelligence and high-performance computing. “This brings the company’s total funding to $370 million and raises the company’s valuation to above $1 billion,” Ayar notes, adding that the new funding allows the company to scale its optical I/O tech. Continue reading Nvidia, Intel and AMD Invest in AI Chiplet Developer Ayar Labs
By Paula Parisi, December 10, 2024
Meta Platforms has packed more artificial intelligence into a smaller package with Llama 3.3, which the company released last week. The open-source large language model (LLM) “improves core performance at a significantly lower cost, making it even more accessible to the entire open-source community,” Meta VP of Generative AI Ahmad Al-Dahle wrote on X. The 70 billion parameter, text-only Llama 3.3 is said to perform on par with the 405 billion parameter model from Meta’s Llama 3.1 release in July while requiring less computing power, significantly lowering its operational costs. Continue reading Meta’s Llama 3.3 Delivers More Processing for Less Compute
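As a rough illustration of why the smaller model is cheaper to serve (back-of-the-envelope arithmetic, not Meta's figures), the memory needed just to hold the weights scales linearly with parameter count:

```python
# Back-of-the-envelope estimate of weight memory at 16-bit precision.
# Illustrative only; not official Meta hardware requirements.
BYTES_PER_PARAM_FP16 = 2  # fp16/bf16 stores each parameter in 2 bytes

def weight_memory_gb(num_params: float) -> float:
    """Approximate gigabytes needed to hold the model weights alone."""
    return num_params * BYTES_PER_PARAM_FP16 / 1e9

for name, params in [("Llama 3.3 70B", 70e9), ("Llama 3.1 405B", 405e9)]:
    print(f"{name}: ~{weight_memory_gb(params):.0f} GB of weights")
# Llama 3.3 70B: ~140 GB of weights
# Llama 3.1 405B: ~810 GB of weights
```

By this estimate the 70B model needs roughly a sixth of the memory of the 405B model, before activations, KV cache or quantization are even considered.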
By Paula Parisi, December 5, 2024
Amazon Web Services is building a supercomputer in collaboration with Anthropic, the AI startup in which the e-commerce giant has an $8 billion minority stake. Hundreds of thousands of AWS’s flagship Trainium chips will be amassed in an “Ultracluster” that, when completed in 2025, will be one of the largest supercomputers in the world for model training, Amazon says. The company announced the general availability of AWS Trainium2-powered Amazon Elastic Compute Cloud (EC2) virtual servers as well as Trn2 UltraServers designed to train and deploy AI models, and teased next-generation Trainium3 chips. Continue reading AWS Building Trainium-Powered Supercomputer with Anthropic
By Paula Parisi, December 2, 2024
Lightricks has released an AI model called LTX Video (LTXV) it says generates five seconds of 768 x 512 resolution video (121 frames) in just four seconds, outputting in less time than it takes to watch. The model can run on consumer-grade hardware and is open source, positioning Lightricks as a mass market challenger to firms like Adobe, OpenAI, Google and their proprietary systems. “It’s time for an open-sourced video model that the global academic and developer community can build on and help shape the future of AI video,” Lightricks co-founder and CEO Zeev Farbman said. Continue reading Lightricks LTX Video Model Impresses with Speed and Motion
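A quick sanity check on the faster-than-real-time claim, using only the numbers quoted above (simple arithmetic, not an independent benchmark):

```python
# Figures from the announcement: 121 frames of 768x512 video (~5 seconds of playback)
# generated in roughly 4 seconds on suitable hardware.
frames = 121
clip_seconds = 5.0
generation_seconds = 4.0

playback_fps = frames / clip_seconds           # ~24 fps playback rate
generation_fps = frames / generation_seconds   # ~30 frames generated per second
realtime_factor = clip_seconds / generation_seconds

print(f"Playback rate:    {playback_fps:.1f} fps")
print(f"Generation rate:  {generation_fps:.1f} fps")
print(f"Real-time factor: {realtime_factor:.2f}x (above 1.0 means faster than playback)")
```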
By Paula Parisi, October 28, 2024
OpenAI is taking a new approach to generating media that it says is 50 times faster than the models commonly used today. Called sCM, the approach is a “consistency model,” a variation on the diffusion method used by many leading systems. OpenAI claims its new model is well suited to training on large-scale datasets and to generating video, audio and images of “comparable sample quality to leading diffusion models.” Diffusion models often require hundreds of sampling steps, which makes real-time applications challenging. OpenAI aims to change this with a faster system that requires less power. Continue reading OpenAI: sCM Generates Media 50x Faster Than Other Models
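The speedup comes down to how many times the network must be evaluated per sample. The toy sketch below is only a conceptual illustration of that difference; `denoise` and `consistency_step` are hypothetical placeholders, not OpenAI's sCM code:

```python
# Conceptual comparison of sampling cost. A diffusion sampler calls the network once
# per step, often hundreds of times; a consistency-style sampler needs only one or two
# calls. The functions below are stand-ins for trained networks.
import random

def denoise(x: float, step: int) -> float:
    return x * 0.99  # placeholder for one diffusion denoising step

def consistency_step(x: float) -> float:
    return x * 0.1   # placeholder for a single consistency-model evaluation

noise = random.gauss(0.0, 1.0)

x = noise
diffusion_calls = 0
for step in range(200):      # diffusion-style sampling: hundreds of sequential calls
    x = denoise(x, step)
    diffusion_calls += 1

y = consistency_step(noise)  # consistency-style sampling: a single call
print(f"Diffusion network calls: {diffusion_calls}, consistency calls: 1")
```

With one or two evaluations instead of hundreds, wall-clock generation time and power draw drop roughly in proportion.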
By Paula Parisi, October 21, 2024
Nvidia has debuted a new AI model, Llama-3.1-Nemotron-70B-Instruct, that it claims outperforms competitors GPT-4o from OpenAI and Anthropic’s Claude 3.5 Sonnet. The impressive showing has prompted speculation of an AI shakeup and a significant shift in Nvidia’s AI strategy, which has thus far been focused primarily on chipmaking. The model was quietly released on Hugging Face, and Nvidia says as of October 1 it ranked first on three top automatic alignment benchmarks, “edging out strong frontier models” and vaulting Nvidia to the forefront of the LLM field in areas like comprehension, context and generation. Continue reading Nvidia’s Impressive AI Model Could Compete with Top Brands
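Because the model was released on Hugging Face, it should be loadable with the standard transformers text-generation pipeline. The sketch below assumes a repo id of nvidia/Llama-3.1-Nemotron-70B-Instruct-HF (check the hub for the exact identifier), a recent transformers version, and enough GPU memory for a 70B model:

```python
# Hedged sketch of loading the model with Hugging Face transformers.
# The repo id and hardware assumptions are illustrative; a 70B model typically
# requires multiple high-memory GPUs or aggressive quantization.
import torch
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="nvidia/Llama-3.1-Nemotron-70B-Instruct-HF",  # assumed repo id
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [{"role": "user", "content": "Summarize the benefits of instruction tuning."}]
output = generator(messages, max_new_tokens=128)
print(output[0]["generated_text"])
```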
By Paula Parisi, October 8, 2024
Apple has released a new AI model called Depth Pro that can create a 3D depth map from a 2D image in under a second. The system is being hailed as a breakthrough that could revolutionize how machines perceive depth, with transformative impact on industries from augmented reality to self-driving vehicles. The predictions are “metric, with absolute scale” and do not rely on the camera metadata typically required for such mapping, according to Apple. Using a consumer-grade GPU, the model can produce a 2.25-megapixel depth map from a single image in only 0.3 seconds. Continue reading Apple Advances Computer Vision with Its Depth Pro AI Model
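A metric depth map with absolute scale can be back-projected directly into real-world 3D coordinates once a focal length is known (Depth Pro estimates its own). The NumPy sketch below shows the generic pinhole-camera math involved; it is illustrative and is not Apple's Depth Pro API:

```python
# Generic pinhole-camera back-projection: convert a metric depth map (meters per pixel)
# into 3D points. Illustrative only; not Apple's Depth Pro code.
import numpy as np

def depth_to_points(depth_m: np.ndarray, focal_px: float) -> np.ndarray:
    """depth_m: (H, W) depth in meters; focal_px: focal length in pixels."""
    h, w = depth_m.shape
    cx, cy = w / 2.0, h / 2.0                  # assume principal point at image center
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    x = (u - cx) * depth_m / focal_px
    y = (v - cy) * depth_m / focal_px
    return np.stack([x, y, depth_m], axis=-1)  # (H, W, 3) points in meters

# Toy example: a flat wall 2 meters away seen through a 1000-pixel focal length.
points = depth_to_points(np.full((480, 640), 2.0), focal_px=1000.0)
print(points.shape, points[240, 320])          # center pixel lands at roughly (0, 0, 2) m
```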
By Paula Parisi, September 23, 2024
BlackRock has joined forces with Microsoft to launch what will initially be a $30 billion investment fund to finance AI infrastructure — concentrating primarily on building data centers and developing energy projects. The amount could quickly scale to about $100 billion. Abu Dhabi-based tech investment firm MGX is also participating, as is Global Infrastructure Partners (GIP), which owns, operates and invests across energy, transport, digital and waste management. BlackRock announced it is in the process of acquiring GIP and says the deal is expected to close next month. The new fund is called the Global AI Infrastructure Investment Partnership (GAIIP). Continue reading BlackRock Teams with Microsoft to Advance AI Infrastructure
By Paula Parisi, September 13, 2024
Sony’s $700 PlayStation 5 Pro promises improved graphics and gameplay. The midcycle upgrade, releasing next month, aims to keep the console competitive with — or better than — ever-evolving gaming PCs. Sony says the new model supports 8K gaming, an upgrade to the native 4K available with the PS5. For gamers who want a disc drive for the PS5 Pro, an $80 model is available. Sony Interactive Entertainment’s popular “Gran Turismo 7” racing simulation game is poised to be the first 8K title for the PS5 Pro. Sony says the PS5 Pro’s GPU has 67 percent more compute and 28 percent faster memory, for rendering that is 45 percent faster than on its predecessor. Continue reading New Sony PS5 Pro Supports 8K and Improves 4K Ray Tracing
By Paula Parisi, September 11, 2024
IBM is the first cloud customer for Intel’s Gaudi 3 AI accelerator chip, which it will make available in early 2025. The Gaudi 3 will be available for hybrid and on-site environments via the IBM Cloud, as part of Watsonx AI and on IBM data platforms. Gaudi 3, which began shipping in Q2 and is expected to go into mass production later this year, is Intel’s AI challenger to GPU accelerators from Nvidia and AMD, the latter of which began shipping its own HPC solution, the MI300X, in January. Unlike that chip and Nvidia’s Hopper H100 and more recent Blackwell B200, the Gaudi 3 is not a GPU but is built on an architecture designed specifically for inference and deep learning. Continue reading IBM Cloud Is First to Widely Implement Intel Gaudi 3 AI Chips
By Paula Parisi, August 30, 2024
Nvidia has had another impressive quarter. Record revenue of $30 billion in Q2 was up 122 percent from a year ago, while data center revenue of $26.3 billion marked a 154 percent increase from the same period in 2023. The performance was seen by many as an assurance of AI’s staying power, although others raised concern that if the AI companies buying chips do not start generating profits soon, the sugar high of the two-year AI boom could precede a crash. Nvidia took the occasion to tout its next-generation Blackwell chips, reassuring investors that a mid-production “tweak” would not delay release. Continue reading AI Boom Continues to Drive Strong Nvidia Revenue and Profit
By Paula Parisi, August 21, 2024
California-based semiconductor manufacturer AMD is looking to take on Nvidia for a bigger share of business from the artificial intelligence boom. AMD plans to purchase data center equipment maker ZT Systems in a cash and stock deal that values the company at $4.9 billion. The deal, which is subject to regulatory approval, is part of AMD’s goal of offering a wider selection of chips, software and system designs to big data enterprise clients such as Microsoft, Google, Meta Platforms and Apple. Privately held ZT Systems, based in New Jersey, makes servers and related equipment for cloud computing infrastructure. Continue reading AMD Buying ZT Systems to Expand Data Center Capabilities
By Paula Parisi, July 25, 2024
In April, Meta Platforms revealed that it was working on an open-source AI model that performed as well as proprietary models from top AI companies such as OpenAI and Anthropic. Now, Meta CEO Mark Zuckerberg says that model has arrived in the form of Llama 3.1 405B, “the first frontier-level open-source AI model.” The company is also releasing “new and improved” Llama 3.1 70B and 8B models. In addition to general cost and performance benefits, the fact that the Llama 3.1 405B model is open source “will make it the best choice for fine-tuning and distilling smaller models,” according to Meta. Continue reading Meta Calls New Llama the First Open-Source Frontier Model
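Meta's point about distillation is that an open frontier model can act as a teacher whose outputs become training data for a smaller student. A minimal conceptual sketch of that workflow follows; `generate` and `fine_tune` are hypothetical placeholders, not Meta or Llama tooling:

```python
# Conceptual sketch of response distillation: the large open model labels prompts,
# and the resulting pairs are used to fine-tune a smaller model.
def generate(model_name: str, prompt: str) -> str:
    """Placeholder: query the teacher model (e.g., Llama 3.1 405B) for a completion."""
    return f"<completion from {model_name} for: {prompt}>"

def fine_tune(model_name: str, dataset: list[dict]) -> str:
    """Placeholder: supervised fine-tuning of the student on teacher outputs."""
    return f"{model_name}-distilled"

prompts = ["Explain photosynthesis in one sentence.", "Write a haiku about data centers."]

# 1) The 405B teacher produces high-quality responses for each prompt.
dataset = [{"prompt": p, "response": generate("llama-3.1-405b", p)} for p in prompts]

# 2) The smaller 8B student is fine-tuned on those prompt/response pairs.
student = fine_tune("llama-3.1-8b", dataset)
print(student)
```

Because the weights are openly available, this kind of teacher-student pipeline can run on a developer's own infrastructure, which is part of the cost advantage Meta cites.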
By Paula Parisi, June 3, 2024
Big Tech players have joined forces to develop a new industry standard to advance high-speed, low-latency communication among AI accelerators in data centers by coordinating component development. AMD, Broadcom, Cisco, Google, Hewlett Packard Enterprise (HPE), Intel, Meta Platforms and Microsoft are backing the Ultra Accelerator Link (UALink) promoter group. The group plans to define and establish an open industry standard that will enable AI accelerators to communicate more effectively. UALink aims to create a pathway for system OEMs, IT professionals and system integrators to connect and scale the AI accelerators in their data centers. Continue reading Big Tech Forms a Group to Develop AI Connectivity Standard