By Paula Parisi
June 6, 2024
Rene Haas, CEO of UK chip designer Arm Holdings, thinks his company’s platform architecture could nab as much as 50 percent of the Windows PC market by 2030. That would be more than four times its current 11 percent share in a market dominated by Intel’s x86 design. Because Arm’s designs were shaped by smartphones, they prioritize energy efficiency, an approach that is paying off in the era of power-hungry AI applications. Now the technology is being used for the first wave of Microsoft Copilot+ Windows laptops, and Arm has also set its sights on desktop PCs. Continue reading Arm CEO Says Company Aims to Capture Half of PC Market
By Paula Parisi
June 5, 2024
Intel launched new Xeon 6 processors built for high-density AI work in data centers. Intel CEO Pat Gelsinger emphasized performance and power efficiency as he introduced the next-gen Xeon, and said that the Gaudi 3 chips for AI model training and deployment that were released two months ago are less expensive than comparable silicon from Intel rivals. “Intel is one of the only companies in the world innovating across the full spectrum of the AI market opportunity — from semiconductor manufacturing to PC, network, edge and data center systems,” Gelsinger said, embracing open standards during his keynote at Computex. Continue reading Intel’s Xeon 6 Coming to Data Centers and Lunar Lake to PCs
By Paula Parisi
June 4, 2024
Nvidia President and CEO Jensen Huang said the company will be upgrading its AI accelerators annually, with the Blackwell Ultra processor coming in 2025 and a next-generation platform called Rubin that is still in development planned for 2026. Rubin AI will utilize a type of high-bandwidth memory called HBM4 that addresses a bottleneck that has stifled the production of AI accelerators. Huang shared the news from Taiwan, where he delivered a keynote at the Computex trade show. Nvidia Inference Microservices were another focus, allowing AI applications to be deployed in minutes instead of weeks, Huang said. Continue reading Nvidia Teases Next-Gen AI Platform Rubin at Computex 2024
By Paula Parisi
June 3, 2024
Big Tech players have joined forces to develop a new industry standard to advance high-speed, low-latency communication among data centers by coordinating component development. AMD, Broadcom, Cisco, Google, Hewlett Packard Enterprise (HPE), Intel, Meta Platforms and Microsoft are backing the Ultra Accelerator Link (UALink) promoter group. The group plans to define and establish an open industry standard that will enable AI accelerators to communicate more effectively. UALink aims to create a pathway for system OEMs, IT professionals and system integrators to connect and scale their AI-connected data centers. Continue reading Big Tech Forms a Group to Develop AI Connectivity Standard
By Paula Parisi
May 28, 2024
Nvidia just wrapped a record quarter, with no sign of interest cooling for the GPUs that have become essential to powering the AI boom. Revenue for the company’s most recent quarter was a record $26 billion, up 262 percent year-over-year. Profit also hit a new high, up nearly sevenfold to $14.88 billion compared to the same period a year earlier. The performance drove the already buoyant stock price above $1,000 a share. Company founder and CEO Jensen Huang proclaimed, “the next industrial revolution has begun,” with Nvidia playing a pivotal role in transforming data centers into “AI factories.” Continue reading Nvidia Reports Record Revenue, Profits as AI Demand Surges
By Paula Parisi
May 15, 2024
Masayoshi Son, CEO of Japan’s SoftBank, wants to transform the tech conglomerate’s Arm subsidiary into an AI powerhouse, and he is investing $64 billion (10 trillion yen) to implement the plan, which includes turning the UK-based unit into an AI chip supplier. Son announced that by spring 2025 Arm is expected to have its first prototype, followed by mass production by contract suppliers and commercial sales in the fall. Arm designs but does not manufacture circuitry, supplying what it calls “chip architecture” to customers including Nvidia and Qualcomm. Continue reading SoftBank’s Arm Plans to Supply AI Chips, Open Data Centers
By Paula Parisi
May 15, 2024
France has been pursuing Big Tech, and Microsoft and Amazon are among the first to express interest. Microsoft has committed $4.3 billion to expand cloud and AI infrastructure there, sharing plans to bring as many as 25,000 advanced GPUs to France by the close of 2025. The software giant will also train one million people for AI and data jobs while supporting 2,500 AI startups over the next three years. Meanwhile, Amazon announced that it would invest up to $1.3 billion to expand its existing footprint of 35 logistics facilities in the country. The deals were announced Monday during the Choose France summit hosted by French President Emmanuel Macron. Continue reading Microsoft, Amazon Commit to Expanding Operations in France
This week Microsoft announced plans to help establish Southeast Wisconsin “as a hub for AI-powered economic activity, innovation, and job creation,” according to the company’s press release. As part of the broad investment package, the tech giant is planning “$3.3 billion in cloud computing and AI infrastructure, the creation of the country’s first manufacturing-focused AI co-innovation lab, and an AI skilling initiative to equip more than 100,000 of the state’s residents with essential AI skills.” Microsoft’s new data center campus will replace the failed $10 billion Foxconn LCD manufacturing center planned for Mount Pleasant, situated in Racine County. Continue reading Microsoft to Invest $3.3 Billion in Building New AI Data Center
Amazon reported $143.3 billion in Q1 revenue, a 13 percent increase year-over-year, excluding the impact from foreign exchange rates. Net income came in at just over $10.3 billion, a nearly 229 percent surge that set a first-quarter record for the company. Both figures outperformed Wall Street expectations. Strong online sales and a booming cloud business, lifted by enterprise clients’ growing demand for artificial intelligence deployment, are credited with driving the results. Amazon President and CEO Andy Jassy called it “a good start to the year.” Continue reading Amazon Q1 Profits Surge on Strong Retail and AWS Comeback
By ETCentric Staff
April 29, 2024
Alphabet reported revenue of $80.5 billion for Q1, a 15 percent increase fueled largely by online advertising from Google Search and YouTube. The figure topped analyst estimates of $78.8 billion. Profit soared, rising 57 percent to more than $23.6 billion, far exceeding the forecast of $18.9 billion. The strong performance resulted in Alphabet announcing its first-ever shareholder dividend, at 20 cents per share, which pays out on June 17. Alphabet’s board also approved a $70 billion stock repurchase program, and the news-filled earnings event drove Alphabet shares up 13 percent in after-hours trading. Continue reading Alphabet Profit Up 57 Percent, Prompting First-Ever Dividend
By ETCentric Staff
April 12, 2024
Meta’s next-generation AI silicon is a 5nm chip designed to power the models that provide recommendations to those who use its social network platforms. The new MTIA inference accelerator is part of a “broader full-stack development program for custom, domain-specific silicon that addresses our unique workloads and systems,” Meta says. The next-gen MTIA more than doubles the compute and memory bandwidth of its predecessor, the 7nm MTIA v1 chip introduced in May 2023, resulting in 3x the performance, according to Meta, which says the new silicon is already live in 16 data centers. Continue reading Meta Deploys Gen 2 MTIA AI Accelerator Chip in Data Centers
By ETCentric Staff
April 4, 2024
Microsoft and OpenAI are contemplating an AI supercomputer data center that may cost as much as $100 billion. The aim would be to have the facility, called Stargate, operational by 2028 to drive OpenAI’s next generation of artificial intelligence. According to reports, the Stargate complex would span hundreds of acres in the U.S. and use up to 5 gigawatts of power, the equivalent of a substantial metropolitan power grid. In light of those power needs, a nuclear power source is said to be under consideration. The project is not yet green-lit, and no U.S. location has been selected. Continue reading Microsoft, OpenAI Considering a Supercomputer Data Center
By ETCentric Staff
April 3, 2024
Amazon has added $2.75 billion to its initial September 2023 investment of $1.25 billion in Anthropic, completing its announced $4 billion stake in the artificial intelligence startup formed in 2021 by former members of OpenAI. As part of the resulting strategic collaboration, Anthropic’s most powerful models, including the Claude 3 series, are available on Amazon Bedrock, a service providing fully managed foundation models. Anthropic is using Amazon Web Services as its primary cloud provider and Amazon says Anthropic will use AWS Trainium and Inferentia chips “to build, train, and deploy its future models.” Continue reading Amazon Increases Its Investment in Anthropic AI to $4 Billion
By ETCentric Staff
February 23, 2024
Demand for artificial intelligence computer chips drove Nvidia’s net income up 769 percent year-over-year to nearly $12.3 billion for Q4, and up 581 percent to just over $29.7 billion for fiscal 2024, which ended January 28. Revenue was $22.1 billion (+265 percent) and $60.9 billion (+126 percent) for the respective periods. Data center sales hit record highs of $18.4 billion for the quarter, up 409 percent from the previous year, and $47.5 billion for the fiscal year, an increase of 217 percent. Gaming revenue was flat for Q4, at $2.9 billion, and up 15 percent for the year. Continue reading Nvidia Revenue and Profits Soar on Strength of AI Chip Sales
By ETCentric Staff
February 8, 2024
Nvidia and Cisco Systems want to simplify the process of creating in-house AI computing infrastructure. Their new joint service offers end-to-end artificial intelligence solutions that aim to allow any enterprise firm to host its own AI data center. Along with its own networking gear, Cisco will globally broker Nvidia AI software and GPU cloud products along with jointly configured “purpose-built Ethernet networking-based solutions.” European cloud services provider ClusterPower is an early customer, using the new offering “to help drive data center operations with innovative AI/ML solutions.” Continue reading Cisco and Nvidia Team to Offer Help Developing In-House AI