By
Paula ParisiDecember 5, 2024
Amazon Web Services is building a supercomputer in collaboration with Anthropic, the AI startup in which the e-commerce giant has an $8 billion minority stake. Hundreds of thousands of AWS’s flagship Trainium chips will be amassed in an “Ultracluster” that when it is completed in 2025 will be one of the largest supercomputers in the world for model training, Amazon says. The company announced the general availability of AWS Trainium2-powered Amazon Elastic Compute Cloud (EC2) virtual servers as well as Trn2 UltraServers designed to train and deploy AI models and teased next-generation Trainium3 chips. Continue reading AWS Building Trainium-Powered Supercomputer with Anthropic
By
Paula ParisiNovember 30, 2023
Amazon Web Services is introducing Amazon Q, an AI chatbot geared toward enterprise clients who can customize it to increase productivity for their specific business needs. AWS also announced that it has updated its homegrown Graviton4 chips for a 30 percent performance boost. AWS confirmed it will be the first Big Tech firm to deploy the latest version of Nvidia’s Grace Hopper Superchip AI accelerator, and additionally will become a data center host for Nvidia’s DGX Cloud service. The announcements were disclosed at the AWS re:Invent conference in Las Vegas. Continue reading Amazon Unveils Productivity Chatbot, Gets Nvidia Superchip
By
Debra KaufmanOctober 15, 2015
The public cloud for software-as-a-service offerings, including back-end business services is catching fire, and Amazon and Google, which already run extensive public clouds, are well positioned to dominate in the arena. That’s despite Dell’s recent purchase of EMC, say the experts, because the two companies under EMC — VMware and Pivotal — although they are cloud computing companies, are not big players in the public cloud. Cloud platform services are expected to become a $44 billion market by 2020. Continue reading The Public Cloud is Inevitable, and Amazon Stands to Win Big
By
Rob ScottJanuary 2, 2014
In December, Google made its IaaS (infrastructure-as-a-service) Google Compute Engine (GCE) available as a full-fledged commercial service, after testing it in preview mode for more than a year. Last week, the company introduced its new Billing API as an easier way for developers to monitor and analyze how much running an application on the Cloud Platform costs. According to Google, the Billing Export offers a new means of accessing usage data, and is available in preview. Continue reading Google Gets Serious About Public Cloud: Previews New API