AWS Building Trainium-Powered Supercomputer with Anthropic

Amazon Web Services is building a supercomputer in collaboration with Anthropic, the AI startup in which the e-commerce giant has an $8 billion minority stake. Hundreds of thousands of AWS’s flagship Trainium chips will be amassed in an “Ultracluster” that when it is completed in 2025 will be one of the largest supercomputers in the world for model training, Amazon says. The company announced the general availability of AWS Trainium2-powered Amazon Elastic Compute Cloud (EC2) virtual servers as well as Trn2 UltraServers designed to train and deploy AI models and teased next-generation Trainium3 chips. Continue reading AWS Building Trainium-Powered Supercomputer with Anthropic

Amazon Commits $230M in AWS Credits for GenAI Startups

Amazon has earmarked $230 million to invest in generative AI startups worldwide, providing funding in the form of “AWS credits, mentorship, and education to further their use of artificial intelligence and machine learning technologies.” The initiative will cast a global net, focusing on early-stage companies. About $80 million of that allocation will fund the second cohort of the AWS Generative AI Accelerator, which provides up to $1 million in credits “to each of the top 80 early-stage startups that are using generative AI to solve complex challenges.” Applications for the AWS Accelerator are open through July 19. Continue reading Amazon Commits $230M in AWS Credits for GenAI Startups

Meta Deploys Gen 2 MTIA AI Accelerator Chip in Data Centers

Meta’s next generation AI silicon is a 5nm chip designed to power the models that provide recommendations to those who use its social network platforms. The new MTIA inference accelerator is part of a “broader full-stack development program for custom, domain-specific silicon that addresses our unique workloads and systems,” Meta says. The next-gen MTIA more than doubles the compute and memory bandwidth of its predecessor, the 7nm MTIA v1 chip introduced in May 2023, resulting in 3x the performance, according to Meta, which says the new silicon is already live in 16 data centers. Continue reading Meta Deploys Gen 2 MTIA AI Accelerator Chip in Data Centers

Amazon Increases Its Investment in Anthropic AI to $4 Billion

Amazon has added $2.75 billion to its initial September 2023 investment of $1.25 billion in Anthropic, completing its announced $4 billion stake in the artificial intelligence startup formed in 2021 by former members of OpenAI. As part of the resulting strategic collaboration, Anthropic’s most powerful models, including the Claude 3 series, are available on Amazon Bedrock, a service providing fully managed foundation models. Anthropic is using Amazon Web Services as its primary cloud provider and Amazon says Anthropic will use AWS Trainium and Inferentia chips “to build, train, and deploy its future models.” Continue reading Amazon Increases Its Investment in Anthropic AI to $4 Billion

Amazon Plans to Invest Up to $4 Billion in AI Startup Anthropic

Amazon has entered into a strategic investment in San Francisco-based Anthropic, founded by former members of OpenAI. The AI startup will train and deploy future models using AWS Trainium and Inferentia chips to train and deploy future foundation models with AWS as its primary cloud provider. In turn, Amazon says it will invest up to $4 billion in Anthropic, as it strives to compete with other technology firms in the race to develop generative AI, seeding growth for what is shaping up to be an entirely new economic and social landscape. Continue reading Amazon Plans to Invest Up to $4 Billion in AI Startup Anthropic

AWS, Advertising Drive Amazon to 11 Percent Revenue Gain

Amazon’s AWS cloud-computing unit generated $22.1 billion in Q2, a 12 percent year-over-year gain that was a highlight in a strong quarter for the e-commerce giant. The company generated a total of $134.4 billion in revenue for the period ending in June, an 11 percent increase over the prior year. Advertising was also strong, jumping 22 percent to $10.68 billion. Cost-cutting and rebounding e-commerce helped propel the Seattle-based company to a quarterly profit of $6.75 billion (its strongest performance since Q4 2021), which contrasted sharply with a loss in the same period last year. Continue reading AWS, Advertising Drive Amazon to 11 Percent Revenue Gain

Cerebras, G42 Partner on a Supercomputer for Generative AI

Cerebras Systems has unveiled the Condor Galaxy 1, powered by nine networked supercomputers designed for a total of 4 exaflops of AI compute via 54 million cores. Cerebras says the CG-1 greatly accelerates AI model training, completing its first run on a large language AI trained for Abu Dhabi-based G42 in only 10 days. Cerebras and G42 have partnered to offer the Santa Clara, California-based CG-1 as a cloud service, positioning it as an alternative to Nvidia’s DGX GH200 cloud supercomputer. The companies plan to release CG-2 and CG-3 in early 2024. Continue reading Cerebras, G42 Partner on a Supercomputer for Generative AI

AWS Invests $100 Million in Generative AI Innovation Center

Amazon Web Services (AWS) is investing $100 million to fund a global AI accelerator, aiming to link its machine learning initiatives with experts, customers and partners worldwide. Twilio, Highspot, Lonely Planet and Ryanair are among the first companies to get onboard with the AWS Generative AI Innovation Center to develop new applications. “Through free workshops, engagements and training, AWS will help customers imagine and scope the use cases that will create the greatest value for their businesses, based on best practices and industry expertise,” the company explains. Continue reading AWS Invests $100 Million in Generative AI Innovation Center

Meta In-House Chip Designs Include Processing for AI, Video

Meta Platforms has shared additional details on its next generation of AI infrastructure. The company has designed two custom silicon chips, including one for training and running AI models and eventually powering metaverse functions like virtual reality and augmented reality. Another chip is tailored to optimize video processing. Meta publicly discussed its internal chip development last week ahead of a Thursday virtual event on AI infrastructure. The company also showcased an AI-optimized data center design and talked about phase two of deployment of its 16,000 GPU supercomputer for AI research. Continue reading Meta In-House Chip Designs Include Processing for AI, Video