Google DeepMind Archives

Vertex AI Movie Studio Can Create Videos from Start to Score

By Paula Parisi
April 14, 2025

Among the many tech advancements unveiled at Google Cloud Next include a major generative media upgrade to Vertex AI, Google Cloud’s managed AI development platform. The new Vertex AI Media Studio lets enterprise users generate complete videos from scratch using text prompts. Lyria, Google’s text-to-music model is now available on Vertex in private preview. Both are subject to an “allowlist.” Chirp 3 now creates custom voices with just 10 seconds of audio input, while Imagen 3 has gained improved abilities for reconstructing missing or damaged portions of an image. Continue reading Vertex AI Movie Studio Can Create Videos from Start to Score

Deep Cogito Is Out of Stealth with Hybrid Reasoning Models

By Paula Parisi
April 10, 2025

San Francisco-based AI startup Deep Cogito has released five AI models in preview, making them available under an open-source license agreement. The models come in sizes 3B, 8B, 14B, 32B and 70B, with plans to release 109B, 400B and 671B versions in the weeks and months ahead. As for the current models, “each outperforms the best available open models of the same size, including counterparts from Meta, DeepSeek and Alibaba, across most standard benchmarks,” Deep Cogito claims, noting that the 70B model in particular “outperforms the newly released Llama 4 109B MoE model.” Continue reading Deep Cogito Is Out of Stealth with Hybrid Reasoning Models

Google Debuts Next-Gen Reasoning Models with Gemini 2.5

By Paula Parisi
March 27, 2025

Google has released what it calls its most intelligent AI model yet, Gemini 2.5. The first 2.5 model release, an experimental version of Gemini 2.5 Pro, is a next-gen reasoning model that Google says outperformed OpenAI o3-mini and Claude 3.7 Sonnet from Anthropic on common benchmarks “by meaningful margins.” Gemini 2.5 models “are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy,” according to Google. The new model comes just three months after Google released Gemini 2.0 with reasoning and agentic capabilities. Continue reading Google Debuts Next-Gen Reasoning Models with Gemini 2.5

Google Launches Agentspace in the UK and Promotes Chirp 3

By Paula Parisi
March 19, 2025

Google is expanding its AI presence in the UK market, hosting a splashy launch event there for Agentspace. Google in December launched Agentspace, an AI agent hub that makes it easy for enterprises to build, manage and deploy custom agents using Gemini. The gathering was hosted by Google DeepMind CEO Demis Hassabis, and Google Cloud CEO Thomas Kurian and included participation by local customers BT Group and advertising powerhouse WPP. Google invited UK businesses to store cloud data locally using its $1 billion data center, opening there this year. The company also promoted its new Chirp 3 audio generator, which offers HD voice synthesis. Continue reading Google Launches Agentspace in the UK and Promotes Chirp 3

YouTube Shorts Updates Dream Screen with Google Veo 2 AI

By Paula Parisi
February 19, 2025

YouTube Shorts has upgraded its Dream Screen AI background generator to incorporate Google DeepMind’s latest video model, Veo 2, which will also generate standalone video clips that users can post to Shorts. “Need a specific scene but don’t have the right footage? Want to turn your imagination into reality and tell a unique story? Simply use a text prompt to generate a video clip that fits perfectly into your narrative, or create a whole new world,” coaxes YouTube, which seems to be trying out “Dream Screen” branding as an umbrella for its genAI efforts. Continue reading YouTube Shorts Updates Dream Screen with Google Veo 2 AI

OpenAI Operator Agent Available to ChatGPT Pro Subscribers

By Paula Parisi
January 27, 2025

OpenAI has launched Operator, a semi-autonomous AI agent that uses a proprietary web browser to execute tasks like planning a vacation using Tripadvisor or booking restaurant reservations through OpenTable. “It can look at a webpage and interact with it by typing, clicking and scrolling,” explains OpenAI. Operator is powered by a new model called Computer-Using Agent (CUA), and is available in research preview to ChatGPT Pro subscribers in the U.S. Combining GPT-4o’s computer vision capabilities with advanced reasoning, CUA is trained to interact with graphical user interfaces (GUIs) — parsing menus, clicking buttons and reading screen text. Continue reading OpenAI Operator Agent Available to ChatGPT Pro Subscribers

Galaxy Unpacked: More AI for S25 and a Peek at AR Glasses

By Paula Parisi
January 27, 2025

Samsung’s new Galaxy S25 line — the Galaxy S25 Ultra, Galaxy S25+ and Galaxy S25 — will more tightly integrate AI, including AI agents, becoming “true AI companions” at a level previously unknown to mobile devices. That leap is credited largely to a “first-of-its-kind” Snapdragon 8 Elite customization for the Galaxy chipset that “delivers greater on-device processing power for Galaxy AI and superior camera range and control with Galaxy’s next-gen ProVisual Engine,” according to Samsung. In addition, the top-of-the-line Galaxy S25 Ultra has been redesigned with a slightly larger 6.9-inch screen and rounded bevel. Continue reading Galaxy Unpacked: More AI for S25 and a Peek at AR Glasses

Veo 2 Is Unveiled Weeks After Google Debuted Veo in Preview

By Paula Parisi
December 18, 2024

Attempting to stay ahead of OpenAI in the generative video race, Google announced Veo 2, which it says can output 4K clips of two-minutes-plus at 4096 x 2160 pixels. Competitor Sora can generate video of up to 20 seconds at 1080p. However, TechCrunch says Veo 2’s supremacy is “theoretical” since it is currently available only through Google Labs’ experimental VideoFX platform, which is limited to videos of up to 8-seconds at 720p. VideoFX is also waitlisted, but Google says it will expand access this week (with no comment on expanding the cap). Continue reading Veo 2 Is Unveiled Weeks After Google Debuted Veo in Preview

DeepMind Genie 2 Creates Worlds That Emulate Video Games

By Paula Parisi
December 6, 2024

Google DeepMind’s new Genie 2 is a large foundation world model that generates interactive 3D worlds that are being likened to video games. “Games play a key role in the world of artificial intelligence research,” says Google DeepMind, noting “their engaging nature, challenges and measurable progress make them ideal environments to safely test and advance AI capabilities.” Based on a simple prompt image, Genie 2 is capable of producing “an endless variety of action-controllable, playable 3D environments” — suitable for training and evaluating embodied agents — that can be played by a human or AI agent using keyboard and mouse inputs. Continue reading DeepMind Genie 2 Creates Worlds That Emulate Video Games

Google DeepMind Touts AI-Powered Quantum Error Detection

By Paula Parisi
November 26, 2024

Google DeepMind has come up with an error correction technique it says will make quantum computers more reliable, particularly at scale. While quantum computing holds tremendous promise — potentially able to solve in just a few hours problems it would take a conventional computer “billions of years” to figure out, Google claims — the systems are notoriously unstable, due to the delicacy of the “quantum state.” AlphaQubit is an AI-based decoder that identifies quantum computing errors with accuracy. Combining DeepMind’s machine learning expertise with Google Quantum AI error correction, the technique advances efforts to create a reliable quantum computer. Continue reading Google DeepMind Touts AI-Powered Quantum Error Detection

YouTube Updates Shorts Player, Extends Length to 3 Minutes

By Paula Parisi
October 7, 2024

Beginning October 15, YouTube Shorts will extend its maximum length to 3 minutes. The move competitively positions the Google unit against TikTok, which allows for videos of up to 10 minutes when recording, or an hour when uploading. Regular YouTube accommodates videos of up to 12 hours for verified accounts and 15 minutes for unverified accounts, whether live or uploaded. But in terms of marketing focus, the current attention is on short-form video. YouTube is also updating the Shorts player, adding templates, and introducing a Shorts trends page for mobile. Continue reading YouTube Updates Shorts Player, Extends Length to 3 Minutes

YouTube Unveils New AI-Powered Features at Creator Event

By Paula Parisi
September 20, 2024

YouTube is going all in on generative AI with nine new generative features announced at the Made on YouTube creator event in New York. Google DeepMind’s AI video generation model, Veo, is coming to YouTube Shorts later this year, enabling “even more incredible video backgrounds, breathing life into concepts that were once impossible to visualize,” as well as six-second standalone AI segments that can be incorporated into short videos. “Imagine a BookTuber stepping into the pages of the classic novel ‘The Secret Garden,’” suggests YouTube Chief Product Officer Johanna Voolich in describing the new AI-powered features. Continue reading YouTube Unveils New AI-Powered Features at Creator Event

Google DeepMind Releases Imagen 3 for Free to U.S. Users

By Paula Parisi
August 22, 2024

Google DeepMind has made its latest AI image generator, Imagen 3, free for use in the U.S. via the company’s ImageFX platform. Imagen 3 will be available in multiple versions, “each optimized for different types of tasks, from generating quick sketches to high-resolution images.” Google announced Imagen 3 at Google I/O in March, and in June made it available to enterprise users through Vertex. Using simplified natural language text input rather than “complex prompt engineering,” Google says Imagen 3 generates high-quality images in a range styles, from photorealistic, painterly and textured to whimsically cartoony. Continue reading Google DeepMind Releases Imagen 3 for Free to U.S. Users

Global Technology Companies Sign Pledge to Foster AI Safety

By Paula Parisi
May 24, 2024

Leading AI firms spanning Europe, Asia, North America and the Middle East have signed a new voluntary commitment to AI safety. The 16 signatory companies — including Amazon, Google DeepMind, Meta Platforms, Microsoft, OpenAI, xAI and China’s Zhipu AI — will publish outlines indicating how they will measure the risks posed by their frontier models. “In the extreme, leading AI tech companies including from China and the UAE have committed to not develop or deploy AI models if the risks cannot be sufficiently mitigated,” according to UK Technology Secretary Michelle Donelan. Continue reading Global Technology Companies Sign Pledge to Foster AI Safety

Google Teases Astra AI Assistant and Debuts Gemini 1.5 Pro

By Paula Parisi
May 17, 2024

Google is showing off a developmental chatbot it says represents the future of AI assistants. Called Project Astra, it has the ability to “see” and “hear,” remembering the information ingested, which it can then answer questions about — from simple queries such as “Where did I leave my glasses?” to unpacking and explaining computer code. Demonstrated at the Google I/O conference this week, Astra understands the world “just like people do” and is able to converse naturally, in real time. The company says some Project Astra features may come to Gemini late this year. Continue reading Google Teases Astra AI Assistant and Debuts Gemini 1.5 Pro