Veo 2 Is Unveiled Weeks After Google Debuted Veo in Preview

Attempting to stay ahead of OpenAI in the generative video race, Google announced Veo 2, which it says can output 4K clips (up to 4096 x 2160 pixels) more than two minutes long. Competitor Sora can generate video of up to 20 seconds at 1080p. However, TechCrunch says Veo 2’s supremacy is “theoretical” since it is currently available only through Google Labs’ experimental VideoFX platform, which is limited to videos of up to 8 seconds at 720p. VideoFX is also waitlisted, but Google says it will expand access this week (with no comment on raising the cap).

DeepMind Genie 2 Creates Worlds That Emulate Video Games

Google DeepMind’s new Genie 2 is a large foundation world model that generates interactive 3D worlds that have been likened to video games. “Games play a key role in the world of artificial intelligence research,” says Google DeepMind, noting “their engaging nature, challenges and measurable progress make them ideal environments to safely test and advance AI capabilities.” Based on a simple prompt image, Genie 2 is capable of producing “an endless variety of action-controllable, playable 3D environments” (suitable for training and evaluating embodied agents) that can be played by a human or AI agent using keyboard and mouse inputs.

Google DeepMind Touts AI-Powered Quantum Error Detection

Google DeepMind has come up with an error correction technique it says will make quantum computers more reliable, particularly at scale. Quantum computing holds tremendous promise (Google claims it could solve in just a few hours problems that would take a conventional computer “billions of years” to figure out), but the systems are notoriously unstable, due to the delicacy of the “quantum state.” AlphaQubit is an AI-based decoder that Google says identifies quantum computing errors with state-of-the-art accuracy. Combining DeepMind’s machine learning expertise with Google Quantum AI’s error correction work, the technique advances efforts to create a reliable quantum computer.

YouTube Updates Shorts Player, Extends Length to 3 Minutes

Beginning October 15, YouTube Shorts will extend its maximum length to 3 minutes. The move positions the Google unit to compete with TikTok, which allows for videos of up to 10 minutes when recording, or an hour when uploading. Regular YouTube accommodates videos of up to 12 hours for verified accounts and 15 minutes for unverified accounts, whether live or uploaded. But the current marketing focus is on short-form video. YouTube is also updating the Shorts player, adding templates, and introducing a Shorts trends page for mobile.

YouTube Unveils New AI-Powered Features at Creator Event

YouTube is going all in on generative AI with nine new generative features announced at the Made on YouTube creator event in New York. Google DeepMind’s AI video generation model, Veo, is coming to YouTube Shorts later this year, enabling “even more incredible video backgrounds, breathing life into concepts that were once impossible to visualize,” as well as six-second standalone AI segments that can be incorporated into short videos. “Imagine a BookTuber stepping into the pages of the classic novel ‘The Secret Garden,’” suggests YouTube Chief Product Officer Johanna Voolich in describing the new AI-powered features.

Google DeepMind Releases Imagen 3 for Free to U.S. Users

Google DeepMind has made its latest AI image generator, Imagen 3, free for use in the U.S. via the company’s ImageFX platform. Imagen 3 will be available in multiple versions, “each optimized for different types of tasks, from generating quick sketches to high-resolution images.” Google announced Imagen 3 at Google I/O in May, and in June made it available to enterprise users through Vertex. Using simplified natural language text input rather than “complex prompt engineering,” Google says Imagen 3 generates high-quality images in a range of styles, from photorealistic, painterly and textured to whimsically cartoony.

Global Technology Companies Sign Pledge to Foster AI Safety

Leading AI firms spanning Europe, Asia, North America and the Middle East have signed a new voluntary commitment to AI safety. The 16 signatory companies (including Amazon, Google DeepMind, Meta Platforms, Microsoft, OpenAI, xAI and China’s Zhipu AI) will publish outlines indicating how they will measure the risks posed by their frontier models. “In the extreme, leading AI tech companies including from China and the UAE have committed to not develop or deploy AI models if the risks cannot be sufficiently mitigated,” according to UK Technology Secretary Michelle Donelan.

Google Teases Astra AI Assistant and Debuts Gemini 1.5 Pro

Google is showing off a chatbot in development that it says represents the future of AI assistants. Called Project Astra, it has the ability to “see” and “hear” and remembers what it takes in, so it can later answer questions ranging from simple queries such as “Where did I leave my glasses?” to requests to unpack and explain computer code. Demonstrated at the Google I/O conference this week, Astra understands the world “just like people do” and is able to converse naturally, in real time. The company says some Project Astra features may come to Gemini later this year.

AI Video Startup Haiper Announces Funding and Plans for AGI

London-based AI video startup Haiper has emerged from stealth mode with $13.8 million in seed funding and a platform that generates up to two seconds of HD video from text prompts or images. Founded by alumni from Google DeepMind, TikTok and various academic research labs, Haiper is built around a bespoke foundation model that aims to serve the needs of the creative community while the company pursues a path to artificial general intelligence (AGI). Haiper is offering a free trial of what is currently a web-based user interface similar to offerings from Runway and Pika.

France’s Mistral AI Makes Its Global Debut on Microsoft Azure

Paris-based startup Mistral AI has made an immediate splash in the world of artificial intelligence, securing partnerships with IBM, Microsoft and others nine months after its launch. The company is offering natural language processing models, including its flagship Mistral Large, which makes Mistral only the second LLM provider (after OpenAI) to land a commercial berth on Microsoft’s Azure cloud, where Meta Platforms’ Llama 2 is available in preview. Boasting “top-tier reasoning capacities” and sophisticated conversational capabilities, Mistral Large specializes in “reasoning, analysis and generation (RAG), is multilingual and supports up to 32,000 tokens.”

Google Intros Gemini Advanced Chatbot, One AI Subscription

Google has rebranded its Bard chatbot as Gemini, and is launching a Gemini mobile app along with a subscription offering for Gemini Advanced that will be included as part of the new $19.99 monthly Google One AI Premium plan. As with Bard, Google will continue to make a free version of the Gemini chatbot available. Gemini Advanced is powered by Gemini Ultra, the most sophisticated of the three Gemini AI models Google unveiled in December. “Gemini Advanced not only allows you to have longer, more detailed conversations, it also better understands the context from your previous prompts,” Google explains.

Microsoft Says Phi-2 Can Outperform Large Language Models

Microsoft is releasing Phi-2, a text-to-text small language model (SLM) that outperforms some LLMs, yet is light enough to run on a mobile device or laptop, according to Microsoft CEO Satya Nadella. The 2.7 billion-parameter SLM beat Meta Platforms’ Llama 2 and France’s Mistral 7B (each with 7 billion parameters), says Microsoft, emphasizing that its complex reasoning and language comprehension are exceptional for a model with fewer than 13 billion parameters. For now, Microsoft is making it available “for research purposes only” under a custom license.

Google Announces the Launch of Gemini, Its Largest AI Model

Google is closing the year by heralding 2024 as the “Gemini era,” with the introduction of its “most capable and general AI model yet,” Gemini 1.0. This new foundation model comes in three sizes optimized for different use cases: Ultra, Pro and Nano. Alongside the launch, Google is releasing a new, Gemini-powered version of its Bard chatbot, available to English speakers in the U.S. and 170 global regions. Google touts Gemini as built from the ground up for multimodality, reasoning across text, images, video, audio and code. However, Bard will not yet incorporate Gemini’s ability to analyze sound and images.

Nations Sign the Bletchley Declaration in Support of Ethical AI

U.S. Vice President Kamala Harris warned global leaders that the existential threats posed by artificial intelligence are very real and urgently need to be addressed. Harris’ remarks, delivered in a speech at the U.S. Embassy in Britain, summarized the prevailing view of world governments participating in the first global AI Safety Summit. The two-day event kicked off Wednesday with news that 28 nations (including the U.S., European Union member states and China) signed the Bletchley Declaration on AI, committing to voluntary guidelines to work as a group toward responsible and ethical AI.

DeepMind and Academics Advance General Purpose Robots

“Robots are great specialists, but poor generalists,” according to Google DeepMind, which says models are typically trained for individual tasks, and changing a single variable can mean starting again from scratch. Now the London-based Alphabet subsidiary thinks it has come up with a way to combine knowledge across robotics for a general-purpose machine helper. In conjunction with 33 academic labs, Google DeepMind has pooled data from 22 different robot types to create the Open X-Embodiment dataset. At the same time, the group is releasing RT-1-X, a robotics transformer (RT) model derived from RT-1.