Snapchat: My AI Goes Multimodal with Google Cloud, Gemini

Snap Inc. is leveraging its relationship with Google Cloud to use Gemini for powering generative AI experiences within Snapchat’s My AI chatbot. The multimodal capabilities of Gemini on Vertex AI will greatly increase the My AI chatbot’s ability to understand and operate across different types of information such as text, audio, image, video and code. Snapchatters can use My AI to take advantage of Google Lens-like features, including asking the chatbot “to translate a photo of a street sign while traveling abroad, or take a video of different snack offerings to ask which one is the healthiest option.” Continue reading Snapchat: My AI Goes Multimodal with Google Cloud, Gemini

Latest Gemma 2 Models Emphasize Security and Performance

Google has unveiled three additions to its Gemma 2 family of compact yet powerful open-source AI models, emphasizing safety and transparency. The company’s Gemma 2 2B is a 2.6 billion parameter update to the lightweight 2B parameter Gemma 2, with built-in improvements in safety and performance. Built on Gemma 2, ShieldGemma is a suite of safety content classifier models that “filter the input and outputs of AI models and keep the user safe.” Interoperability model tool Gemma Scope offers what Google calls “unparalleled insight into our models’ inner workings.” Continue reading Latest Gemma 2 Models Emphasize Security and Performance

Gemini Powering Google Vids Multimedia Presentation Builder

Google has launched the beta version of its Gemini-powered Google Vids productivity app, which lets users create work-related video presentations that embed documents, slides, audio recordings and even additional videos into a timeline. Incorporated into Workspace Labs, Google’s AI preview space, Google says invited participants can use Vids to “build a narrative with high quality templates” or “get to a first draft faster.” Access to Google’s royalty-free stock content library and Vids recording studio means a project can be completed “without ever leaving Workspace,” according to the company. Continue reading Gemini Powering Google Vids Multimedia Presentation Builder

Anthropic’s Claude 3.5: ‘Frontier Intelligence at 2x the Speed’

Anthropic has launched a powerful new AI model, Claude 3.5 Sonnet, that can analyze text and images and generate text. That its release comes a mere three months after Anthropic debuted Claude 3 indicates just how quickly the field is developing. The Google-backed company says Claude 3.5 Sonnet has set “new industry benchmarks for graduate-level reasoning (GPQA), undergraduate-level knowledge (MMLU), and coding proficiency (HumanEval).” Sonnet is Anthropic’s mid-tier model, between Haiku and, on the high-end, Opus. Anthropic says 3.5 Sonnet is twice as fast as 3 Opus, offering “frontier intelligence at 2x the speed.” Continue reading Anthropic’s Claude 3.5: ‘Frontier Intelligence at 2x the Speed’

Google Teases Astra AI Assistant and Debuts Gemini 1.5 Pro

Google is showing off a developmental chatbot it says represents the future of AI assistants. Called Project Astra, it has the ability to “see” and “hear,” remembering the information ingested, which it can then answer questions about — from simple queries such as “Where did I leave my glasses?” to unpacking and explaining computer code. Demonstrated at the Google I/O conference this week, Astra understands the world “just like people do” and is able to converse naturally, in real time. The company says some Project Astra features may come to Gemini late this year. Continue reading Google Teases Astra AI Assistant and Debuts Gemini 1.5 Pro

Google Imagen 2 Now Generates 4-Second Clips on Vertex AI

During Google Cloud Next 2024 in Las Vegas, Google announced an updated version of its text-to-image generator Imagen 2 on Vertex AI that has the ability to generate video clips of up to four seconds. Google calls this feature “text-to-live images,” and it essentially delivers animated GIFs at 24 fps and 360×640 pixel resolution, though Google says there will be “continuous enhancements.” Imagen 2 can also generate text, emblems and logos in different languages, and has the ability to overlay those elements on existing images like business cards, apparel and products. Continue reading Google Imagen 2 Now Generates 4-Second Clips on Vertex AI

Google Offers Public Preview of Gemini Pro for Cloud Clients

Google is moving its most powerful artificial intelligence model, Gemini 1.5 Pro, into public preview for developers and Google Cloud customers. Gemini 1.5 Pro includes what Google claims is a breakthrough in long context understanding, with the ability to run 1 million tokens of information “opening up new possibilities for enterprises to create, discover and build using AI.” Gemini’s multimodal capabilities allow it to process audio, video, text, code and more, which when combined with long context, “enables enterprises to do things that just weren’t possible with AI before,” according to Google. Continue reading Google Offers Public Preview of Gemini Pro for Cloud Clients

AI-Powered Video Generator Available for Google Workspace

Google Vids is a new AI-powered video creation app for Google Workspace. The aim is to integrate a simple AI video editor with the real-time collaboration capabilities of cloud-based text editors Docs, Sheets and Slides, “allowing people everywhere to tap into immersive storytelling at work.” Vids will be released to Workspace Labs in June. Vids will be able to generate an easy-to-edit storyboard and piece together a first draft, with suggestions from stock footage and stills, as well as background music. “It can also help you land your message with the right voiceover,” according to Google. Continue reading AI-Powered Video Generator Available for Google Workspace

Anthropic’s Claude 3 AI Is Said to Have ‘Near-Human’ Abilities

Anthropic has released Claude 3, claiming new industry benchmarks that see the family of three new large language models approaching “near-human” cognitive capability in some instances. Accessible via Anthropic’s website, the three new models — Claude 3 Haiku, Claude 3 Sonnet and Claude 3 Opus — represent successively increased complexity and parameter count. Sonnet is powering the current Claude.ai chatbot and is free, for now, requiring only an email sign-in. Opus comes with the the $20 monthly subscription for Claude Pro. Both are generally available from the Anthropic website and via API in 159 countries, with Haiku coming soon. Continue reading Anthropic’s Claude 3 AI Is Said to Have ‘Near-Human’ Abilities

Reddit Announces IPO on Heels of Expanded Deal with Google

Community message board and social news aggregator Reddit, founded in 2005, has filed to go public on the New York Stock Exchange in an IPO observers say may be complete in a matter of weeks. It is the first social media company to go public in many years, with Snap Inc.’s 2017 offering cited as the most recent stock market splash. Reddit’s bankers are reportedly seeking a $5 billion valuation, about half the $10 billion it was valued at for a 2021 private funding round. Reddit filed with the SEC the same day it announced an “expanded partnership” with Google to use Vertex AI. Continue reading Reddit Announces IPO on Heels of Expanded Deal with Google

Cineverse to Launch cineSearch Powered by Google’s Vertex AI

Global streamer Cineverse has launched an AI-powered search and discovery tool called cineSearch. Developed using the Vertex AI platform from Google Cloud, cineSearch will be released in public beta this spring. A waitlist has already been started, and Cineverse says it will be made more widely available in partnership with OEM and third-party streaming platform partners in the coming months. Simultaneously, the Los Angeles-based firm is rolling out a new ad platform called Cineverse 360, or C360 for short, that will connect brands across omnichannel services reaching 82 million monthly viewers at launch. Continue reading Cineverse to Launch cineSearch Powered by Google’s Vertex AI

Unpacked: Samsung Intros Galaxy AI with Next Gen S Phones

During this week’s Unpacked event, Samsung introduced Galaxy AI, a suite of artificial intelligence tools designed for the new Galaxy S series smartphones — the Galaxy S24, Galaxy S24+, and Galaxy S24 Ultra. “AI amplifies nearly every experience on the Galaxy S24 series,” including real-time text and call translations, a powerful suite of creative tools in the ProVisual Engine and a new kind of “gestural search that lets users circle, highlight, scribble on or tap anything onscreen” to see related search results. The AI enhancements are largely enabled by a multiyear deal with Google and Qualcomm. Samsung also debuted a wearable accessory, the Galaxy Ring. Continue reading Unpacked: Samsung Intros Galaxy AI with Next Gen S Phones

Google Debuts Turnkey Gemini AI Studio for Developing Apps

Google is rolling out Gemini to developers, enticing them with tools including AI Studio, an easy-to-navigate Web-based platform that will serve as a portal to the multi-tiered Gemini ecosystem, beginning with Gemini Pro, with Gemini Ultra to come next year. The service aims to allow developers to quickly create prompts and Gemini-powered chatbots, providing access to API keys to integrate them into apps. They’ll also be able to access code, should projects require a full featured IDE. The site is essentially a revamped version of what was formerly Google’s MakerSuite. Continue reading Google Debuts Turnkey Gemini AI Studio for Developing Apps

AWS Rolls Out Bedrock Generative AI Service, Adds Llama 2

In a move to put “generative AI at the fingertips of every business, from startups to enterprises,” Amazon Web Services is commercially rolling out the Bedrock service it announced in April. Bedrock offers a wide range of foundation models from Amazon’s own Titan to products from Anthropic, Stability AI and soon Meta Platforms. The fully managed Bedrock service makes its generative FMs operable through a single, simple API. This means customers can experiment with various leading FMs and customize simple apps in-house, without the need for a third-party diving into their proprietary data. Continue reading AWS Rolls Out Bedrock Generative AI Service, Adds Llama 2

Google Introduces an AI Watermark That Cannot Be Removed

Google DeepMind and Google Cloud have teamed to launch what they claim is an indelible AI watermark tool, which if it works would mark an industry first. Called SynthID, the technique for identifying AI-generated images is being launched in beta. The technology embeds its digital watermark “directly into the pixels of an image, making it imperceptible to the human eye, but detectable for identification,” according to DeepMind. SynthID is being released to a limited number of Google’s Vertex AI customers using Imagen, a Google AI language model that generates photorealistic images. Continue reading Google Introduces an AI Watermark That Cannot Be Removed