Google Imagen 2 Now Generates 4-Second Clips on Vertex AI

During Google Cloud Next 2024 in Las Vegas, Google announced an updated version of its text-to-image generator Imagen 2 on Vertex AI that has the ability to generate video clips of up to four seconds. Google calls this feature “text-to-live images,” and it essentially delivers animated GIFs at 24 fps and 360×640 pixel resolution, though Google says there will be “continuous enhancements.” Imagen 2 can also generate text, emblems and logos in different languages, and has the ability to overlay those elements on existing images like business cards, apparel and products. Continue reading Google Imagen 2 Now Generates 4-Second Clips on Vertex AI

Google Adding Free AI Photo Editing Tools to Google Photos

Beginning May 15, Google Photos users can start accessing a suite of free AI-powered Magic Editor tools like Magic Eraser and Portrait Light. The features will also be accessible on more devices, including Pixel tablets. Last year, Google launched Magic Editor on Pixel 8 and Pixel 8 Pro phones. In addition to making the features available on all Pixel devices, all Google Photos users on Android and iOS will get baseline access to 10 Magic Editor saves per month. Additionally, those with a Pixel device or Premium Google One plan of at least 2TB will have unlimited use. Continue reading Google Adding Free AI Photo Editing Tools to Google Photos

Google Offers Public Preview of Gemini Pro for Cloud Clients

Google is moving its most powerful artificial intelligence model, Gemini 1.5 Pro, into public preview for developers and Google Cloud customers. Gemini 1.5 Pro includes what Google claims is a breakthrough in long context understanding, with the ability to run 1 million tokens of information “opening up new possibilities for enterprises to create, discover and build using AI.” Gemini’s multimodal capabilities allow it to process audio, video, text, code and more, which when combined with long context, “enables enterprises to do things that just weren’t possible with AI before,” according to Google. Continue reading Google Offers Public Preview of Gemini Pro for Cloud Clients

AI-Powered Video Generator Available for Google Workspace

Google Vids is a new AI-powered video creation app for Google Workspace. The aim is to integrate a simple AI video editor with the real-time collaboration capabilities of cloud-based text editors Docs, Sheets and Slides, “allowing people everywhere to tap into immersive storytelling at work.” Vids will be released to Workspace Labs in June. Vids will be able to generate an easy-to-edit storyboard and piece together a first draft, with suggestions from stock footage and stills, as well as background music. “It can also help you land your message with the right voiceover,” according to Google. Continue reading AI-Powered Video Generator Available for Google Workspace

YouTube Adds Shopping Features for Products, Virtual Stores

In 2023, viewers watched more than 30 billion hours of shopping-related videos on YouTube, according to the platform, which reports “a 25 percent increase in watch time” for videos that help people shop. The uptick coincided with the introduction of tagging features for creators, and now YouTube is expanding its retail involvement even further by allowing creators to set up storefronts and sell products in-app, as yet another way to monetize the service. The move comes as TikTok seeks to grow TikTok Shop as high as $17.5 billion in the U.S., a tenfold increase. Continue reading YouTube Adds Shopping Features for Products, Virtual Stores

Google Introduces Faster, More Efficient JPEG Coding Library

Google is attacking slow-loading web pages with the new JPEG image encoder/decoder Jpegli, which offers a 35 percent compression ratio improvement using high quality compression settings, the Alphabet company says. The Jpegli JPEG coding library offers backward compatibility via “a fully interoperable encoder and decoder complying with the original JPEG standard and its most conventional 8-bit formalism, and API/ABI compatibility with libjpeg-turbo and MozJPEG,” Google says. The resulting images compressed using Jpegli are “more precise and psychovisually effective” as a result of computations that make images “look clearer” with “fewer observable artifacts.” Continue reading Google Introduces Faster, More Efficient JPEG Coding Library

Opera Browser Is Experimenting with Local Support for LLMs

Opera has become the first browser to add support for large language models (LLMs). At this point the feature is experimental, and available only on the Opera One Developer browser as part of the AI Feature Drops program. The update offers about 150 LLMs from more than 50 different families, including Meta’s LLaMA, Google’s Gemma, Mixtral and Vicuna. Opera had previously only offered local support for its own Aria AI, a competitor to Microsoft Copilot and OpenAI’s ChatGPT. The local LLMs are being offered for testing as a complimentary addition to Opera’s online Aria service. Continue reading Opera Browser Is Experimenting with Local Support for LLMs

Big Tech Launches Consortium to Address AI Impact on Jobs

Artificial Intelligence is not angling to steal jobs, according to Big Tech, which is galvanizing its forces to push back against that perception by forming a new consortium that addresses the effect of AI on the workforce. Called the AI-Enabled ICT Workforce Consortium, it will “assess AI’s impact on technology jobs and identify skills development pathways for the roles most likely to be affected by artificial intelligence,” according to Cisco, which leads the initiative. Accenture, Eightfold, Google, IBM, Indeed, Intel, Microsoft and SAP are also participating. Continue reading Big Tech Launches Consortium to Address AI Impact on Jobs

OpenAI Integrates New Image Editor for DALL-E into ChatGPT

OpenAI has updated the editor for DALL-E, the artificial intelligence image generator that is part of the ChatGPT premium tiers. The update, based on the DALL-E 3 model, makes it easier for users to adjust their generated images. Shortly after DALL-E 3’s September debut, OpenAI integrated it into ChatGPT, enabling paid subscribers to generate images from text or image prompts. The new DALL-E editor interface lets users edit images “by selecting an area of the image to edit and describing your changes in chat” without using the selection tool. Desired changes can also be prompted “in the conversation panel,” according to OpenAI. Continue reading OpenAI Integrates New Image Editor for DALL-E into ChatGPT

Apple’s ReALM AI Advances the Science of Digital Assistants

Apple has developed a large language model it says has advanced screen-reading and comprehension capabilities. ReALM (Reference Resolution as Language Modeling) is artificial intelligence that can see and read computer screens in context, according to Apple, which says it advances technology essential for a true AI assistant “that aims to allow a user to naturally communicate their requirements to an agent, or to have a conversation with it.” Apple claims that in a benchmark against GPT-3.5 and GPT-4, the smallest ReALM model performed “comparable” to GPT-4, with its “larger models substantially outperforming it.” Continue reading Apple’s ReALM AI Advances the Science of Digital Assistants

U.S. and UK Form Partnership to Accelerate AI Safety Testing

The United States has entered into an agreement with the United Kingdom to collaboratively develop safety tests for the most advanced AI models. The memorandum of understanding aims at evaluating the societal and national defense risks posed by advanced models. Coming after commitments made at the AI Safety Summit in November, the deal is being described as the world’s first bilateral agreement on AI safety. The agreement, signed by U.S. Commerce Secretary Gina Raimondo and UK Technology Secretary Michelle Donelan, envisions the countries “working to align their scientific approaches” and to accelerate evaluations for AI models, systems and agents. Continue reading U.S. and UK Form Partnership to Accelerate AI Safety Testing

Amazon Increases Its Investment in Anthropic AI to $4 Billion

Amazon has added $2.75 billion to its initial September 2023 investment of $1.25 billion in Anthropic, completing its announced $4 billion stake in the artificial intelligence startup formed in 2021 by former members of OpenAI. As part of the resulting strategic collaboration, Anthropic’s most powerful models, including the Claude 3 series, are available on Amazon Bedrock, a service providing fully managed foundation models. Anthropic is using Amazon Web Services as its primary cloud provider and Amazon says Anthropic will use AWS Trainium and Inferentia chips “to build, train, and deploy its future models.” Continue reading Amazon Increases Its Investment in Anthropic AI to $4 Billion

Telegram Adds Business Features to Challenge Meta, Google

Messaging app Telegram has added business account features to create a custom start page, listings, maps, hours of operation, chatbot support and more. Anyone can turn their Telegram account into a Telegram Business account, and users don’t need coding skills. Public channels with 1,000 or more subscribers can receive 50 percent of the revenue from ads shown in their channels. Based in Dubai, Telegram says the channels of its global users generate over 1 trillion monthly views. In February it unveiled an ad program that adopted the TON blockchain’s Toncoin as its native currency. Continue reading Telegram Adds Business Features to Challenge Meta, Google

YouTube Creators Can Now Share Exclusive Shorts with Fans

Google’s YouTube has created a new model for its Shorts feed that lets creators share short-form videos as exclusive content for their paying viewers. The feature gives creators an opportunity to share exclusive content with their most ardent fans, in addition to other perks for paying subscribers, like badges, custom emojis, live streams and more. TikTok recently loosened its subscription requirements for creators, allowing more of them to participate. In March, the ByteDance owned service said it is renaming TikTok Live as “Subscription” and is opening it to “regular creators,” letting them post exclusive content that paying users can see. Continue reading YouTube Creators Can Now Share Exclusive Shorts with Fans

Google GenAI Accelerator Launches with $20 Million in Grants

Google.org, the charitable arm of the Alphabet giant, has launched a program to help fund non-profits working on technology to support “high-impact applications of generative AI.” The Google.org Accelerator: Generative AI is a six-month program that kicks off with more than $20 million in grants for 21 non-profit firms. Among them, student writing aid group Quill.org, job seeker for low- to middle-income countries Tabiya, and Benefits Data Trust, which helps low-income applicants access and enroll in public benefits. In addition to funds, the new unit provides mentorship, technical training and pro bono support from “a dedicated AI coach.” Continue reading Google GenAI Accelerator Launches with $20 Million in Grants