Amazon’s Video Generator Turns Stills into Advertising Clips

Amazon has joined the ranks of firms offering generative video tools, although its release is aimed only at advertisers, at least for now. Simply called Video Generator, it can turn a product image into a video that showcases the product and even demonstrates its features, “leveraging Amazon’s unique insights to vividly bring a product story to life.” At the company’s Accelerate 2024 conference Amazon also debuted Live Image, which lets brands create animated GIFs from stills, a customizable chatbot assistant for third-party sellers, and a new AI-powered recommendation engine based on customer interests. Continue reading Amazon’s Video Generator Turns Stills into Advertising Clips

YouTube Unveils New AI-Powered Features at Creator Event

YouTube is going all in on generative AI with nine new generative features announced at the Made on YouTube creator event in New York. Google DeepMind’s AI video generation model, Veo, is coming to YouTube Shorts later this year, enabling “even more incredible video backgrounds, breathing life into concepts that were once impossible to visualize,” as well as six-second standalone AI segments that can be incorporated into short videos. “Imagine a BookTuber stepping into the pages of the classic novel ‘The Secret Garden,’” suggests YouTube Chief Product Officer Johanna Voolich in describing the new AI-powered features. Continue reading YouTube Unveils New AI-Powered Features at Creator Event

Google Unveils Gemini-Powered Ad Features and AI Image ID

AI-powered ad campaigns “are continuing to deliver big results for businesses large and small,” according to Google, which has put Gemini to work for Google Ads. The company announced at the DMEXCO digital marketing event in Cologne a new suite of Gemini-powered tools aimed at making the experience even better by providing additional insights and more control over where and how marketing assets are deployed globally using Google Ads. For starters, Gemini’s “conversational experience” for search campaigns will expand its language palette, making auto-generated headlines and images available in German, French and Spanish in the months ahead. Continue reading Google Unveils Gemini-Powered Ad Features and AI Image ID

Google Begins Rolling Out Gemini Live Free to Android Users

Google announced the company is making its new AI assistant Gemini Live available free to all Android users. The move follows the feature’s release last month to Gemini Advanced subscribers. This general release will occur gradually, and only in English for the time being. Gemini Live lets users have a more natural, free-flowing conversation with their phones than was available through Google Assistant via the “Hey, Google” prompt. Gemini inquiries are meant to be conversational, eliciting a back and forth that queriers can interrupt, adding more detail or veering to another topic entirely. Continue reading Google Begins Rolling Out Gemini Live Free to Android Users

Google Ads Adopts Open-Source TEE Setup for Data Privacy

In its ongoing effort to strike the right balance between ad targeting and consumer data collection, Google Ads is introducing a new process it calls “confidential matching.” Relying on the hardware and software used for confidential computing in Trusted Execution Environments (TEEs), Google says this new approach allows businesses to securely manage their first-party data. They’ll still be able to use it to reach customers and measure the impact of their digital ad campaigns, but the information will be isolated “during processing so that no one — including Google — can access the data being processed.” Continue reading Google Ads Adopts Open-Source TEE Setup for Data Privacy

OpenAI Previews New LLMs Capable of Complex Reasoning

OpenAI is previewing a new series of AI models that can reason and correct complex coding mistakes, providing a more efficient solution for developers. Powered by OpenAI o1, the new models are “designed to spend more time thinking before they respond, much like a person would,” and as a result can “solve harder problems than previous models in science, coding, and math,” OpenAI claims, noting that “through training, they learn to refine their thinking process, try different strategies, and recognize their mistakes.” The first model in the series is being released in preview in OpenAI’s popular ChatGPT and in the company’s API. Continue reading OpenAI Previews New LLMs Capable of Complex Reasoning

YouTube Adding Tools to Protect Against Unauthorized AI Use

YouTube is introducing AI detection tools designed to allow people to learn when their face and/or voice are copied and used in third-party videos. As part of the effort, YouTube’s existing Content ID program that protects copyrighted music will expand to include more broad-based voice simulation detection technology. The new tools aim to protect “people from a variety of industries — from creators and actors to musicians and athletes,” according to the company. The Google-owned platform is also coming up with a way to address unauthorized use of its content for training AI models. Continue reading YouTube Adding Tools to Protect Against Unauthorized AI Use

Anthropic Announces Enhanced Claude Enterprise Plan for AI

Anthropic has launched the Claude Enterprise subscription plan to compete with OpenAI’s ChatGPT Enterprise business solution. Focused on security and administrative controls, Claude Enterprise is designed to help organizations securely collaborate with artificial intelligence using proprietary internal data. Pricing will vary based on the number of seats and how Claude is used but is expected to be more expensive than Claude Pro and Claude Teams ($20 and $25 per month, respectively). An expanded 500K context window, more usage capacity, and a native GitHub integration for work on entire codebases are advantages Anthropic touts for Claude Enterprise. Continue reading Anthropic Announces Enhanced Claude Enterprise Plan for AI

YouTube Adds Family Center, Parent Insights on Teen Viewing

YouTube is adding a Family Center hub along with a feature that allows parents to link their accounts to those of their teen children for insight on child use patterns. Linked parents will receive alerts with aggregated information about things like the number of new uploads, subscriptions and comments, or when a teen starts a live stream. What they won’t get are details about the content itself. YouTube calls it “a collaborative approach to teen supervision on YouTube.” The move comes as federal and state legislators get more aggressive about regulating online safety for minors. Continue reading YouTube Adds Family Center, Parent Insights on Teen Viewing

Alibaba’s Latest Vision Model Has Advanced Video Capability

China’s largest cloud computing company, Alibaba Cloud, has released a new computer vision model, Qwen2-VL, which the company says improves on its predecessor in visual understanding, including video comprehension and text-to-image processing in languages including English, Japanese, French, Spanish, Chinese and others. The company says it can analyze videos of more than 20 minutes in length and is able to respond appropriately to questions about content. Third-party benchmark tests compare Qwen2-VL favorably to leading competitors and the company is releasing two open-source versions with a larger private model to come. Continue reading Alibaba’s Latest Vision Model Has Advanced Video Capability

X Launches a Beta Version of Its Video Offering on App Stores

After teasing a big screen interface for months, social media company X has released the beta version of its new TV app called X TV, designed to provide “a massive leap forward in transforming X into a video-first platform,” while looking to compete with industry leaders such as Google’s YouTube. Importantly, the new presentation provides X with video-specific play for ad partners, which the Elon Musk-owned company has been attempting to lure back after loosened content moderation standards sent many fleeing. X CEO Linda Yaccarino said the X TV app is debuting ad-free, but reports indicate the company will introduce ad options in the future. Continue reading X Launches a Beta Version of Its Video Offering on App Stores

Gemini Gets Custom Gems AI Assistants and Adds Imagen 3

Google is giving Gemini Advanced, Enterprise and Business subscribers the ability to create personalized AI assistants, which the company calls “Gems.” “Create your own personal AI experts on any topic you want,” the Alphabet company says. The search giant is also reintroducing Gemini’s image generation capabilities with its latest Imagen 3 model, which will be available to everyone. Gemini, which is Google’s ChatGPT competitor, will again have the ability to generate images of people, something Google disabled in February after controversy over some of the images. The company announced it has implemented new guardrails. Continue reading Gemini Gets Custom Gems AI Assistants and Adds Imagen 3

OpenAI Pushes GPT-4o Customization with Free Token Offer

OpenAI announced its newest model, GPT-4o, can now be customized. The company said that the ability to fine-tune the multimodal GPT-4o has been “one of the most requested features from developers.” Customization can move the model toward more specific structure and tone of responses or allow it to follow specific instruction sets geared toward individual use cases. Developers can now implement custom datasets, aiming for better performance at a lower cost. The ChatGPT maker is rolling out the welcome mat by offering 1 million training tokens per day “for free for every organization” through September 23. Continue reading OpenAI Pushes GPT-4o Customization with Free Token Offer

OSI Aims for Industry Standard by Defining ‘Open Source AI’

Creating a universal definition of “open source AI” has generated a fair amount of debate and confusion, with many outfits using elastic parameters in order to achieve a fit. Now the Open Source Initiative (OSI) — “the authority that defines Open Source” — has issued what it hopes will become the baseline definition. That definition, which includes the ability to “use the system for any purpose and without having to ask for permission,” excludes a lot of AI platforms that currently describe themselves as “open,” many freely available only for non-commercial use. OSI’s remaining three parameters involve the ability to inspect the system and modify and share it. Continue reading OSI Aims for Industry Standard by Defining ‘Open Source AI’

Google Reaches a Compromise with California News Outlets

Google has reached a deal with California to contribute to a $250 million fund supporting California journalism over five years in exchange for legislators abandoning a bill requiring the tech giant to pay to use news content in Google Search. The proposed compromise, which has already generated controversy, allocates roughly $70 million from the state budget with the rest primarily from Google. In addition to financially supporting newsrooms, the fund will create a National AI Innovation Accelerator to provide access to new tools. Both initiatives are expected to go live in 2025, pending legislative approval. Continue reading Google Reaches a Compromise with California News Outlets