Meta AI Image Analysis and Editing Beta Tested for WhatsApp

Meta’s popular instant messaging service WhatsApp is reportedly beta testing a feature that would allow the already integrated Meta AI chatbot to edit and reply to images. The capability was spotted in the WhatsApp beta for Android 2.24.14.20, with AI powered by Llama 3, the company’s newest large language model released in April. The beta version works via a camera button added to the text box for Meta AI chat in WhatsApp. When pressed, the button triggers a pop-up that indicates Meta AI can analyze and edit photos, though it’s currently unclear to what extent. Continue reading Meta AI Image Analysis and Editing Beta Tested for WhatsApp

ByteDance Rival Kuaishou Creates Kling AI Video Generator

China’s Kuaishou Technology has a video generator called Kling AI in public beta that is getting great word-of-mouth, with comments from “incredibly realistic” to “Sora killer,” a reference to OpenAI’s still in closed beta video generator. Kuaishou claims that using only text prompts, Kling can generate “AI videos that closely mimic the real world’s complex motion patterns and physical characteristics,” in sequences as long as two minutes at 30 fps and 1080p, while supporting various aspect ratios. Kuaishou is China’s second most popular short-form video app, after ByteDance’s Douyin, the Chinese version of TikTok. Continue reading ByteDance Rival Kuaishou Creates Kling AI Video Generator

Graphics Productivity Tool Canva Unveils Enterprise Redesign

Web-based editing application Canva unveiled a significant makeover this week in Los Angeles at the Canva Create event. Touting “a whole new Canva,” the company shared changes that impact the entire platform, from pricing to tools, templates and user interface. The new editor, designed to make it easier to jump between projects, is “available to the first one million users who discover the secret portal hidden in their Canva homepage, before becoming available to the entire Canva community from August.” The 11-year-old company, which claims 183 million free and paid monthly users, also unveiled an enterprise solution. Continue reading Graphics Productivity Tool Canva Unveils Enterprise Redesign

Adobe Considers Sora, Pika and Runway AI for Premiere Pro

Adobe plans to add generative AI capabilities to its Premiere Pro editing platform and is exploring the update with third-party AI technologies including OpenAI’s Sora, as well as models from Runway and Pika Labs, making it easier “to draw on the strengths of different models” within everyday workflows, according to Adobe. Editors will gain the ability to generate and add objects into scenes or shots, remove unwanted elements with a click, and even extend frames and footage length. The company is also developing a video model for its own Firefly AI for video and audio work in Premiere Pro. Continue reading Adobe Considers Sora, Pika and Runway AI for Premiere Pro

DaVinci Resolve 19 Has AI Motion Tracking and Color Grading

Blackmagic Design has unveiled the new DaVinci Resolve 19, with multi-source editing, neural engine AI tools, Resolve FX and Fairlight AI audio panning among the highlight features. With more than 100 feature upgrades in all, Resolve 19 boasts IntelliTrack AI, Ultra NR noise reduction, ColorSlice six vector grading palettes and Film Look Creator FX. The company also announced the DaVinci Resolve Micro Color Panel, a more affordable color panel for DaVinci Resolve software that Blackmagic says was designed in collaboration with the world’s leading colorists. These tools are featured at the Blackmagic booth at NAB 2024. Continue reading DaVinci Resolve 19 Has AI Motion Tracking and Color Grading

Google Adding Free AI Photo Editing Tools to Google Photos

Beginning May 15, Google Photos users can start accessing a suite of free AI-powered Magic Editor tools like Magic Eraser and Portrait Light. The features will also be accessible on more devices, including Pixel tablets. Last year, Google launched Magic Editor on Pixel 8 and Pixel 8 Pro phones. In addition to making the features available on all Pixel devices, all Google Photos users on Android and iOS will get baseline access to 10 Magic Editor saves per month. Additionally, those with a Pixel device or Premium Google One plan of at least 2TB will have unlimited use. Continue reading Google Adding Free AI Photo Editing Tools to Google Photos

OpenAI Integrates New Image Editor for DALL-E into ChatGPT

OpenAI has updated the editor for DALL-E, the artificial intelligence image generator that is part of the ChatGPT premium tiers. The update, based on the DALL-E 3 model, makes it easier for users to adjust their generated images. Shortly after DALL-E 3’s September debut, OpenAI integrated it into ChatGPT, enabling paid subscribers to generate images from text or image prompts. The new DALL-E editor interface lets users edit images “by selecting an area of the image to edit and describing your changes in chat” without using the selection tool. Desired changes can also be prompted “in the conversation panel,” according to OpenAI. Continue reading OpenAI Integrates New Image Editor for DALL-E into ChatGPT

Generative Tech Enables Multiple Versions of the Same Movie

Filmmaker Gary Hustwit and artist Brendan Dawes aspire to change the way audiences experience film. Their startup, Anamorph, has launched with an app that can reassemble different versions of the same film. The app debuted with “Eno,” a Hustwit-directed documentary about the music iconoclast Brian Eno that premiered in January at the Sundance Film Festival, where every “Eno” showing presented the audience with a unique viewing experience. Drawing scenes from a repository of over 500 hours of “Eno” material, the Anamorph app would potentially be able to generate what the company says is billions of different configurations. Continue reading Generative Tech Enables Multiple Versions of the Same Movie

Adobe’s Prototype AI Tool Is a ‘Photoshop for Music-Making’

Project Music GenAI Control, an experimental work from Adobe Research, is setting out to change how people create and edit custom audio and music. The prototype tool lets creators generate music from text prompts, “and then have fine-grained control to edit that audio for their precise needs,” according to Adobe. Designed to help create music for broadcasts, podcasts or other “audio that’s just the right mood, tone, and length,” it can generate music from text prompts like “powerful rock,” “happy dance” or “sad jazz,” says Adobe Research Senior Research Scientist Nicholas Bryan, a creator of the technology. Continue reading Adobe’s Prototype AI Tool Is a ‘Photoshop for Music-Making’

Apple’s Keyframer AI Tool Uses LLMs to Prototype Animation

Apple has taken a novel approach to animation with Keyframer, using large language models to add motion to static images through natural language prompts. “The application of LLMs to animation is underexplored,” Apple researchers say in a paper that describes Keyframer as an “animation prototyping tool.” Based on input from animators and engineers, Keyframer lets users refine their work through “a combination of prompting and direct editing,” the paper explains. The LLM can generate CSS animation code. Users can also use natural language to request design variations. Continue reading Apple’s Keyframer AI Tool Uses LLMs to Prototype Animation

Apple Launches Open-Source Language-Based Image Editor

Apple has released MGIE, an open-source AI model that edits images using natural language instructions. MGIE, short for MLLM-Guided Image Editing, can also modify and optimize images. Developed in conjunction with University of California Santa Barbara, MGIE is Apple’s first AI model. The multimodal MGIE, which understands text and image input, also crops, resizes, flips, and adds filters based on text instructions using what Apple says is an easier instruction set than other AI editing programs, and is simpler and faster than learning a traditional program, like Apple’s own Final Cut Pro. Continue reading Apple Launches Open-Source Language-Based Image Editor

VideoPoet: Google Launches a Multimodal AI Video Generator

Google has unveiled a new large language model designed to advance video generation. VideoPoet is capable of text-to-video, image-to-video, video stylization, video inpainting and outpainting, and video-to-audio. “The leading video generation models are almost exclusively diffusion-based,” Google says, citing Imagen Video as an example. Google finds this counter intuitive, since “LLMs are widely recognized as the de facto standard due to their exceptional learning capabilities across various modalities.” VideoPoet eschews the diffusion approach of relying on separately trained tasks in favor of integrating many video generation capabilities in a single LLM. Continue reading VideoPoet: Google Launches a Multimodal AI Video Generator

Stability AI Is Offering Paid Membership for Commercial Users

As the pressure ratchets up for AI companies to go beyond the wow factor and make money, Stability AI has formalized three subscription tiers as it seeks to expand commercial use of its open-source, multimodal core models. The Stability AI Membership offerings include a free tier for personal and research (i.e., non-commercial) use, a professional tier that costs $20 a month, and a custom-priced enterprise tier for large outfits. The company says that with the three tiers it is “striking a balance between fostering competitiveness and maintaining openness in AI technologies.” Continue reading Stability AI Is Offering Paid Membership for Commercial Users

GenAI Lets Snapchat+ Subscribers Create and Share Images

Snapchat+ is rolling out new artificial intelligence features that let subscribers use text prompts to create generative AI images to share with friends. In addition, the Dreams feature, which creates generative AI selfies, is now able to add your friends to those photos. Snapchat+ subscribers get one pack of 8 Dreams per month as part of their $3.99 monthly fee. An onscreen button labeled “AI” lets subscribers access the AI image generator to choose from a menu of prompts (including “sunny day at the beach” and “planet made of cheese”) or they can enter their own descriptions. Continue reading GenAI Lets Snapchat+ Subscribers Create and Share Images

Adobe Reveals Its New AI Tool for Editing Problematic Audio

Adobe has unveiled Project Sound Lift, an AI-powered technology that separates speech recordings into discrete tracks of voices, non-speech sounds and other background noise in video. The company describes Project Sound Lift as “a one-click solution” that leverages AI to help users easily manipulate audio recordings “across a range of scenarios” to “enhance, transform, and control speech and sound independently.” Adobe’s existing Enhance Speech technology, available in the company’s Premiere Pro editing program, has been integrated within Project Sound Lift to aid creators in producing studio-quality audio content. Continue reading Adobe Reveals Its New AI Tool for Editing Problematic Audio