Generative AI Archives - Page 5 of 32

YouTube Dream Track Toolset Introduces an AI Remix Feature

By Paula Parisi
November 18, 2024

YouTube has added a new feature to its Dream Track toolset, which lets select U.S. creators use AI to generate songs using the vocals of artists including John Legend, Demi Lovato, Charli XCX, Charlie Puth and others. Now users can remix Dream Track songs using natural language to describe the changes they would like, stylistic and otherwise. Selecting the “restyle a track” option will steer users to creating a 30-second generative snippet for use in YouTube Shorts. The remixed snippets will credit the original song with “clear attribution” through the Short itself and the Shorts audio pivot page. It will also clearly indicate that the track was restyled with AI, according to Google. Continue reading YouTube Dream Track Toolset Introduces an AI Remix Feature

Particle Launches AI News App That Summarizes in Quick Hits

By Paula Parisi
November 14, 2024

Particle, the AI-powered news aggregator created by a pair of Twitter alums, has launched after a year in beta. The iOS app summarizes current events in quick hits the startup says do not violate the copyrights of publishers whose news it shares. Instead of simply scraping publishers’ work for proprietary use, the startup seeks to compensate publishers and drive traffic to news sites with prominent links to sources accompanying each AI news summary. Developed by Sara Beykpour and Marcel Molina, Particle has raised more than $11 million in early funding led by Lightspeed. Continue reading Particle Launches AI News App That Summarizes in Quick Hits

Baidu’s Ernie AI Gets Improved Text-to-Image and App Builder

By Paula Parisi
November 14, 2024

Ernie, the foundation model for Baidu’s generative AI, has been updated with iRAG technology to mitigate visual hallucinations and a no-code tool called Miaoda that creates apps using natural language. The company behind China’s largest search engine says Ernie now handles 1.5 billion daily user queries, up from 50 million circa its March 2023 launch (a 30x increase). Baidu also debuted Ernie-powered smart glasses from its Xiaodu Technology hardware unit. The Xiaodu AI Glasses features built-in voice activation and cameras for taking photos and video. The news was shared at this week’s Baidu World 2024 in Shanghai. Continue reading Baidu’s Ernie AI Gets Improved Text-to-Image and App Builder

Copilot Now Enables Custom AI Themes in Microsoft Outlook

By Paula Parisi
November 13, 2024

Microsoft Copilot now helps subscription users create personal themes in Outlook using generative AI. In what Microsoft says is “the first instance of dynamic AI-generated theming in productivity applications,” Copilot can now display inboxes against dynamic backdrops based on geography, the weather, or anything else users can imagine. The new feature is available across all popular platforms: Windows, Mac, iOS, Android and the Web. Just like you might “spruce up your office with artwork or plants,” Copilot lets AI enhance your digital environment, according to Microsoft. Continue reading Copilot Now Enables Custom AI Themes in Microsoft Outlook

BodyTalk Dubs into 29 Languages with Facial Moves to Match

By Paula Parisi
November 12, 2024

Panjaya is a AI startup that aims to disrupt the world of video dubbing with a way to generate “hyperrealistic” recreations of a person’s voice speaking a new language. The system also automatically modifies the imagery to match lip and other physical movements to match the new speech patterns. Called BodyTalk, the technique is the launch point for Panjaya as it emerges from the stealth in which it conducted its R&D the past three years, backed by $9.5 million from venture funds and angel backers. The startup describes BodyTalk as “AI dubbing that looks and feels as natural as the original.” Continue reading BodyTalk Dubs into 29 Languages with Facial Moves to Match

Google Offers New AI-Powered Vids App to Workspace Users

By Rob Scott
November 11, 2024

Google announced it is rolling out its Gemini AI-powered video presentation app that enables users to easily create video presentations. Vids is a productivity app featured in the company’s suite of Google Workspace products. The new app uses AI model Gemini to automatically insert royalty-free stock video footage, create storyboards and scripts, and generate music and voiceovers. It allows users to add documents, slides, visuals, audio and transitions to the presentation’s timeline. “Personalize your content with Vids recording studio to deliver employee training, share company-wide announcements, meeting updates, and more,” suggests Google. Continue reading Google Offers New AI-Powered Vids App to Workspace Users

Autodesk’s AI Tool Turns Live-Action Video into 3D Animation

By Paula Parisi
November 8, 2024

Wonder Animation is the latest tool from Wonder Dynamics, the AI startup founded by actor Tye Sheridan and VFX artist Nikola Todorovic in 2017 that Autodesk purchased in May. Now in beta, Wonder Animation can automatically transpose live-action footage into stylized 3D animation. Creators can shoot using any camera, on any set or location, and easily convert to 3D CGI. Matching the camera position and movement to the characters and environment, Wonder Animation lets you film using any camera system and lenses, edit those shots using Maya, Blender or Unreal, and then reconstruct the result as 3D animation using AI. Continue reading Autodesk’s AI Tool Turns Live-Action Video into 3D Animation

Amazon Prime Video Offers AI-Powered Recaps of TV Shows

By Paula Parisi
November 6, 2024

Amazon Prime Video has begun offering X-Ray Recaps, summaries of favorite TV shows that catch you up without risk of spoilers. The generative AI-powered feature can create snapshots of any requested view — episodes, pieces of episodes or full seasons of TV shows. “Whether you’re a few minutes into a new episode, halfway through a season” or took a break to get popcorn and need a quick refresher, X-Ray Recaps will catch you up “personalized down to the exact minute of where you are watching,” according to Amazon, which assures “guardrails are applied” to ensure the generation of spoiler-free summaries. Continue reading Amazon Prime Video Offers AI-Powered Recaps of TV Shows

Nvidia’s AI Blueprint Develops Agents to Analyze Visual Data

By Paula Parisi
November 6, 2024

Nvidia’s growing AI arsenal now includes video search and summarization tool AI Blueprint, which helps developers build visual AI agents that analyze video and image content. The agents can answer user questions, generate summaries and even enable alerts for specific scenarios. The new feature is part of Metropolis, Nvidia’s developer toolkit for building computer vision applications using generative AI. Globally, enterprises and public organizations increasingly rely on visual information. Cameras, IoT sensors and autonomous vehicles are ingesting visual data at high rates, and visual agents can help monitor and make sense of that workflow. Continue reading Nvidia’s AI Blueprint Develops Agents to Analyze Visual Data

Startup Noma Aims to Secure the Entire Data and AI Lifecycle

By Paula Parisi
November 5, 2024

As companies move forward with leveraging their proprietary data in generative AI applications, enterprises are contending with existing security solutions that may be inadequate for that task. Israeli startup Noma Security is addressing that concern. Just out of stealth mode, Noma has raised $32 million in a Series A round led by Ballistic Ventures with support from Glilot Capital Partners, Cyber Club London and a collection of angel investors. While enterprise firms that host their models at large cloud outfits have access to built-in MLOps security tools, those who are self-hosting, using smaller cloud operations, or want added protection might be interested in Noma. Continue reading Startup Noma Aims to Secure the Entire Data and AI Lifecycle

D-ID’s New Business-Use Avatars Can Converse in Real Time

By Paula Parisi
November 5, 2024

D-ID has launched two new types of AI-powered avatars: Premium+ and Express. The company’s video-to-video avatar tools aim to provide personal look-alikes that can sub for their creators in uses ranging from instructional videos to business presentations, offloading on-camera duties in areas including sales, marketing and customer support. “Premium+ Avatars can generate hyper-realistic digital humans that are indistinguishable from real people and will serve as the foundation for fully interactive digital agents revolutionizing how brands communicate,” while Express Avatars can rapidly generate serviceable avatars “from just one minute of source footage.” Continue reading D-ID’s New Business-Use Avatars Can Converse in Real Time

Amazon Pushes AI, Records Growth in Q3 Revenue and Profit

By Rob Scott
November 4, 2024

Amazon reported major revenue and profit increases during its third quarter, beating Wall Street’s forecasts, based largely on the company’s e-commerce sales and increasing demand for its cloud services. Capital expenditure, which reached a record amount following Amazon’s recent investments in artificial intelligence, will maintain its momentum as the company plans $75 billion capex on developing generative AI services over 2024-2025. “The faster we grow demand, the faster we have to invest capital in data centers, network gear and hardware,” explained CEO Andy Jassy. “We invest in all that upfront in advance of when we can monetize it.” Continue reading Amazon Pushes AI, Records Growth in Q3 Revenue and Profit

Midjourney Makes Powerful AI Image Editor Available in Alpha

By Paula Parisi
October 28, 2024

Midjourney is turning heads with its new image editor, which lets users upload images and then make adjustments. The company’s models — most recently Midjourney 6.1 — accept uploaded images as a reference to use for generative results. Now the Midjourney image editor allows precise adjustments to aspects of the frame. An “image retexturing mode” is also being introduced, as is v2 of its “AI moderator.” The new features are only available to users with yearly memberships, monthly memberships for the past 12 months, or those who have generated at least 10,000 Midjourney images. Continue reading Midjourney Makes Powerful AI Image Editor Available in Alpha

OpenAI: sCM Generates Media 50x Faster Than Other Models

By Paula Parisi
October 28, 2024

OpenAI is taking a new approach to generating media that it says is 50 times faster than the models commonly used today. Called sCM, the approach is a “consistency model,” a variation on the diffusion method used by many leading systems. OpenAI claims its new model is ideal for training for large scale datasets and generating video, audio and images that are of “comparable sample quality to leading diffusion models.” Such models often require hundreds of steps, creating challenges when it comes to real-time applications. OpenAI aims to change this with a faster system that requires less power. Continue reading OpenAI: sCM Generates Media 50x Faster Than Other Models

Runway’s Act-One Facial Capture Could Be a ‘Game Changer’

By Paula Parisi
October 25, 2024

Runway is launching Act-One motion capture system that uses video and voice recordings to map human facial expressions onto characters using the company’s latest model, Gen-3 Alpha. Runway calls it “a significant step forward in using generative models for expressive live action and animated content.” Compared to past facial capture techniques — which typically require complex rigging — Act-One is driven directly and only by the performance of an actor, requiring “no extra equipment,” making it more likely to capture and preserve an authentic, nuanced performance, according to the company. Continue reading Runway’s Act-One Facial Capture Could Be a ‘Game Changer’