YouTube Dream Track Toolset Introduces an AI Remix Feature

YouTube has added a new feature to its Dream Track toolset, which lets select U.S. creators use AI to generate songs using the vocals of artists including John Legend, Demi Lovato, Charli XCX, Charlie Puth and others. Users can now remix Dream Track songs by describing the changes they want, stylistic and otherwise, in natural language. Selecting the “restyle a track” option steers users to create a 30-second generative snippet for use in YouTube Shorts. The remixed snippets will credit the original song with “clear attribution” through the Short itself and the Shorts audio pivot page, and will clearly indicate that the track was restyled with AI, according to Google.

AI Startup Suno Raises Funds to ‘Democratize Music Creation’

Music startup Suno, which leverages ChatGPT tech with the goal of emulating that app’s success in music, has raised $125 million in Series B funding, resulting in a valuation of $500 million. Founded by Harvard physics PhD turned tech entrepreneur Mikey Shulman, the company is being called “a rising star” in the realm of generative AI. Suno lets people generate original songs from text prompts or lyrics, with the AI supplying the melodies and harmonies for fully formed compositions. “We started Suno to build a future where anyone can make music,” according to the company.

Microsoft’s VASA-1 Can Generate Talking Faces in Real Time

Microsoft has developed VASA, a framework for generating lifelike virtual characters with vocal capabilities including speaking and singing. The first model, VASA-1, can perform the feat in real time from a single static image and a vocalization clip. The research demo showcases realistic audio-enhanced faces that can be fine-tuned to look in different directions or change expression in video clips of up to one minute at 512 x 512 pixels and up to 40fps “with negligible starting latency,” according to Microsoft. The company says “it paves the way for real-time engagements with lifelike avatars that emulate human conversational behaviors.”

Google’s MusicLM AI Can Generate Tunes from Text Prompts

Google is introducing a new artificial intelligence app called MusicLM that creates music in any style or genre based on text prompts and can translate a whistled melody or casually hummed snippet into instrument sounds. TechCrunch calls the technology “impressive” but says the Alphabet company, “fearing the risks, has no immediate plans to release it,” in recognition of the controversy surrounding AI models trained using copyrighted material. MusicLM was created using a dataset of 280,000 hours of music, resulting in the ability to generate minutes-long songs of “significant complexity.”