By
Paula ParisiJuly 18, 2024
YouTube Music is working to improve its discovery capabilities. The Google unit is testing an AI-powered personalized radio feature for Premium subscribers in the U.S., and is also gradually rolling out something called Sound Search, which lets users describe a type of sound, including by humming it, then having it searched from a catalog that features “over 100 million official songs,” according to YouTube Music. The feature was introduced on a limited basis on Android in May, and is now expanding to iOS users, albeit on what is still a limited basis. Continue reading YouTube Music Expands Its Sound Search and Tests AI Radio
By
Paula ParisiJuly 15, 2024
New York-based speech synthesis software startup ElevenLabs has launched its latest AI development — Voice Isolator and an API to go with it. Voice Isolator is designed to extract background noise, leaving clear dialogue for film, podcast, and interview post-production. The Voice Isolator API lets developers integrate the new product into third-party applications. To use the technology, content is uploaded and processed by the Voice Isolator model, resulting in what the company claims is speech comparable in quality to that obtained in a recording studio. The app is described as “free, with some limitations.” Continue reading ElevenLabs Voice Isolator Audio Post Tool Released with API
By
Paula ParisiJuly 10, 2024
YouTube has released an eraser tool update that makes it easy to remove copyrighted music from videos without disturbing the remaining audio, like dialogue and sound effects. The Erase Song update uses an AI algorithm to detect and remove the offending material, making it more accurate than what had previously been available, as well as easier. Creators whose material has Content ID claims can now excise the objectionable material without having to manually edit and upload a new video, thereby avoiding potential restrictions on where the video is viewable or whether it can be monetized. Continue reading YouTube AI Song Eraser Easily Removes Copyright Material
Spotify recently introduced a new $10.99 per month Basic streaming plan in the U.S., which includes “the music streaming benefits of your Premium plan without the monthly audiobook listening time.” As part of its move to provide “more choice for U.S. subscribers,” Spotify now offers subscriptions including an $11.99 per month Premium Individual plan, $16.99 Premium Duo option, $19.99 Premium Family (for up to 6 members of one household), and Audiobooks Access for $9.99 per month. Additionally, in an effort to boost video content the company is allowing podcasters, even those not officially hosted by Spotify, to upload video podcasts. Continue reading Spotify Offers Basic Streaming Plan, New Podcaster Feature
By
Paula ParisiJune 28, 2024
A group that includes the world’s three largest music labels — Sony, Universal and Warner — are backing federal lawsuits brought by the Recording Industry Association of America against AI companies Suno and Udio. Claiming “mass infringement,” the suits allege the startups scraped libraries of copyrighted songs to train models that power generative audio products allowing consumers to create music using text prompts. Suno is based in Massachusetts while Udio and its parent Uncharted are headquartered in New York, with the actions filed earlier this week in their respective states. Continue reading Recording Industry Sues AI Startups Citing Mass Infringement
By
Paula ParisiJune 27, 2024
Synthesia, which uses AI to create business avatars for use in content such as training, presentation and customer service videos, has announced a major platform update. “Coming soon” with Synthesia 2.0 are full-body avatars that include hands capable of a wide range of motions. Users can animate motion using skeletal sequences on which the persona selected from the catalog can then be automatically mapped. Starting next month, the Nvidia-backed UK company will offer the ability to incorporate brand identity — including typography, colors and logos — into templated videos. A new translation tool automatically applies updates to all languages. Continue reading Lifelike AI Avatars to Get New Features with Synthesia Update
By
Paula ParisiJune 20, 2024
Meta Platforms is publicly releasing five new AI models from its Fundamental AI Research (FAIR) team, which has been experimenting with artificial intelligence since 2013. These models including image-to-text, text-to-music generation, and multi-token prediction tools. Meta is introducing a new technique called AudioSeal, an audio watermarking technique designed for the localized detection of AI-generated speech. “AudioSeal makes it possible to pinpoint AI-generated segments within a longer audio snippet,” according to Meta. The feature is timely in light of concern about potential misinformation surrounding the fall presidential election. Continue reading Meta’s FAIR Team Announces a New Collection of AI Models
By
Paula ParisiJune 19, 2024
Google DeepMind has unveiled new research on AI tech it calls V2A (“video-to-audio”) that can generate soundtracks for videos. The initiative complements the wave of AI video generators from companies ranging from biggies like OpenAI and Alibaba to startups such as Luma and Runway, all of which require a separate app to add sound. V2A technology “makes synchronized audiovisual generation possible” by combining video pixels with natural language text prompts “to generate rich soundscapes for the on-screen action,” DeepMind writes, explaining that it can “create shots with a dramatic score, realistic sound effects or dialogue.” Continue reading DeepMind’s V2A Generates Music, Sound Effects, Dialogue
By
Paula ParisiJune 13, 2024
Nokia made what it claims is “the world’s first immersive voice and audio call” using cell phones, made possible by the new 3GPP Immersive Voice and Audio Services (IVAS) codec that lets consumers hear 3D spatial sound in real-time. The codec — which Nokia participated in crafting — is a major leap from today’s standard monophonic smartphone voice call experience and is part of the upcoming 5G Advanced standard. The innovation paves the way towards enhanced immersive spatial communications, extended reality and metaverse applications, says Nokia, explaining that it works across “any connected device,” including smartphones, tablets and PCs. Continue reading Nokia Makes the First-Ever 3D Spatial Audio Cell Phone Call
By
Paula ParisiJune 7, 2024
Stability AI has added another audio product to its lineup, releasing the open-source text-to-audio generator Stable Audio Open 1.0 for sound design. The new model can generate up to 47 seconds of samples and sound effects, including drum beats, instrument riffs, ambient sounds, foley and production elements. It also allows for adapting variations and changing the style of audio samples. Stability AI — best known for the image generator Stable Diffusion — in September released Stable Audio, a commercial product that can generate sophisticated music tracks of up to three minutes. Continue reading Stability AI Releases Free Sound FX Tool, Stable Audio Open
By
Paula ParisiJune 6, 2024
ElevenLabs has launched its text-to-sound generator Sound Effects for all users, available now at the company’s website. The new AI tool can create audio effects, short instrumental tracks, soundscapes and even character voices. Sound Effects “has been designed to help creators — including film and television studios, video game developers, and social media content creators — generate rich and immersive soundscapes quickly, affordably and at scale,” according to the startup, which developed the tool in partnership with Shutterstock, using its library of licensed audio tracks. Continue reading ElevenLabs Launches an AI Tool for Generating Sound Effects
By
Paula ParisiMay 29, 2024
Music startup Suno, which leverages ChatGPT tech with the goal of emulating that app’s success in music, has raised $125 million in Series B funding, resulting in a valuation of $500 million. Founded by Harvard physics PhD turned tech entrepreneur Mikey Shulman, the company is being called “a rising star” in the realm of generative AI. Suno lets people generate original songs by using text prompts or lyrics, with the AI supplying the melodies and harmonies for fully-formed compositions. “We started Suno to build a future where anyone can make music,” according to the company. Continue reading AI Startup Suno Raises Funds to ‘Democratize Music Creation’
By
Paula ParisiMay 23, 2024
Sonos, the company that helped launch the Wi-Fi speaker market is now branching into wireless over-ear headphones. The launch marks a much-anticipated and also inevitable move, considering the U.S. headset market was estimated to be almost $2.2 billion last year, nearly twice as large as the total for wireless speaker sales, according to market research firm Circana. Sonos Ace headphones have what is being called exceptional noise-cancellation and feature Bluetooth connectivity and a Wi-Fi chip so they can be used in conjunction with the Sonos soundbar for a personal home-theater experience. They ship June 5 for $449. Continue reading Sonos Rolls Out Its First Headphones, the $450 Bluetooth Ace
By
Paula ParisiMay 14, 2024
Substack is attempting to lure select TikTok posters to its publishing platform with the launch of Substack Creator Studio. Billed as “a fellowship for the next wave of video stars to turn their TikTok channels into Substack shows and communities,” the outlet says video-native creators will be able to forge a “more direct, intimate relationship with their audience” on Substack, while making money from subscriptions. Only 10 fellows will be initially selected, and given access to consulting and production support from Adam Faze’s Gymnasium short-form studio, producer of the TikTok series “Boy Room.” Continue reading Substack Creator Studio Bows with 10 Video Fellowship Slots
By
ETCentric StaffApril 24, 2024
Adobe plans to add generative AI capabilities to its Premiere Pro editing platform and is exploring the update with third-party AI technologies including OpenAI’s Sora, as well as models from Runway and Pika Labs, making it easier “to draw on the strengths of different models” within everyday workflows, according to Adobe. Editors will gain the ability to generate and add objects into scenes or shots, remove unwanted elements with a click, and even extend frames and footage length. The company is also developing a video model for its own Firefly AI for video and audio work in Premiere Pro. Continue reading Adobe Considers Sora, Pika and Runway AI for Premiere Pro