By
Paula ParisiSeptember 10, 2024
YouTube is introducing AI detection tools designed to allow people to learn when their face and/or voice are copied and used in third-party videos. As part of the effort, YouTube’s existing Content ID program that protects copyrighted music will expand to include more broad-based voice simulation detection technology. The new tools aim to protect “people from a variety of industries — from creators and actors to musicians and athletes,” according to the company. The Google-owned platform is also coming up with a way to address unauthorized use of its content for training AI models. Continue reading YouTube Adding Tools to Protect Against Unauthorized AI Use
By
Paula ParisiJune 15, 2022
Spotify will acquire London-based startup Sonantic, a company that creates realistic human voices from text using a proprietary AI engine. Sonantic made a recent high-profile contribution to pop culture by providing the means to simulate actor Val Kilmer’s voice in Paramount’s summer blockbuster “Top Gun: Maverick.” The move expands the music and podcasting streamer to expand into audio technology with broad implications. Spotify vice president of personalization Ziad Sultan says the technology will be integrated into the main platform to allow the company “to engage users in a new and even more personalized way.” Continue reading Spotify Announces Plan to Acquire AI Voice Startup Sonantic
As another example of the significant advances we have been following in artificial intelligence and deep learning, Chinese search giant Baidu has introduced Deep Voice 2, the second iteration of its compelling text-to-speech system. The company introduced Deep Voice just three months ago, with the ability to produce speech “in near real time” that was “nearly indistinguishable from an actual human voice,” according to The Verge. While the first system was limited to learning one voice at a time, “and required many hours of audio or more from which to build a sample,” the updated version “can learn the nuances of a person’s voice with just half an hour of audio, and a single system can learn to imitate hundreds of different speakers.” Continue reading Text-to-Speech System Quickly Mimics Hundreds of Accents