TTS Archives - ETCentric

Facebook Reveals New AI-Powered Text-to-Speech System

By Debra Kaufman
May 22, 2020

Facebook introduced an AI text-to-speech system (TTS) that produces a second of audio in 500 milliseconds. According to Facebook, the system, which is used with a new approach to data collection, powered the creation of a British accent-inflected voice in six months, versus over a year required for other voices. The TTS is now used for Facebook’s Portal smart display brand. The system can be hosted in real time via ordinary processors and is also available as a service for other apps, including Facebook’s VR. Continue reading Facebook Reveals New AI-Powered Text-to-Speech System

Google and IBM Create Advanced Text-to-Speech Systems

By Debra Kaufman
October 2, 2019

Both IBM and Google recently advanced development of Text-to-Speech (TTS) systems to create high-quality digital speech. OpenAI found that, since 2012, the compute power needed to train TTS models has exploded to more than 300,000 times. IBM created a much less compute-intensive model for speech synthesis, stating that it is able to do so in real-time and adapt to new speaking styles with little data. Google and Imperial College London created a generative adversarial network (GAN) to create high-quality synthetic speech. Continue reading Google and IBM Create Advanced Text-to-Speech Systems

Text-to-Speech System Quickly Mimics Hundreds of Accents

By ETCentric
May 26, 2017

As another example of the significant advances we have been following in artificial intelligence and deep learning, Chinese search giant Baidu has introduced Deep Voice 2, the second iteration of its compelling text-to-speech system. The company introduced Deep Voice just three months ago, with the ability to produce speech “in near real time” that was “nearly indistinguishable from an actual human voice,” according to The Verge. While the first system was limited to learning one voice at a time, “and required many hours of audio or more from which to build a sample,” the updated version “can learn the nuances of a person’s voice with just half an hour of audio, and a single system can learn to imitate hundreds of different speakers.” Continue reading Text-to-Speech System Quickly Mimics Hundreds of Accents

Audi Announces Next-Generation V2I Connected Car Features

By Erick Moen
August 18, 2016

Later this year, Audi will roll out the first feature of its new vehicle-to-infrastructure (V2I) service in select 2017 models. The company’s new traffic light information system will notify drivers of the remaining wait time at red lights. It represents the first time an individual vehicle will access real-time infrastructure information. The platform is a practical, yet significant, first step for connected cars as they begin to integrate into the existing municipal infrastructure with an eye toward the dawn of “smart cities.” Continue reading Audi Announces Next-Generation V2I Connected Car Features