By
ETCentric StaffMarch 21, 2024
Deepgram’s new Aura software turns text into generative audio with a “human-like voice.” The 9-year-old voice recognition company has raised nearly $86 million to date on the strength of its Voice AI platform. Aura is an extremely low-latency text-to-speech voice AI that can be used for voice AI agents, the company says. Paired with Deepgram’s Nova-2 speech-to-text API, developers can use it to “easily (and quickly) exchange real-time information between humans and LLMs to build responsive, high-throughput AI agents and conversational AI applications,” according to Deepgram. Continue reading Deepgram’s Speech Portfolio Now Includes Human-Like Aura
By
Paula ParisiApril 24, 2023
In a move it sees as a force multiplier, Alphabet is consolidating DeepMind and the Brain team from Google Research into a unit called Google DeepMind, uniting the teams responsible for Google Brain with DeepMind, the UK-based artificial intelligence research lab acquired in 2014. Collective accomplishments include AlphaGo, Transformers, WaveNet and AlphaFold, as well as software frameworks like TensorFlow and JAX for expressing, training and deploying large scale ML models. “Combining all this talent into one focused team, backed by the computational resources of Google, will significantly accelerate our progress in AI,” the company announced. Continue reading Google Restructures AI Research Units into Google DeepMind
By
Debra KaufmanApril 22, 2019
Facebook is bringing back FMV (full motion video) games, which use pre-recorded video files to display action. With the work of Facebook AI Research scientists, the new FMV games are much improved, with a system that can extract controllable characters from real-world videos and then control their motion, thus generating new image sequences. Facebook AI Research scientists, in collaboration with Tel Aviv University, also unveiled a system that, unsupervised, converts audio of one singer to the voice of another. Continue reading Facebook Uses AI to Improve Games, Swap Singers’ Voices
By
Emily WilsonMarch 29, 2018
According to members of Google’s Brain and Machine Perception teams, researchers at the tech giant have developed “ways to make machine-generated speech sound more natural to humans,” even providing examples of the more expressive speech in a company blog post, reports VentureBeat. Google also announced the release of its Cloud Text-to-Speech services, which could “be used to bring more natural speech to devices, apps or digital services that utilize voice control or voice computing,” the article explains.
Continue reading Google’s Machine-Generated Speech Will Sound More Human