By
Paula ParisiApril 10, 2025
Amazon has updated its Nova model series, with Nova Reel 1.1 now able to generate AI videos of up to two minutes as well as gaining a new ‘multi-shot’ feature. Announced in December, Nova Reel marked Amazon’s initial foray into generative video. AWS developer advocate Elizabeth Fuentes says that Nova Reel accommodates user prompts of up to 4,000 characters that can generate a series of six-second shots for a sequence totaling two minutes. The company also introduced the Nova Sonic real-time voice model that supports third-party enterprise development. Continue reading AWS Updates Nova Reels and Adds Nova Sonic Voice Model
By
Debra KaufmanDecember 5, 2017
Mozilla unveiled Project DeepSpeech and Project Common Voice to leverage the capabilities of speech recognition. The company says it has just reached “two important milestones” in the project out of its Machine Learning Group. Mozilla is releasing its open source speech recognition model, which it states is nearly as accurate as what humans can perceive from the same recordings, and is also unveiling the world’s second largest publicly available voice dataset, with contributions by almost 20,000 people around the world. Continue reading Mozilla Intros Open-Source Speech Recognition, Voice Dataset
By
Debra KaufmanNovember 27, 2017
AISense, a company that offers a voice transcription service, is partnering with videoconferencing service Zoom to bring a product to market in 2018 that will provide automatic transcriptions for Zoom’s customers. AISense’s technology uses machine learning to provide a full text record of what is said, and Zoom’s videoconferencing is its first practical use. AISense also just raised $10 million in Series A funding led by Horizons Ventures, with Draper Associates, Draper Dragon, David Cheriton, and Bridgewater Associates. Continue reading AISense Teams Up with Zoom for Voice Transcription Product