Meta AI Seamless Translator Converts Nearly 100 Languages

The research division of Meta AI has developed Seamless Communication, a suite of artificial intelligence models that generate what the company says is natural and authentic communication across languages, facilitating what amounts to real-time universal speech translation. The models were released with accompanying research papers and data. The flagship model, Seamless, merges capabilities from a trio of models — SeamlessExpressive, SeamlessStreaming and SeamlessM4T v2 — into a single system that can translate between almost 100 spoken and written languages, preserving idioms, emotion and the speaker’s vocal style, Meta says. Continue reading Meta AI Seamless Translator Converts Nearly 100 Languages

Stability AI Intros Real-Time Text-to-Image Generation Model

Stability AI, developer of Stable Diffusion (one of the leading visual content generators, alongside Midjourney and DALL-E), has introduced SDXL Turbo — a new AI model that demonstrates more of the latent possibilities of the common diffusion generation approach: images that update in real time as the user’s prompt updates. This feature was always a possibility even with previous diffusion models given text and images are comprehended differently across linear time, but increased efficiency of generation algorithms and the steady accretion of GPUs and TPUs in a developer’s data center makes the experience more magical. Continue reading Stability AI Intros Real-Time Text-to-Image Generation Model

Altman Reinstated as CEO of OpenAI, Microsoft Joins Board

Sam Altman has wasted no time since being rehired as CEO of OpenAI on November 22, four days after being fired. This week, the 38-year-old leader of one of the most influential artificial intelligence firms outlined his “immediate priorities” and announced a newly constituted “initial board” that includes a non-voting seat for investor Microsoft. The three voting members thus far include former Salesforce co-CEO Bret Taylor as chairman and former U.S. Treasury Secretary Larry Summers — both newcomers — and sophomore Adam D’Angelo, CEO of Quora. Mira Murati, interim CEO during Altman’s brief absence, returns to her role as CTO. Continue reading Altman Reinstated as CEO of OpenAI, Microsoft Joins Board

Amazon Previews Titan Image Generator for Bedrock Clients

Amazon is debuting its Titan Image Generator in preview for AWS Bedrock customers. The new Titan generative AI model can create new images from a text prompt or existing image, and automatically adds watermarking to protect intellectual property. The move into generative imaging puts Amazon in competition with a growing field that includes large firms like Adobe and Google. Unlike those companies and others, the e-retail giant is at present focusing exclusively on enterprise customers. Amazon Bedrock is a managed service giving developers access to a range of foundation models from companies including Meta Platforms, Anthropic, and Amazon itself. Continue reading Amazon Previews Titan Image Generator for Bedrock Clients

Newsom Report Examines Use of AI by California Government

California Governor Gavin Newsom has released a report examining the beneficial uses and potential harms of artificial intelligence in state government. Potential plusses include improving access to government services by identifying groups that are hindered due to language barriers or other reasons, while dangers highlight the need to prepare citizens with next generation skills so they don’t get left behind in the GenAI economy. “This is an important first step in our efforts to fully understand the scope of GenAI and the state’s role in deploying it,” Newsom said, calling California’s strategy “a nuanced, measured approach.” Continue reading Newsom Report Examines Use of AI by California Government

Google’s Bard AI Is Getting Smarter About YouTube Content

Google’s Bard AI chatbot is getting smarter regarding video queries. Specifically, a new YouTube extension is now able to answer questions about the content of individual videos without requiring playback. “We’re expanding the YouTube extension to understand some video content so you can have a richer conversation with Bard about it,” Google wrote on Bard’s changelog. In September, Google released a YouTube extension that made it easier to find specific videos. This update allows Bard to operate more interactively, sharing detailed information as it relates to YouTube’s visual content. Continue reading Google’s Bard AI Is Getting Smarter About YouTube Content

Nvidia Sales Surge as Rivals Circle and China Sanctions Loom

Nvidia logged another record quarter, with Q3 revenue of $18.12 billion, up 206 percent from a year ago and a 34 percent increase from Q2 that exceeded both its own and analyst projections. The surge, attributed to increasing demand for the chips that drive artificial intelligence, logged primarily under Nvidia’s data center results a record $14.51 billion, up 279 percent from the prior year and 41 percent from Q2. Profits swelled to $9.2 billion, a stunning 1,259 percent increase from 2022’s $680 million. The results for Nvidia’s Q3 were for the three-month period ending October 31. Continue reading Nvidia Sales Surge as Rivals Circle and China Sanctions Loom

Stability Introduces GenAI Video Model: Stable Video Diffusion

Stability AI has opened research preview on its first foundation model for generative video, Stable Video Diffusion, offering text-to-video and image-to-video. Based on the company’s Stable Diffusion text-to-image model, the new open-source model generates video by animating existing still frames, including “multi-view synthesis.” While the company plans to enhance and extend the model’s capabilities, it currently comes in two versions: SVD, which transforms stills into 576×1024 videos of 14 frames, and SVD-XT that generates up to 24 frames — each at between three and 30 frames per second. Continue reading Stability Introduces GenAI Video Model: Stable Video Diffusion

In Latest Twist, Sam Altman Is Reinstated as CEO of OpenAI

Just four days after being pushed out by the OpenAI board, Sam Altman will be reinstated as CEO of the company he co-founded. His return as chief executive was announced in a series of online posts. In addition, most of the company’s board members will be replaced. Following Altman’s firing, Microsoft CEO Satya Nadella had announced Altman would lead a new advanced AI research team to run independently within Microsoft. However, hundreds of OpenAI employees, including co-founder and board member Ilya Sutskever, continued to push for Altman’s return, threatening to quit if he was not reinstated. Continue reading In Latest Twist, Sam Altman Is Reinstated as CEO of OpenAI

Voice Cloning Startup CreateSafe Introduces GenAI Platform

Music tech studio CreateSafe has officially launched its generative AI-powered platform Triniti in open beta. Triniti lets artists create AI voice clones, generate text-to-audio samples, get assistance monetizing and managing music IP or interact with a chatbot on industry-specific music questions. The company has raised $4.6 million in a seed round led by cryptocurrency and blockchain investment firm Polychain Capital to further develop Triniti. Crush Ventures, hip hop music manager Anthony Saleh, and Paris Hilton’s 11:11 Media also participated in the funding round. Continue reading Voice Cloning Startup CreateSafe Introduces GenAI Platform

GitHub Copilot Brings AI-Powered Coding Tool to Enterprise

GitHub Copilot Chat for enterprise becomes generally available in December, and the GitHub site is integrating the artificial intelligence assistant across its entire platform, promising that AI will infuse every step of the developer lifecycle. “Just as GitHub was founded on Git, today we are re-founded on Copilot,” the Microsoft-owned company announced this month. Powered by OpenAI’s GPT-4, the new configuration will offer inline Copilot Chat for code questions, contextual guidance and “slash commands” for /fix and /test. The AI tool is designed to assist coders with their everyday workflows with a series of “one-click” assists and other shortcuts. Continue reading GitHub Copilot Brings AI-Powered Coding Tool to Enterprise

Sam Altman Joins Microsoft After Abruptly Ousted by OpenAI

Rumors were running rampant over the weekend as an unanticipated executive shuffle played out at OpenAI. It began on Friday when CEO Sam Altman was pushed out by the OpenAI board. President and co-founder Greg Brockman quickly resigned in solidarity, followed by several top researchers. Reports circulated the following day that investors were pressuring the board into reconsidering its decision, but by Sunday evening, OpenAI announced that former Twitch leader Emmett Shear would serve as the new interim chief. Shortly after, Microsoft CEO Satya Nadella said Altman, Brockman and other OpenAI employees would join Microsoft to lead an advanced AI research unit. Continue reading Sam Altman Joins Microsoft After Abruptly Ousted by OpenAI

Meta Touts Its Emu Foundational Model for Video and Editing

Having made the leap from image generation to video generation over the course of a few months in 2022, Meta Platforms introduces Emu, its first visual foundational model, along with Emu Video and Emu Edit, positioned as milestones in the trek to AI moviemaking. Emu uses just two diffusion models to generate 512×512 four-second long videos at 16 frames per second, Meta said, comparing that to 2022’s Make-A-Video, which requires a “cascade” of five models. Internal research found Emu video generations were “strongly preferred” over the Make-A-Video model based on quality (96 percent) and prompt fidelity (85 percent). Continue reading Meta Touts Its Emu Foundational Model for Video and Editing

Unity Opens Beta for Muse AI, Sets General Release for 2024

Unity has officially released its Muse AI platform for general use in early access. Muse is a suite of AI-powered tools that streamline game development. The Muse package includes Muse Chat to source answers and generate code, Muse Sprite for 2D sprites generation, and Muse Texture, providing 2D and 3D ready textures. Originally announced in July, Muse is now offered at a $30 per month subscription. Also announced at the firm’s annual Unite conference was the next major software update, Unity 6, for 2024, and the deployment of Unity Cloud to connect development tools across projects and pipelines. Continue reading Unity Opens Beta for Muse AI, Sets General Release for 2024

YouTube AI Music Generator Mimics Stars with Their Approval

YouTube Music is testing the first in a series of AI-related music experiments. Dream Track for Shorts and the Music AI tools suite were built in collaboration with Google DeepMind’s Lyria music model to allow both original and emulative song creation. Dream Track lets users combine text prompts with the selection of a participating artist to create sound for a YouTube Short of up to 30 seconds featuring an AI simulation of the performer’s voice. Music AI can generate new music from scratch, change audio from one style or instrument to another or add vocal accompaniments. Continue reading YouTube AI Music Generator Mimics Stars with Their Approval