Stability AI Develops ‘Stable Audio’ Generative Text-to-Music

Stability AI is launching Stable Audio, a music generation AI tool that uses latent diffusion to deliver what the company says is high-quality 44.1 kHz music for commercial use. Stable Audio uses a web-based interface to generate music from text prompts and duration. Because its latent diffusion model architecture has been conditioned on text metadata as well as audio file duration and start time, it defeats a problem common to diffusion for generative audio — producing cohesive musical segments as opposed to arbitrary sections of a song that start or end in the middle of a phrase. Continue reading Stability AI Develops ‘Stable Audio’ Generative Text-to-Music

Meta’s AudioCraft Turns Words into Music with Generative AI

Meta Platforms is releasing AudioCraft, a generative AI framework that creates “high-quality,” “realistic” audio and music from text prompts. AudioCraft consists of three models: MusicGen, AudioGen and EnCodec, all of which Meta announced it is open-sourcing. Released in June, MusicGen was trained on Meta-owned and licensed music, and generates music from text prompts, while AudioGen, which was trained on public domain samples, generates sound effects (like honking horns and barking dogs) from text prompts. The EnCodec decoder allows “higher quality music generation with fewer artifacts,” according to Meta. Continue reading Meta’s AudioCraft Turns Words into Music with Generative AI

Meta’s MusicGen AI Works with Language and Song Prompts

Meta Platforms has debuted what’s being called “ChatGPT for audio.” MusicGen is an AI music generator that can create tunes from natural language or song snippets. The company says MusicGen was trained on 20,000 hours of music, including 10,000 hours of “high-quality” licensed songs and 390,000 instrumental tracks. Meta released MusicGen on GitHub this past weekend, and is currently demoing the app on Facebook’s Hugging Face page. Visitors can generate tunes by describing the sound they want. Among Meta’s prompts: “80s driving pop song with heavy drums and synth pads in the background.” Continue reading Meta’s MusicGen AI Works with Language and Song Prompts