By Paula Parisi, December 9, 2024
James Cameron’s Lightstorm Entertainment has debuted a new unit, Lightstorm Vision, which launches with a deal that makes Meta Quest its exclusive mixed reality platform. The multiyear deal will see the Hollywood heavyweight partnering with Meta Platforms “to scale the creation of world-class 3D entertainment experiences spanning live sports and concerts, feature films, and TV series featuring big-name IP on Meta Quest,” Meta announced. In addition to producing original content for Meta Quest, Lightstorm Vision will use its tech to help third parties create new programming and potentially adapt existing work through MR conversion. Continue reading Cameron’s Lightstorm Vision Partners with Meta to Advance MR
By Paula Parisi, December 5, 2024
After years of focusing on AI infrastructure, Amazon is plunging into the frontier model business with the Nova series. The new family of generative AI models includes the text-to-text model Amazon Nova Micro and Amazon Nova Lite for fast, mobile-friendly apps, and at the upper echelon the multimodal Amazon Nova Pro and Amazon Nova Premier for processing text, images and video. Amazon, which is heavy into production via Amazon Studios and MGM, has also launched two specialty models focused on “studio quality” output — Amazon Nova Canvas for images and Amazon Nova Reel for video. Continue reading Amazon Dives into Generative AI with Nova Foundation Models
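The Nova models are served through Amazon Bedrock, so developers would typically reach them via the Bedrock runtime rather than a Nova-specific SDK. Below is a minimal sketch using boto3's Converse API; the region, model ID ("amazon.nova-lite-v1:0") and prompt are illustrative assumptions, not details from the announcement.

```python
# Hedged sketch: calling an Amazon Nova model through the Bedrock Converse API
# with boto3. The region and model ID below are assumptions for illustration.
import boto3

client = boto3.client("bedrock-runtime", region_name="us-east-1")

response = client.converse(
    modelId="amazon.nova-lite-v1:0",  # assumed Nova Lite identifier
    messages=[
        {"role": "user", "content": [{"text": "Summarize this scene in one sentence."}]}
    ],
    inferenceConfig={"maxTokens": 256, "temperature": 0.5},
)

# The Converse API returns the assistant reply as a list of content blocks.
print(response["output"]["message"]["content"][0]["text"])
```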
By Paula Parisi, August 23, 2024
Palo Alto-based startup PIP Labs announced an $80 million funding round for Story Protocol, a blockchain platform to track intellectual property rights in the era of artificial intelligence and the data scraping that enables model training. CEO and co-founder Seung Yoon “SY” Lee says the company aims to create a more sustainable IP environment for digital consumers and builders. The raise, led by Andreessen Horowitz (a16z) and Polychain Capital, values the startup at $2.25 billion. The move comes after Sahara AI announced it raised $43 million this month to fund a blockchain-based IP tracking system. Continue reading Story Raises $80M to Create Blockchain-Based IP Protection
By Rob Scott, August 1, 2024
Graphic design company Canva announced it is acquiring fellow Australian startup Leonardo AI with plans to have Leonardo’s 120 employees, including executives, join the Canva AI team. Financial terms of the deal were not disclosed. Sydney-based Leonardo has been gaining attention for its advanced generative AI platform that helps users create images and art based on the open-source Stable Diffusion model developed by Stability AI. The Leonardo team claims its offering is different from other AI art platforms because it gives users more control. Users can experiment with text prompts and quick sketches as Leonardo.ai creates photorealistic images in real time. Continue reading Canva Aims to Boost Its GenAI Efforts with Leonardo Purchase
By Paula Parisi, July 29, 2024
Stability AI has unveiled an experimental new model, Stable Video 4D, which generates photorealistic 3D video. Building on what it created with Stable Video Diffusion, released in November, this latest model can take moving image data of an object and re-render it from multiple angles — generating up to eight different perspectives. Stable Video 4D can generate five frames across eight views in about 40 seconds using a single inference, according to the company, which says the model has “future applications in game development, video editing, and virtual reality.” Users begin by uploading a single video and specifying desired 3D camera poses. Continue reading Stable Video 4D Adds Time Dimension to Generative Imagery
New York-based AI startup Runway has made its latest frontier model — which creates realistic AI videos from text, image or video prompts — generally available to users willing to upgrade to a paid plan starting at $12 per month for each editor. Introduced several weeks ago, Gen-3 Alpha reportedly offers significant improvements over Gen-1 and Gen-2 in areas such as speed, motion, fidelity and consistency. Runway explains it worked with a “team of research scientists, engineers and artists” to develop the upgrades but did not specify where it collected its training data. As the AI video field ramps up, current rivals include Stability AI, OpenAI, Pika and Luma Labs. Continue reading Runway Making Gen-3 Alpha AI Video Model Available to All
By Paula Parisi, June 19, 2024
Google DeepMind has unveiled new research on AI tech it calls V2A (“video-to-audio”) that can generate soundtracks for videos. The initiative complements the wave of AI video generators from companies ranging from biggies like OpenAI and Alibaba to startups such as Luma and Runway, all of which require a separate app to add sound. V2A technology “makes synchronized audiovisual generation possible” by combining video pixels with natural language text prompts “to generate rich soundscapes for the on-screen action,” DeepMind writes, explaining that it can “create shots with a dramatic score, realistic sound effects or dialogue.” Continue reading DeepMind’s V2A Generates Music, Sound Effects, Dialogue
By Paula Parisi, June 7, 2024
Stability AI has added another audio product to its lineup, releasing the open-source text-to-audio generator Stable Audio Open 1.0 for sound design. The new model can generate up to 47 seconds of samples and sound effects, including drum beats, instrument riffs, ambient sounds, foley and production elements. It can also produce variations of existing clips and change the style of audio samples. Stability AI — best known for the image generator Stable Diffusion — in September released Stable Audio, a commercial product that can generate sophisticated music tracks of up to three minutes. Continue reading Stability AI Releases Free Sound FX Tool, Stable Audio Open
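Because the weights are open, Stable Audio Open can be run locally. Here is a minimal sketch assuming the Hugging Face diffusers integration (StableAudioPipeline) and the "stabilityai/stable-audio-open-1.0" repo; the parameter names come from that library, not from the article.

```python
# Hedged sketch: local text-to-audio generation with Stable Audio Open via
# Hugging Face diffusers. Assumes a CUDA GPU and accepted model license.
import torch
import soundfile as sf
from diffusers import StableAudioPipeline

pipe = StableAudioPipeline.from_pretrained(
    "stabilityai/stable-audio-open-1.0", torch_dtype=torch.float16
).to("cuda")

result = pipe(
    prompt="A hammer hitting a wooden surface",
    negative_prompt="low quality",
    num_inference_steps=100,
    audio_end_in_s=10.0,          # clip length in seconds (model supports up to ~47s)
    num_waveforms_per_prompt=1,
)

# Convert the first waveform to (samples, channels) and write it to disk.
audio = result.audios[0].T.float().cpu().numpy()
sf.write("hammer.wav", audio, pipe.vae.sampling_rate)
```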
By Paula Parisi, May 23, 2024
Microsoft is implementing a series of upgrades to its AI stack, including Azure AI Studio and Copilot Studio, developer platforms for building apps that leverage generative AI. At its Build 2024 conference, the company also announced a preview of Azure virtual machines (VMs) running on its powerful Arm-based Cobalt 100 cloud processors, making Azure the first platform to use the chips, which Microsoft first detailed at its November Ignite conference. Other Build highlights include the general availability of OpenAI’s new GPT-4o flagship model across the Azure OpenAI Service and the new multimodal Phi-3-vision SLM. Continue reading Microsoft Upgrades Azure AI Stack, Previews Cobalt 100 VMs
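General availability of GPT-4o on Azure OpenAI Service means the model can be called with the standard openai Python SDK pointed at an Azure endpoint. A minimal sketch follows; the endpoint, API version and deployment name are placeholders, not values from the announcement.

```python
# Hedged sketch: calling a GPT-4o deployment on Azure OpenAI Service with the
# official openai Python SDK. Endpoint, key, API version and deployment name
# are placeholders/assumptions.
import os
from openai import AzureOpenAI

client = AzureOpenAI(
    azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
    api_key=os.environ["AZURE_OPENAI_API_KEY"],
    api_version="2024-06-01",  # assumed API version
)

completion = client.chat.completions.create(
    model="gpt-4o",  # name of your Azure deployment, not the raw model ID
    messages=[{"role": "user", "content": "Explain what a small language model (SLM) is in one sentence."}],
)

print(completion.choices[0].message.content)
```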
By ETCentric Staff, April 12, 2024
During Google Cloud Next 2024 in Las Vegas, Google announced an updated version of its text-to-image generator Imagen 2 on Vertex AI that has the ability to generate video clips of up to four seconds. Google calls this feature “text-to-live images,” and it essentially delivers animated GIFs at 24 fps and 360×640 pixel resolution, though Google says there will be “continuous enhancements.” Imagen 2 can also generate text, emblems and logos in different languages, and has the ability to overlay those elements on existing images like business cards, apparel and products. Continue reading Google Imagen 2 Now Generates 4-Second Clips on Vertex AI
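The article does not document the API surface of the new “text-to-live images” feature, but Imagen 2 itself is exposed through the Vertex AI SDK for still images. Below is a minimal sketch of that path; the project, location and model ID ("imagegeneration@006") are assumptions for illustration.

```python
# Hedged sketch: generating a still image with Imagen 2 on Vertex AI using the
# google-cloud-aiplatform SDK. Project, location and model ID are assumptions;
# the animated "live images" option is not shown here.
import vertexai
from vertexai.preview.vision_models import ImageGenerationModel

vertexai.init(project="my-gcp-project", location="us-central1")  # placeholders

model = ImageGenerationModel.from_pretrained("imagegeneration@006")
images = model.generate_images(
    prompt="A minimalist logo for a coffee shop called Daily Grind",
    number_of_images=1,
)
images[0].save(location="logo.png")
```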
By ETCentric Staff, March 28, 2024
Researchers from the Massachusetts Institute of Technology and Adobe have unveiled a new AI acceleration tool that makes generative apps like DALL-E 3 and Stable Diffusion up to 30x faster by reducing the process to a single step. The new approach, called distribution matching distillation, or DMD, maintains or enhances image quality while greatly streamlining the process. Theoretically, the technique “marries the principles of generative adversarial networks (GANs) with those of diffusion models,” consolidating “the hundred steps of iterative refinement required by current diffusion models” into one step, MIT PhD student and project lead Tianwei Yin says. Continue reading New Tech from MIT, Adobe Advances Generative AI Imaging
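To make the idea concrete, here is a toy, heavily simplified sketch of the distribution-matching gradient — not the authors’ code. A one-step generator is nudged by the difference between a “fake” score (tracking its own outputs) and a frozen “real” score (standing in for the teacher diffusion model); DMD’s auxiliary fake-score training and regression loss are omitted, and all networks are stand-in MLPs on 2D data.

```python
# Toy illustration of the core distribution-matching idea behind DMD.
# Not the MIT/Adobe implementation: real DMD also trains the fake score on
# generator samples and adds a regression loss against teacher outputs.
import torch
import torch.nn as nn

dim = 2
generator = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, dim))   # one-step generator
score_real = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, dim))  # frozen stand-in for teacher score
score_fake = nn.Sequential(nn.Linear(dim, 64), nn.ReLU(), nn.Linear(64, dim))  # stand-in for generator-output score

opt_g = torch.optim.Adam(generator.parameters(), lr=1e-4)

for step in range(100):
    z = torch.randn(128, dim)
    x_fake = generator(z)  # a single forward pass replaces iterative refinement

    with torch.no_grad():
        # Distribution-matching direction: move samples toward regions where the
        # real (teacher) score exceeds the fake score.
        grad = score_fake(x_fake) - score_real(x_fake)

    # Surrogate loss whose gradient w.r.t. the generator output equals `grad`.
    loss = (x_fake * grad).sum(dim=1).mean()
    opt_g.zero_grad()
    loss.backward()
    opt_g.step()
```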
By ETCentric Staff, March 27, 2024
OpenAI’s Sora text- and image-to-video tool isn’t publicly available yet, but the company is showing what it’s capable of by putting it in the hands of seven artists. The results — from a short film about a balloon man to a hybrid flamingo giraffe — are stirring excitement and priming the pump for what OpenAI CTO Mira Murati says will be a 2024 general release. Challenges include making it cheaper to run and enhancing guardrails. Since introducing Sora last month, OpenAI says it’s “been working with visual artists, designers, creative directors and filmmakers to learn how Sora might aid in their creative process.” Continue reading OpenAI Releases Early Demos of Sora Video Generation Tool
By ETCentric Staff, March 25, 2024
Stability AI has released Stable Video 3D, a generative video model based on the company’s foundation model Stable Video Diffusion. SV3D, as it’s called, comes in two versions. Both can generate and animate multi-view 3D meshes from a single image. The more advanced version also lets users set “specified camera paths” for a “filmed” look to the video generation. “By adapting our Stable Video Diffusion image-to-video diffusion model with the addition of camera path conditioning, Stable Video 3D is able to generate multi-view videos of an object,” the company explains. Continue reading Stable Video 3D Generates Orbital Animation from One Image
By ETCentric Staff, March 1, 2024
Lightricks, the company behind apps including Facetune, Photoleap and Videoleap, has come up with a text-to-video tool called LTX Studio that is being positioned as a turnkey AI tool for filmmakers and other creators. “From concept to creation,” the new app aims to enable “the transformation of a single idea into a cohesive, AI-generated video.” Access is currently waitlisted, but Lightricks says it will make the web-based tool available to the public for free, at least initially, beginning in April, allowing users to “direct each scene down to specific camera angles with specialized AI.” Continue reading Lightricks LTX Studio Is a Text-to-Video Filmmaking Platform
By ETCentric Staff, February 16, 2024
Stability AI, purveyor of the popular Stable Diffusion image generator, has introduced a completely new model called Stable Cascade. Now in preview, Stable Cascade uses a different architecture from Stable Diffusion’s SDXL, one the UK company’s researchers say is more efficient. Cascade builds on a compression architecture called Würstchen (German for “sausage”) that Stability began sharing in research papers early last year. Würstchen is a three-stage process that includes two-step encoding. It uses fewer parameters, meaning less data to train on, greater speed and reduced costs. Continue reading Stability AI Advances Image Generation with Stable Cascade
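Stable Cascade’s staged design surfaces to developers as a two-pipeline flow: a small prior that produces highly compressed image embeddings and a decoder that expands them into pixels. Here is a minimal sketch assuming the Hugging Face diffusers integration; the repo names and parameters reflect that library, not anything stated in the article.

```python
# Hedged sketch: Stable Cascade's prior -> decoder flow via Hugging Face
# diffusers. Repo names, dtypes and step counts are taken from that library's
# integration and are assumptions relative to this article.
import torch
from diffusers import StableCascadePriorPipeline, StableCascadeDecoderPipeline

prior = StableCascadePriorPipeline.from_pretrained(
    "stabilityai/stable-cascade-prior", torch_dtype=torch.bfloat16
).to("cuda")
decoder = StableCascadeDecoderPipeline.from_pretrained(
    "stabilityai/stable-cascade", torch_dtype=torch.float16
).to("cuda")

prompt = "an anthropomorphic sausage reading a research paper"

# Stage 1: the prior turns the text prompt into compact image embeddings.
prior_output = prior(prompt=prompt, num_inference_steps=20, guidance_scale=4.0)

# Stage 2: the decoder expands those embeddings into a full-resolution image.
image = decoder(
    image_embeddings=prior_output.image_embeddings.to(torch.float16),
    prompt=prompt,
    num_inference_steps=10,
    guidance_scale=0.0,
).images[0]
image.save("cascade.png")
```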