By
Paula ParisiAugust 6, 2024
A new generative AI startup called Black Forest Labs has hit the scene, debuting with a suite of text-to-image models branded FLUX.1. Based in Germany, Black Forest was founded by some of the researchers involved in developing Stable Diffusion and has raised $31 million in funding from principal investor Andreessen Horowitz and angels including CAA founder and former talent agent Michael Ovitz. The FLUX.1 suite focuses on “image detail, prompt adherence, style diversity and scene complexity,” the company says of its three initial variants: FLUX.1 [pro], FLUX.1 [dev] and FLUX.1 [schnell]. Continue reading Black Forest Labs Announces Suite of Text-to-Image Models
By
Paula ParisiJuly 22, 2024
Microsoft has officially moved its AI-powered Designer app out of preview, making the Canva competitor available to iOS and Android users. The app uses text prompts to generate images and designs for items such as logos, greeting cards, stickers and invitations. Powered by OpenAI’s DALL-E 3 image model, Designer is available as an app in Windows and as a free mobile app. New capabilities include the ability to edit existing designs and the addition of “prompt templates” to help users who are starting the design process with a blank canvas. “Just describe what you want to see, and Designer can create it for you,” explains Microsoft. Continue reading Microsoft Designer Adds AI Editing, Launches Mobile Release
By
Paula ParisiMay 16, 2024
Google is launching two new AI models: the video generator Veo and Imagen 3, billed as the company’s “highest quality text-to-image model yet.” The products were introduced at Google I/O this week, where new demo recordings created using the Music AI Sandbox were also showcased. The 1080p Veo videos can be generated in “a wide range of cinematic and visual styles” and run “over a minute” in length, Google says. Veo is available in private preview in VideoFX by joining a waitlist. At a future date, the company plans to bring some Veo capabilities to YouTube Shorts and other products. Continue reading Veo AI Image Generator and Imagen 3 Unveiled at Google I/O
By
ETCentric StaffApril 5, 2024
OpenAI has updated the editor for DALL-E, the artificial intelligence image generator that is part of the ChatGPT premium tiers. The update, based on the DALL-E 3 model, makes it easier for users to adjust their generated images. Shortly after DALL-E 3’s September debut, OpenAI integrated it into ChatGPT, enabling paid subscribers to generate images from text or image prompts. The new DALL-E editor interface lets users edit images “by selecting an area of the image to edit and describing your changes in chat” without using the selection tool. Desired changes can also be prompted “in the conversation panel,” according to OpenAI. Continue reading OpenAI Integrates New Image Editor for DALL-E into ChatGPT
By
ETCentric StaffMarch 28, 2024
Researchers from the Massachusetts Institute of Technology and Adobe have unveiled a new AI acceleration tool that makes generative apps like DALL-E 3 and Stable Diffusion up to 30x faster by reducing the process to a single step. The new approach, called distribution matching distillation, or DMD, maintains or enhances image quality while greatly streamlining the process. Theoretically, the technique “marries the principles of generative adversarial networks (GANs) with those of diffusion models,” consolidating “the hundred steps of iterative refinement required by current diffusion models” into one step, MIT PhD student and project lead Tianwei Yin says. Continue reading New Tech from MIT, Adobe Advances Generative AI Imaging
By
Paula ParisiJanuary 17, 2024
Getty Images and Nvidia are expanding their AI partnership with the addition of the text-to-image platform Generative AI by iStock, designed to produce stock photos that can be used by individuals or enterprise customers. Built on Nvidia Picasso, a foundry for custom AI models, and trained exclusively on data from Getty Images’ proprietary creative libraries, Generative AI by iStock “has been engineered to guard against generations of known products, people, places or other copyrighted elements,” Getty explains, adding that “any licensed visual that a customer generates comes with iStock’s standard $10,000 USD legal coverage.” Continue reading CES: Getty Rolls Out iStock Generative AI Powered by Nvidia
By
Paula ParisiNovember 1, 2023
Creative image platform Shutterstock has added AI-powered editing features that provide “the potential for infinite options to refine and perfect images” in the company’s library of more than 700 million stock selections. A go-to source for brand marketers and digital media companies, Shutterstock is offering six signature AI capabilities as well as secondary features such as a virtual AI design assistant and advanced filters under the umbrella Creative AI. What’s more, Shutterstock says it will compensate its licensed artists when their images are edited with AI. Continue reading Shutterstock Offers AI Image Editor for Massive Stock Library
By
Paula ParisiOctober 25, 2023
OpenAI is developing an AI tool that can identify images created by artificial intelligence — specifically those made in whole or part by its Dall-E 3 image generator. Calling it a “provenance classifier,” company CTO Mira Murati began publicly discussing the detection app last week but said not to expect it in general release anytime soon. This, despite Murati’s claim it is “almost 99 percent reliable.” That is still not good enough for OpenAI, which knows there is much at stake when the public perception of artists’ work can be impacted by a filter applied by AI, which is notoriously capricious. Continue reading OpenAI Developing ‘Provenance Classifier’ for GenAI Images
By
Paula ParisiOctober 11, 2023
OpenAI began previewing vision capabilities for GPT-4 in March, and the company is now starting to roll out the image input and output to users of its popular ChatGPT. The multimodal expansion also includes audio functionality, with OpenAI proclaiming late last month that “ChatGPT can now see, hear and speak.” The upgrade vaults GPT-4 into the multimodal category with what OpenAI is apparently calling GPT-4V (for “Vision,” though equally applicable to “Voice”). “We’re rolling out voice and images in ChatGPT to Plus and Enterprise users,” OpenAI announced. Continue reading ChatGPT Goes Multimodal: OpenAI Adds Vision, Voice Ability
By
Paula ParisiSeptember 25, 2023
During its Surface and AI event in New York City on Thursday, Microsoft introduced a pair of new Surface laptops and an array of generative AI upgrades to Bing Chat, Windows Copilot and more. Taking center stage in hardware was the company’s more powerful Surface Laptop Studio 2 and the ultra-portable Surface Laptop Go 3. Also unveiled was the Surface Go 4 for Business, the latest miniature version of its Surface Pro tablet, and the company’s large touchscreen Surface Hub, designed for office use. Beginning this month, Microsoft rolls out Copilot — “your everyday AI companion” — in a free Windows 11 update, followed by Bing, Edge, and Microsoft 365 this fall. Continue reading Microsoft Unveils Next-Gen Surface Devices, New AI Features
By
Paula ParisiSeptember 22, 2023
OpenAI has released the DALL-E 3 generative AI imaging platform in research preview. The latest iteration features more safety options and integrates with OpenAI’s ChatGPT, currently driven by the now seasoned large language model GPT-4. That is the ChatGPT version to which Plus subscribers and enterprise customers have access — the same who will be able to preview DALL-E 3. The free chatbot is built around GPT-3.5. OpenAI says GPT-4 makes for better contextual understanding by DALL-E, which even in version 2 evidenced some glaring comprehension glitches. Continue reading OpenAI’s Latest Version of DALL-E Integrates with ChatGPT