DALL-E 2 Archives - ETCentric

Social Startup Plai Labs Debuts Free Text-to-Video Generator

By Paula Parisi
November 9, 2023

The entrepreneurs behind the Myspace social network and gaming company Jam City have shifted their focus to generative AI and web3 with a new venture, Plai Labs, a social platform that provides AI tools for collaboration and connectivity. Plai Labs has released a free text-to-video generator, PlaiDay, which will compete with other GenAI video tools from the likes of OpenAI (DALL-E 2), Google (Imagen), Meta Platforms (Make-A-Video) and Stable Diffusion. But PlaiDay hopes to set itself apart by offering the ability to personalize videos with selfie likenesses. Continue reading Social Startup Plai Labs Debuts Free Text-to-Video Generator

Samsung Next Invests in Irreverent Labs’ Text-to-Video Tech

By Paula Parisi
September 5, 2023

Seattle-area startup Irreverent Labs has shifted its focus from blockchain-based video games and NFTs to artificial intelligence. Specifically, it wants to build foundation models for text-to-video generation and related content creation tools. Text-to-video is being explored by several companies but is still in development. Samsung Next was intrigued enough with the proposition to invest an undisclosed sum in Irreverent. While there are several apps that output cartoonish results, ambitious efforts are limited. Animations that aim for photorealism, such as Meta’s Make-a-Video and Runway’s Gen-2, can output only four or five seconds of video at a time. Continue reading Samsung Next Invests in Irreverent Labs’ Text-to-Video Tech

MAGE AI Unifies Generative and Recognition Image Training

By Paula Parisi
June 23, 2023

Researchers at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have introduced a computer vision system that combines image recognition and image generation technology into one training model instead of two. The result, MAGE (short for MAsked Generative Encoder) holds promise for a wide variety of use cases and is expected to reduce costs through unified training, according to the team. “To the best of our knowledge, this is the first model that achieves close to state-of-the-art results for both tasks using the same data and training paradigm,” the researchers said. Continue reading MAGE AI Unifies Generative and Recognition Image Training

Microsoft’s Next Generation of Bing AI Interacts with Images

By Paula Parisi
May 8, 2023

Microsoft’s AI-powered Bing search engine has been drawing in excess of 100 million daily active users and logged half a billion chats. With OpenAI’s GPT-4 and DALL-E 2 models driving the action, it has also created over 200 million images since debuting in limited preview in February. Seeking to build on that momentum, Microsoft is adding new features and integrating Bing more tightly with its Edge browser. The company is also ditching its waitlist in a move to open preview. “We’re underway with the transformation of search,” CVP and consumer CMO Yusuf Mehdi said at a preview event last week. Continue reading Microsoft’s Next Generation of Bing AI Interacts with Images

OpenAI’s Altman Talks Up Machine Learning on Global Tour

By Paula Parisi
April 13, 2023

Amidst calls to put the brakes on large language model development, OpenAI CEO Sam Altman has hit the global circuit to tout the advantages of artificial intelligence and commercial opportunities with his firm. Altman’s 17-city tour includes stops in Washington D.C., Toronto, Tokyo, Rio De Janeiro, Lagos, London, Paris, Madrid, Brussels, Munich, Tel Aviv, Singapore, Dubai, New Delhi, Jakarta, Seoul and Melbourne. On Monday, Altman met with Japanese Prime Minister Fumio Kishida and other government officials, vowing to collaborate on protecting user privacy and data protection. Continue reading OpenAI’s Altman Talks Up Machine Learning on Global Tour

Microsoft Introduces Visual AI Tools to Bing, Edge Platforms

By Paula Parisi
March 23, 2023

Microsoft is bringing Bing Image Creator to the new Bing search engine and Edge browser. Powered by an advanced version of the DALL-E model from OpenAI, the new tools will allow users to generate images using word prompts to describe what they want to want to create. The news comes as Microsoft says its new Bing AI Copilot has had “more than 100 million chats to date,” with people using it to refine answers to complex questions or as entertainment or creative inspiration. Bing data indicates images are one of the most searched categories, second only to general web searches, according to Microsoft. Continue reading Microsoft Introduces Visual AI Tools to Bing, Edge Platforms

Watermark-Erasing AI Worries Photographers, Other Creatives

By Paula Parisi
January 31, 2023

A new artificial intelligence service offering free watermark removal from photographs is causing worry among copyright holders. Photographers took to Twitter to complain about this threat to their livelihoods while the creative community at large pondered the broader implications for AI infringement on intellectual property rights — a central aspect of discussions involving ChatGPT, which was trained using privately held as well as public domain data. Available to download as an app from sites including Product Hunt and the Google Play Store, the WatermarkRemover.io app itself is legal, while some of its potential uses are not. Continue reading Watermark-Erasing AI Worries Photographers, Other Creatives

Microsoft Adding ChatGPT to Wide Release of Azure OpenAI

By Paula Parisi
January 18, 2023

Microsoft plans to add OpenAI’s artificial intelligence app ChatGPT to its Azure OpenAI Service, which is now being made generally available after being offered to select enterprise customers in limited availability since November 2021. ChatGPT’s Azure debut expands on the existing relationship with OpenAI, in which Microsoft in 2019 invested $1 billion, a stake it is considering to expanding by another $10 billion. Microsoft couched the moves as a ”continued commitment to democratizing AI, and ongoing partnership with OpenAI.” Microsoft chief exec Satya Nadella also announced the company plans to eventually include AI tools like ChatGPT into all of its products. Continue reading Microsoft Adding ChatGPT to Wide Release of Azure OpenAI

QuickVid Uses AI to Create Short Videos from Text Prompts

By Paula Parisi
January 5, 2023

QuickVid is a new AI-driven text-to-video platform aiming for a mass market user base. The tool draws on various generative AI systems to automatically create short-form videos for YouTube, Instagram, TikTok and other platforms. Created by former Meta Platforms programmer Daniel Habib “in a matter of weeks,” QuickVid is quite rudimentary, though Habib says he plans to continue fine tuning and adding features. Unlike Google and Meta have done with their nascent text-to-video systems, QuickVid has bypassed the formalities of research papers and industry previews and jumped directly to a public-facing website. Continue reading QuickVid Uses AI to Create Short Videos from Text Prompts

OpenAI’s Point-E Offers a New Take on Text-to-3D Modeling

By Paula Parisi
January 3, 2023

In the wake of overwhelming public response to recent offerings DALL-E 2 and ChatGPT, OpenAI this week introduced Point-E, a text-to-3D model generator that is garnering positive feedback. Faster and less resource intensive than comparable systems, it’s still in the early stages and prone to occasional disjointed results but has advanced the proposition. Using a single Nvidia V100 GPU, Point-E can create a 3D model in under two minutes, generating “point clouds” — data sets representing a 3D shape. Point clouds compute more easily than the wire-fame meshes traditionally used to model 3D objects. Continue reading OpenAI’s Point-E Offers a New Take on Text-to-3D Modeling

Microsoft Integrates DALL-E 2 into Designer and Creator Apps

By Paula Parisi
October 14, 2022

Microsoft announced it is integrating OpenAI’s DALL-E 2 into its new Microsoft Designer app, as well as its Microsoft Edge browser and the Image Creator tool in its Bing search engine. Microsoft provides cloud computing services to OpenAI and has partnered with OpenAI in AI commercialization efforts including the Azure OpenAI Service, now in preview, and GitHub Copilot. The Designer web app can be used to create designs for posters, presentations, invitations and other graphics that can be printed and used for display or shared on social or business media. Continue reading Microsoft Integrates DALL-E 2 into Designer and Creator Apps

Google and Meta Are Developing AI Text-to-Video Generators

By Paula Parisi
October 10, 2022

AI image generators like OpenAI’s DALL-E 2 and Google’s Imagen have been generating a lot of attention recently. Now AI text-to-video generators are edging into the spotlight, with Google debuting Imagen Video on the heels of Meta AI’s Make-A-Video rollout last month. Imagen Video has been used to generate videos of up to 25-minutes at a 24 fps, 1280×768 pixel spec. Imagen Video was trained “on a combination of an internal dataset consisting of 14 million video-text pairs and 60 million image-text pairs,” resulting in some unusual functionality, according to Google Research. Continue reading Google and Meta Are Developing AI Text-to-Video Generators

OpenAI Expands DALL-E 2 Functionality with Facial Uploads

By Paula Parisi
September 21, 2022

OpenAI has begun allowing users of its DALL-E 2 image-generating system to work with facial image uploads. The program previously allowed only computer-generated faces in an effort to prevent deepfakes and misuse, but OpenAI says improvements to its safety system succeeded in “minimizing the potential of harm” from things like explicit, political or violent content. OpenAI will continue to prohibit use of unauthorized photos and will seek to protect right of publicity, though it remains to be seen how effective that will be. In the past, customers have complained the company was overzealous in its policing. Continue reading OpenAI Expands DALL-E 2 Functionality with Facial Uploads

Stability AI Releases Stable Diffusion Text-to-Image Generator

By Paula Parisi
August 18, 2022

Stability AI is in the first stage of release of Stable Diffusion, a text-to-image generator similar in functionality to OpenAI’s DALL-E 2, with one important distinction: this open-source newcomer lacks the filters that prevent the earlier system from creating images of public figures or content deemed excessively toxic. Last week the Stable Diffusion code was made available to just over a thousand researchers and the Los Altos-based startup anticipates a public release in the coming weeks. The unfettered unleashing of a powerful imaging system has stirred controversy in the AI community, raising ethical questions. Continue reading Stability AI Releases Stable Diffusion Text-to-Image Generator

Businesses Experiment with DALL-E 2, Report Mixed Results

By Paula Parisi
August 12, 2022

OpenAI’s powerful text-to-image generator DALL-E 2 is still in beta, but businesses are already testing it for commercial use. Apparel firm Stitch Fix has been using it to visualize fabric and color personalization, while Heinz tapped the AI system for a marketing campaign. Cosmopolitan used it to design a magazine cover. Others have leveraged the image engine to generate logos and thumbnails. These early adopters are identifying technical issues that OpenAI says it is addressing as it readies DALL-E 2 for enterprise. Foremost among the complaints is the lack of a dedicated API for public use. Continue reading Businesses Experiment with DALL-E 2, Report Mixed Results