By
Paula ParisiNovember 9, 2023
The entrepreneurs behind the Myspace social network and gaming company Jam City have shifted their focus to generative AI and web3 with a new venture, Plai Labs, a social platform that provides AI tools for collaboration and connectivity. Plai Labs has released a free text-to-video generator, PlaiDay, which will compete with other GenAI video tools from the likes of OpenAI (DALL-E 2), Google (Imagen), Meta Platforms (Make-A-Video) and Stable Diffusion. But PlaiDay hopes to set itself apart by offering the ability to personalize videos with selfie likenesses. Continue reading Social Startup Plai Labs Debuts Free Text-to-Video Generator
By
Paula ParisiSeptember 5, 2023
Seattle-area startup Irreverent Labs has shifted its focus from blockchain-based video games and NFTs to artificial intelligence. Specifically, it wants to build foundation models for text-to-video generation and related content creation tools. Text-to-video is being explored by several companies but is still in development. Samsung Next was intrigued enough with the proposition to invest an undisclosed sum in Irreverent. While there are several apps that output cartoonish results, ambitious efforts are limited. Animations that aim for photorealism, such as Meta’s Make-a-Video and Runway’s Gen-2, can output only four or five seconds of video at a time. Continue reading Samsung Next Invests in Irreverent Labs’ Text-to-Video Tech
By
Paula ParisiJune 23, 2023
Researchers at MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL) have introduced a computer vision system that combines image recognition and image generation technology into one training model instead of two. The result, MAGE (short for MAsked Generative Encoder) holds promise for a wide variety of use cases and is expected to reduce costs through unified training, according to the team. “To the best of our knowledge, this is the first model that achieves close to state-of-the-art results for both tasks using the same data and training paradigm,” the researchers said. Continue reading MAGE AI Unifies Generative and Recognition Image Training
By
Paula ParisiMay 8, 2023
Microsoft’s AI-powered Bing search engine has been drawing in excess of 100 million daily active users and logged half a billion chats. With OpenAI’s GPT-4 and DALL-E 2 models driving the action, it has also created over 200 million images since debuting in limited preview in February. Seeking to build on that momentum, Microsoft is adding new features and integrating Bing more tightly with its Edge browser. The company is also ditching its waitlist in a move to open preview. “We’re underway with the transformation of search,” CVP and consumer CMO Yusuf Mehdi said at a preview event last week. Continue reading Microsoft’s Next Generation of Bing AI Interacts with Images
By
Paula ParisiApril 13, 2023
Amidst calls to put the brakes on large language model development, OpenAI CEO Sam Altman has hit the global circuit to tout the advantages of artificial intelligence and commercial opportunities with his firm. Altman’s 17-city tour includes stops in Washington D.C., Toronto, Tokyo, Rio De Janeiro, Lagos, London, Paris, Madrid, Brussels, Munich, Tel Aviv, Singapore, Dubai, New Delhi, Jakarta, Seoul and Melbourne. On Monday, Altman met with Japanese Prime Minister Fumio Kishida and other government officials, vowing to collaborate on protecting user privacy and data protection. Continue reading OpenAI’s Altman Talks Up Machine Learning on Global Tour
By
Paula ParisiMarch 23, 2023
Microsoft is bringing Bing Image Creator to the new Bing search engine and Edge browser. Powered by an advanced version of the DALL-E model from OpenAI, the new tools will allow users to generate images using word prompts to describe what they want to want to create. The news comes as Microsoft says its new Bing AI Copilot has had “more than 100 million chats to date,” with people using it to refine answers to complex questions or as entertainment or creative inspiration. Bing data indicates images are one of the most searched categories, second only to general web searches, according to Microsoft. Continue reading Microsoft Introduces Visual AI Tools to Bing, Edge Platforms
By
Paula ParisiJanuary 31, 2023
A new artificial intelligence service offering free watermark removal from photographs is causing worry among copyright holders. Photographers took to Twitter to complain about this threat to their livelihoods while the creative community at large pondered the broader implications for AI infringement on intellectual property rights — a central aspect of discussions involving ChatGPT, which was trained using privately held as well as public domain data. Available to download as an app from sites including Product Hunt and the Google Play Store, the WatermarkRemover.io app itself is legal, while some of its potential uses are not. Continue reading Watermark-Erasing AI Worries Photographers, Other Creatives
By
Paula ParisiJanuary 18, 2023
Microsoft plans to add OpenAI’s artificial intelligence app ChatGPT to its Azure OpenAI Service, which is now being made generally available after being offered to select enterprise customers in limited availability since November 2021. ChatGPT’s Azure debut expands on the existing relationship with OpenAI, in which Microsoft in 2019 invested $1 billion, a stake it is considering to expanding by another $10 billion. Microsoft couched the moves as a ”continued commitment to democratizing AI, and ongoing partnership with OpenAI.” Microsoft chief exec Satya Nadella also announced the company plans to eventually include AI tools like ChatGPT into all of its products. Continue reading Microsoft Adding ChatGPT to Wide Release of Azure OpenAI
By
Paula ParisiJanuary 5, 2023
QuickVid is a new AI-driven text-to-video platform aiming for a mass market user base. The tool draws on various generative AI systems to automatically create short-form videos for YouTube, Instagram, TikTok and other platforms. Created by former Meta Platforms programmer Daniel Habib “in a matter of weeks,” QuickVid is quite rudimentary, though Habib says he plans to continue fine tuning and adding features. Unlike Google and Meta have done with their nascent text-to-video systems, QuickVid has bypassed the formalities of research papers and industry previews and jumped directly to a public-facing website. Continue reading QuickVid Uses AI to Create Short Videos from Text Prompts
By
Paula ParisiJanuary 3, 2023
In the wake of overwhelming public response to recent offerings DALL-E 2 and ChatGPT, OpenAI this week introduced Point-E, a text-to-3D model generator that is garnering positive feedback. Faster and less resource intensive than comparable systems, it’s still in the early stages and prone to occasional disjointed results but has advanced the proposition. Using a single Nvidia V100 GPU, Point-E can create a 3D model in under two minutes, generating “point clouds” — data sets representing a 3D shape. Point clouds compute more easily than the wire-fame meshes traditionally used to model 3D objects. Continue reading OpenAI’s Point-E Offers a New Take on Text-to-3D Modeling
By
Paula ParisiOctober 14, 2022
Microsoft announced it is integrating OpenAI’s DALL-E 2 into its new Microsoft Designer app, as well as its Microsoft Edge browser and the Image Creator tool in its Bing search engine. Microsoft provides cloud computing services to OpenAI and has partnered with OpenAI in AI commercialization efforts including the Azure OpenAI Service, now in preview, and GitHub Copilot. The Designer web app can be used to create designs for posters, presentations, invitations and other graphics that can be printed and used for display or shared on social or business media. Continue reading Microsoft Integrates DALL-E 2 into Designer and Creator Apps
By
Paula ParisiOctober 10, 2022
AI image generators like OpenAI’s DALL-E 2 and Google’s Imagen have been generating a lot of attention recently. Now AI text-to-video generators are edging into the spotlight, with Google debuting Imagen Video on the heels of Meta AI’s Make-A-Video rollout last month. Imagen Video has been used to generate videos of up to 25-minutes at a 24 fps, 1280×768 pixel spec. Imagen Video was trained “on a combination of an internal dataset consisting of 14 million video-text pairs and 60 million image-text pairs,” resulting in some unusual functionality, according to Google Research. Continue reading Google and Meta Are Developing AI Text-to-Video Generators
By
Paula ParisiSeptember 21, 2022
OpenAI has begun allowing users of its DALL-E 2 image-generating system to work with facial image uploads. The program previously allowed only computer-generated faces in an effort to prevent deepfakes and misuse, but OpenAI says improvements to its safety system succeeded in “minimizing the potential of harm” from things like explicit, political or violent content. OpenAI will continue to prohibit use of unauthorized photos and will seek to protect right of publicity, though it remains to be seen how effective that will be. In the past, customers have complained the company was overzealous in its policing. Continue reading OpenAI Expands DALL-E 2 Functionality with Facial Uploads
By
Paula ParisiAugust 18, 2022
Stability AI is in the first stage of release of Stable Diffusion, a text-to-image generator similar in functionality to OpenAI’s DALL-E 2, with one important distinction: this open-source newcomer lacks the filters that prevent the earlier system from creating images of public figures or content deemed excessively toxic. Last week the Stable Diffusion code was made available to just over a thousand researchers and the Los Altos-based startup anticipates a public release in the coming weeks. The unfettered unleashing of a powerful imaging system has stirred controversy in the AI community, raising ethical questions. Continue reading Stability AI Releases Stable Diffusion Text-to-Image Generator
By
Paula ParisiAugust 12, 2022
OpenAI’s powerful text-to-image generator DALL-E 2 is still in beta, but businesses are already testing it for commercial use. Apparel firm Stitch Fix has been using it to visualize fabric and color personalization, while Heinz tapped the AI system for a marketing campaign. Cosmopolitan used it to design a magazine cover. Others have leveraged the image engine to generate logos and thumbnails. These early adopters are identifying technical issues that OpenAI says it is addressing as it readies DALL-E 2 for enterprise. Foremost among the complaints is the lack of a dedicated API for public use. Continue reading Businesses Experiment with DALL-E 2, Report Mixed Results