Adobe Pursues Ethical, Responsible AI in the Creative Space

As a next step in its advances in ethical AI, Adobe has announced that its Firefly generative AI platform now supports text prompts in more than 100 languages. The company says its Firefly AI has generated over one billion images in Firefly and Photoshop since its implementation in March. Adobe has also deployed artificial intelligence in Express, Illustrator and the Creative Cloud. Positioning its latest news as a global expansion, Adobe says its generative AI products will now support text prompts in native dialects in the standalone Firefly web service, with localization coming to more than 20 additional languages.

Meta Develops Computer Vision AI That Learns Like Humans

Meta Platforms continues to make progress on a mission to develop artificial intelligence that can teach itself how the world works. Chief AI Scientist Yann LeCun has taken a special interest in developing the new model, called Image Joint Embedding Predictive Architecture, or I-JEPA, which learns by building an internal representation of the outside world and analyzing image abstracts instead of comparing pixels. The approach allows AI tech to learn more like humans do, with their ability to figure out complex tasks and adapt to new situations.

Runway Makes Next Advance in Consumer Text-to-Video AI

Google-backed AI startup Runway has released Gen-2, an early entry among commercially available text-to-video models. Gen-2 was previously waitlisted in limited release, and its commercial availability is significant because text-to-video is predicted to be the next big advance in artificial intelligence, following the explosion of AI use in generating text and images. While Runway’s solution may not be ready to serve as a professional video tool, it is the next step in the development of tech expected to impact media and entertainment. Filmmaker Joe Russo recently predicted that within the next two years, AI may have the ability to create feature films.

Photo App Reimagine Brings Old Images to Life with AI Tools

Family history platform MyHeritage is releasing a mobile app called Reimagine that enables high-speed scanning of entire album pages to complement the company’s AI tools for restoring, and even facially animating, historical photos. Now users can easily import printed photos stored in albums by snapping pictures of the pages on their iOS or Android device. The app will separate the individual photos, cropping and saving them as standalone images to which metadata can be added for indexing. The app also works with individual photos, or digital uploads from a camera roll.

Twitter Community Notes Aim to Curb Impact of Fake Images

Twitter is emphasizing crowdsourced moderation. The launch of Community Notes for images in posts seeks to address instances where morphed or AI-generated images are posted. The idea is to expose altered content before it goes viral, as did the image of Pope Francis wearing a Balenciaga puffy coat in March and the fake image of an explosion at the Pentagon in May. Twitter says Community Notes about an image will appear with “recent and future” posts containing the graphic in question. Currently in the test phase, the feature works with tweets featuring a single image.

Meta’s Open-Source ImageBind Works Across Six Modalities

Meta Platforms has built and is open-sourcing ImageBind, an artificial intelligence that combines six modalities: audio, visual, text, thermal, movement and depth data. Currently a research project, it suggests a future in which AI models generate multisensory content. “ImageBind equips machines with a holistic understanding that connects objects in a photo with how they will sound, their 3D shape, how warm or cold they are, and how they move,” Meta says. In other words, ImageBind’s approach more closely approximates human thinking by training on the relationship between things rather than ingesting massive datasets so as to absorb every possibility.

Microsoft’s Next Generation of Bing AI Interacts with Images

Microsoft’s AI-powered Bing search engine has been drawing in excess of 100 million daily active users and has logged half a billion chats. With OpenAI’s GPT-4 and DALL-E 2 models driving the action, it has also created over 200 million images since debuting in limited preview in February. Seeking to build on that momentum, Microsoft is adding new features and integrating Bing more tightly with its Edge browser. The company is also ditching its waitlist in a move to open preview. “We’re underway with the transformation of search,” CVP and consumer CMO Yusuf Mehdi said at a preview event last week.

Pinterest Sets Multiyear Deal with Amazon for Third-Party Ads

Image-sharing social platform Pinterest has named Amazon as its first third-party ad partner. The multiyear strategic partnership will see the e-commerce giant marketing various brands and products on Pinterest and porting interested shoppers back to its site to complete the sale for “a seamless on-Amazon buying experience.” The integration will begin later this year and roll out over several quarters. The news was timed to Pinterest’s Q1 results, which saw revenue up by 5 percent year-over-year to $603 million. The number of global monthly active users also increased, by 7 percent to 463 million, a gain of 13 million.

Microsoft’s Bing Chat Powers a New Approach to Advertising

As Microsoft ushers in Kya Sainsbury-Carter to head its $18 billion digital advertising business, Bing Chat is joining her at center stage. The company has plans for generative AI to transform the category, including with paid links in chat results. Since February the company has been testing ads in Bing Chat searches. Microsoft hasn’t disclosed how many people are using the new Bing with AI chat, nor how many ads it has served. Bing Chat’s responses include footnoted links to resources amplifying the information in the chatbot’s conversational answers, but sometimes it links to paid search ads.

Canva Launches New Branding Features and Magic AI Tools

Canva, the web-based design platform, is debuting “Magic” AI-powered tools that can automate a variety of tasks, from logo design to video editing. The idea is to empower people without design training to do these things, and more. Infographics, advertising materials, illustrations and presentations are among the types of output Canva AI offers. The company is also adding brand management tools to its Visual Worksuite, including a Brand Hub that provides assets for creative application, with permission settings that can restrict off-brand use of things like color or fonts.

Microsoft Introduces Visual AI Tools to Bing, Edge Platforms

Microsoft is bringing Bing Image Creator to the new Bing search engine and Edge browser. Powered by an advanced version of the DALL-E model from OpenAI, the new tools will allow users to generate images using word prompts to describe what they want to create. The news comes as Microsoft says its new Bing AI Copilot has had “more than 100 million chats to date,” with people using it to refine answers to complex questions or as entertainment or creative inspiration. Bing data indicates images are one of the most searched categories, second only to general web searches, according to Microsoft.

OpenAI Announces Official Launch of GPT-4 Multimodal Tech

OpenAI has released GPT-4, which it says is a more powerful and reliable version of the artificial intelligence technology powering its viral ChatGPT chatbot. GPT-4 can analyze images and handle larger blocks of text and is generally “more creative and collaborative” than earlier iterations when it comes to things like composing songs, writing screenplays and mimicking a user’s authorial style. “GPT-4 can solve difficult problems with greater accuracy, thanks to its broader general knowledge and problem-solving abilities,” OpenAI says. GPT-4 is already driving the chatbot technology behind Microsoft’s Bing AI search engine, now in beta.

Google’s PaLM API, MakerSuite Coming to Select Developers

Google is readying an API and other enterprise tools for its Pathways Language Model (PaLM), a large language model similar to GPT, to encourage developers to create chatbots and other apps using the platform. PaLM is one of Google’s most advanced systems, with the capability to generate text, images, code, video and audio from natural language prompts. Much like OpenAI’s GPT series and the LLaMA family from Meta Platforms, it is suitable for a wide variety of general tasks. To facilitate PaLM’s use for specific tasks, Google is launching the MakerSuite along with the PaLM API.

Microsoft Unveils AI Model That Comprehends Image Content

Microsoft researchers have unveiled Kosmos-1, a new AI model the company says analyzes images for content, performs visual text recognition, solves visual puzzles and passes visual IQ tests. It also understands natural language instructions. The new model is what’s known as multimodal AI, which means it works with different types of input, from text to audio and video. Mixing media is a key step in building artificial general intelligence (AGI) that can perform tasks in a manner approximating human performance. Examples from a Kosmos-1 research paper show it can effectively analyze images, answering questions about them.

QuickVid Uses AI to Create Short Videos from Text Prompts

QuickVid is a new AI-driven text-to-video platform aiming for a mass-market user base. The tool draws on various generative AI systems to automatically create short-form videos for YouTube, Instagram, TikTok and other platforms. Created by former Meta Platforms programmer Daniel Habib “in a matter of weeks,” QuickVid is quite rudimentary, though Habib says he plans to continue fine-tuning and adding features. Unlike Google and Meta with their nascent text-to-video systems, QuickVid has bypassed the formalities of research papers and industry previews and jumped directly to a public-facing website.