By Paula Parisi, June 22, 2023
Vimeo is leveraging artificial intelligence to automate video editing. The company claims its new suite of AI tools enables the creation of “a fully produced video in minutes by generating scripts from text prompts, recording videos in one take, and editing content as easily as a Word doc.” Features include recording using a built-in screen teleprompter and the ability to quickly delete unwanted filler words (“ums” and “uhs”) and long pauses. The video hosting and sharing platform is rolling out the AI tools in July as part of the $20 per month standard subscription. Continue reading Vimeo Says Its AI Makes Video as Easy to Edit as Word Docs
By Paula Parisi, June 15, 2023
Meta Platforms continues to make progress on a mission to develop artificial intelligence that can teach itself to learn how the world works. Chief AI Scientist Yann LeCun has taken a special interest in developing the new model, called Image Joint Embedding Predictive Architecture, or I-JEPA, which learns by building an internal representation of the outside world and analyzing image abstracts instead of comparing pixels. The approach allows AI tech to learn more like humans do, with their ability to figure out complex tasks and adapt to new situations. Continue reading Meta Develops Computer Vision AI That Learns Like Humans
By Paula Parisi, June 13, 2023
Google-backed AI startup Runway has released Gen-2, an early entry among commercially available text-to-video models. Previously waitlisted in limited release, Gen-2 is now commercially available, a significant step since text-to-video is predicted to be the next big leap in artificial intelligence, following the explosion of AI use in generating text and images. While Runway’s solution may not be ready to serve as a professional video tool, it is the next step in the development of tech expected to impact media and entertainment. Filmmaker Joe Russo recently predicted that within the next two years, AI may have the ability to create feature films. Continue reading Runway Makes Next Advance in Consumer Text-to-Video AI
By Paula Parisi, May 23, 2023
Details are emerging about the text-based Twitter competitor being developed by Meta Platforms. What is being referred to internally as “Instagram’s new text-based app for conversations” will offer a feed of text posts of up to 500 characters, to which links, photos, and videos can be attached. The move comes as alternatives including Bluesky, Cohost, Hive, Mastodon and Substack try to gain market share by luring disaffected Twitter users to their platforms. Instagram’s entry in progress, codenamed “P92” and alternately referred to as “Barcelona,” may soon be interoperable with all of them. Continue reading Meta Testing Decentralized Instagram App as Rival to Twitter
By Paula Parisi, May 15, 2023
Meta Platforms has built and is open-sourcing ImageBind, an artificial intelligence that combines six modalities: audio, visual, text, thermal, movement and depth data. Currently a research project, it suggests a future in which AI models generate multisensory content. “ImageBind equips machines with a holistic understanding that connects objects in a photo with how they will sound, their 3D shape, how warm or cold they are, and how they move,” Meta says. In other words, ImageBind’s approach more closely approximates human thinking by training on the relationship between things rather than ingesting massive datasets so as to absorb every possibility. Continue reading Meta’s Open-Source ImageBind Works Across Six Modalities
By Paula Parisi, May 8, 2023
Microsoft’s AI-powered Bing search engine has been drawing in excess of 100 million daily active users and logged half a billion chats. With OpenAI’s GPT-4 and DALL-E 2 models driving the action, it has also created over 200 million images since debuting in limited preview in February. Seeking to build on that momentum, Microsoft is adding new features and integrating Bing more tightly with its Edge browser. The company is also ditching its waitlist in a move to open preview. “We’re underway with the transformation of search,” CVP and consumer CMO Yusuf Mehdi said at a preview event last week. Continue reading Microsoft’s Next Generation of Bing AI Interacts with Images
By Paula Parisi, March 27, 2023
After several months of testing, Anthropic is making its AI chatbot Claude available for general release in two configurations: the high-performance Claude and a lighter, cheaper, faster option called Claude Instant. Anthropic was launched in 2021 by a pair of former OpenAI employees, and its Claude chatbots are competitors to that firm’s ChatGPT. Accessible through a chat interface and API in Anthropic’s developer console, Claude is being marketed as the product of training designed to produce “helpful, honest, and harmless AI systems.” To that end, Anthropic says “Claude is much less likely to produce harmful outputs.” Continue reading Anthropic Takes Claude Chatbot Public After Months of Tests
By Paula Parisi, March 16, 2023
OpenAI has released GPT-4, which it says is a more powerful and reliable version of the artificial intelligence technology powering its viral ChatGPT chatbot. GPT-4 can analyze images and handle larger blocks of text and is generally “more creative and collaborative” than earlier iterations when it comes to things like composing songs, writing screenplays and mimicking a user’s authorial style. “GPT-4 can solve difficult problems with greater accuracy, thanks to its broader general knowledge and problem-solving abilities,” OpenAI says. GPT-4 is already driving the chatbot technology behind Microsoft’s Bing AI search engine, now in beta. Continue reading OpenAI Announces Official Launch of GPT-4 Multimodal Tech
By Paula Parisi, March 16, 2023
Google is readying an API and other enterprise tools for its Pathways Language Model (PaLM), a large language model similar to GPT, to encourage developers to create chatbots and other apps using the platform. PaLM is one of Google’s most advanced systems, with the capability to generate text, images, code, video and audio from natural language prompts. Much like OpenAI’s GPT series and the LLaMA family from Meta Platforms, it is suitable for a wide variety of general tasks. To facilitate PaLM’s use for specific tasks, Google is launching MakerSuite along with the PaLM API. Continue reading Google’s PaLM API, MakerSuite Coming to Select Developers
By Paula Parisi, March 15, 2023
Chat app Discord is expanding the use of artificial intelligence on its platform, including the addition of OpenAI technology to its chatbot and moderation features. Discord says it has 150 million users across 19 million interest groups, called “servers,” that communicate using text, audio and video chat. Discord’s Midjourney text-to-image generation group is its largest community, with more than 13 million members. “Harnessed properly, AI can fundamentally enhance and empower genuine human connection,” Discord CEO Jason Citron said at a press event last week, heralding “the most exciting moments in technology emerging.” Continue reading Discord Integrates OpenAI Tech, Updates AI-Driven Features
By Paula Parisi, March 9, 2023
Reddit is introducing changes designed to make it easier for users to browse and navigate its communities. Currently in testing is a concept that splits text and video into separate streams, dubbed “Read” and “Watch.” Users can toggle between the split-view feeds. In the current format, both “Read” and “Watch” will include recommendations as well as posts from communities that users subscribe to. “In 2023, the product and design improvements you’ll see from us will simplify and streamline how people discover, join, and contribute (post, vote, comment) to communities and bring new ways to engage in conversations and content,” Reddit explains. Continue reading Reddit Tests Split-View Text and Video Feeds, Other Updates
By Paula Parisi, March 6, 2023
Microsoft researchers have unveiled Kosmos-1, a new AI model the company says analyzes images for content, performs visual text recognition, solves visual puzzles and passes visual IQ tests. It also understands natural language instructions. The new model is what’s known as multimodal AI, which means it works with different types of input, from text to audio and video. Mixing media is a key step in building artificial general intelligence (AGI) that can perform tasks in a manner approximating human performance. Examples from a Kosmos-1 research paper show it can effectively analyze images and answer questions about them. Continue reading Microsoft Unveils AI Model That Comprehends Image Content
By Paula Parisi, February 21, 2023
With language models like ChatGPT dominating recent tech news, Meta Platforms has unveiled a new artificial intelligence platform of its own called Toolformer, which breaks new ground in that it can teach itself to use external apps and APIs. The result, Meta says, is that Toolformer retains the conversational aptitude and other strengths of large language models while shoring up the areas in which they typically do not excel, like math and fact-checking, by figuring out how to use external tools like search engines, calculators and calendars. Continue reading Meta Toolformer Sidesteps AI Language Limits with API Calls
By Paula Parisi, February 10, 2023
GlossAi can turn full-length videos, or even whole libraries of video and podcast content, into an array of short clips and posts suitable for dissemination across a wide swathe of outlets. The Israel-based firm has raised $8 million in a seed round as it enters an emerging market in which Adobe and AI startup QuickVid are already playing, but no single app has definitively taken hold. GlossAi has the ability to take a video and automatically generate not only a highlight reel, but also things like 15-second snippets, blog posts (from a transcript), slide decks and more. Continue reading GlossAi Content Propagation App Raises $8M in Seed Round
By Paula Parisi, February 6, 2023
Alphabet is touting artificial intelligence advances as it faces disappointing Q4 earnings, with CEO Sundar Pichai, who is also CEO of Google, telling analysts the company will soon share its own generative AI system with the public, competing head-on with OpenAI’s ChatGPT and DALL-E. “In the coming weeks and months, we’ll make these language models available, starting with LaMDA, so that people can engage directly with them,” Pichai said. Google’s parent company reported a 3.6 percent decline in core ad revenue, at $59 billion in Q4, while overall revenue was up 1 percent to $76 billion. Continue reading Alphabet Reveals Major AI Push, Plans to Take On ChatGPT