By
Paula ParisiNovember 14, 2024
Particle, the AI-powered news aggregator created by a pair of Twitter alums, has launched after a year in beta. The iOS app summarizes current events in quick hits the startup says do not violate the copyrights of publishers whose news it shares. Instead of simply scraping publishers’ work for proprietary use, the startup seeks to compensate publishers and drive traffic to news sites with prominent links to sources accompanying each AI news summary. Developed by Sara Beykpour and Marcel Molina, Particle has raised more than $11 million in early funding led by Lightspeed. Continue reading Particle Launches AI News App That Summarizes in Quick Hits
By
Paula ParisiNovember 14, 2024
Ernie, the foundation model for Baidu’s generative AI, has been updated with iRAG technology to mitigate visual hallucinations and a no-code tool called Miaoda that creates apps using natural language. The company behind China’s largest search engine says Ernie now handles 1.5 billion daily user queries, up from 50 million circa its March 2023 launch (a 30x increase). Baidu also debuted Ernie-powered smart glasses from its Xiaodu Technology hardware unit. The Xiaodu AI Glasses features built-in voice activation and cameras for taking photos and video. The news was shared at this week’s Baidu World 2024 in Shanghai. Continue reading Baidu’s Ernie AI Gets Improved Text-to-Image and App Builder
By
Paula ParisiOctober 29, 2024
In its first week of public beta, Anthropic’s “Computer Use” feature is gaining immediate traction, helping people do research and complete coding tasks. Claude works autonomously in Computer Use mode, suggesting broad implications for future productivity and workforce goals. Coming on the heels of OpenAI’s Swarm framework, these early forays into independent AI assistants seem to indicate that implementing such systems will be an area of focus for businesses in 2025. Claude can “see” what’s onscreen and use its “judgment” to adapt to different tasks, segueing across workflows and software. Continue reading Anthropic’s AI Agents for Claude Sonnet Increase Productivity
By
Paula ParisiOctober 3, 2024
OpenAI unveiled major updates at its DevDay conference with the focus largely on making AI more accessible, efficient and affordable. Included were four innovations: Vision Fine-Tuning in the API, Model Distillation, Prompt Caching and the public beta of Realtime API. The approach underscores OpenAI’s effort to empower its developer ecosystem even as it continues to compete for end-users in the enterprise space. The Realtime API gives developers the option of building “nearly real-time” speech-to-speech app experiences, selecting from among six OpenAI voices. Vision Fine-Tuning for GPT-4o enables customization of the model’s visual understanding of images and text. Continue reading OpenAI Showcases Latest Updates for Voice, Picture and More
By
Paula ParisiSeptember 17, 2024
Dolby Labs has introduced cloud-based solutions to support clients with real-time, interactive streaming capabilities. The announcement, made from IBC 2024 in Amsterdam, follows Dolby’s July acquisition of streaming tools provider THEO Technologies, which services top sports, media and entertainment companies worldwide. Dolby and THEO promise streaming that is “more interactive, personalized, and delivered with extremely low latency.” Dolby will also offer a new capability, THEOads, providing an advertising environment “that is optimized for the dynamic nature of live content.” Continue reading Dolby to Expand Its Cloud-Based Live Streaming with THEO
By
Paula ParisiSeptember 16, 2024
OpenAI is previewing a new series of AI models that can reason and correct complex coding mistakes, providing a more efficient solution for developers. Powered by OpenAI o1, the new models are “designed to spend more time thinking before they respond, much like a person would,” and as a result can “solve harder problems than previous models in science, coding, and math,” OpenAI claims, noting that “through training, they learn to refine their thinking process, try different strategies, and recognize their mistakes.” The first model in the series is being released in preview in OpenAI’s popular ChatGPT and in the company’s API. Continue reading OpenAI Previews New LLMs Capable of Complex Reasoning
By
Paula ParisiAugust 29, 2024
In a move toward increased transparency, San Francisco-based AI startup Anthropic has published the system prompts for three of its most recent large language models: Claude 3 Opus, Claude 3.5 Sonnet and Claude 3 Haiku. The information is now available on the web and in the Claude iOS and Android apps. The prompts are instruction sets that reveal what the models can and cannot do. Anthropic says it will regularly update the information, emphasizing that evolving system prompts do not affect the API. Examples of Claude’s prompts include “Claude cannot open URLs, links, or videos” and, when dealing with images, “avoid identifying or naming any humans.” Continue reading Anthropic Publishes Claude Prompts, Sharing How AI ‘Thinks’
By
Paula ParisiAugust 27, 2024
OpenAI announced its newest model, GPT-4o, can now be customized. The company said that the ability to fine-tune the multimodal GPT-4o has been “one of the most requested features from developers.” Customization can move the model toward more specific structure and tone of responses or allow it to follow specific instruction sets geared toward individual use cases. Developers can now implement custom datasets, aiming for better performance at a lower cost. The ChatGPT maker is rolling out the welcome mat by offering 1 million training tokens per day “for free for every organization” through September 23. Continue reading OpenAI Pushes GPT-4o Customization with Free Token Offer
By
Paula ParisiAugust 23, 2024
D-ID, a platform that uses AI to generate digital humans, has announced D-ID Video Translate in general availability. The tool lets businesses and content creators automatically re-voice videos in multiple languages, “cloning the speaker’s voice and adapting their lip movements from a single upload.” D-ID is making the Video Translate tool, which accommodates 30 different languages, free to D-ID subscribers for a limited time, available through the D-ID Studio or the company’s API. Languages include Arabic, Mandarin, Japanese, Hindi and Ukrainian, in addition to Spanish, German, French and Italian. Users can simultaneously translate content using bulk translation. Continue reading D-ID Employs AI to Translate Videos into Multiple Languages
By
Rob ScottAugust 1, 2024
Graphic design company Canva announced it is acquiring fellow Australian startup Leonardo AI with plans to have Leonardo’s 120 employees, including executives, join the Canva AI team. Financial terms of the deal were not disclosed. Sydney-based Leonardo has been gaining attention for its advanced generative AI platform that helps users create images and art based on the open-source Stable Diffusion model developed by Stability AI. The Leonardo team claims its offering is different than other AI art platforms since it provides users with more control. Users can experiment with text prompts and quick sketches as Leonardo.ai creates photorealistic images in real time. Continue reading Canva Aims to Boost Its GenAI Efforts with Leonardo Purchase
By
Paula ParisiJuly 30, 2024
An alternative app store called AltStore PAL recently launched in response to the European Union’s Digital Markets Act (DMA) and is now offering third-party iOS apps. The move comes several months after the company implemented an updated version of its open-source app marketplace in the EU. The DMA was enacted to foster competition, regulating Apple into opening up to rivals. Among AltStore PAL’s new offerings is iTorrent, which lets users download peer-to-peer files, and qBitControl, a remote client for iOS devices. Another app, PeopleDrop, automatically helps users connect to those nearby. Epic Games revealed it plans to offer “Fortnite” on AltStore PAL. Continue reading App Merchant AltStore PAL Bows in EU with a Focus on iOS
By
Paula ParisiJuly 24, 2024
Google has reconsidered its previously announced plan to turn off third-party tracking cookies in its Chrome browser in favor of an option to be controlled by consumers. The original plan was pushed back a few times but was expected to take place early next year. Competitors and regulators have raised concerns about the deprecation that would have left Google — which hauled in more than $237.86 billion in ad revenue last year — free to use its own tracking to serve targeted ads to those using Chrome. Google is now developing a new plan to let consumers make their own informed decisions about whether to allow third-party cookies. Continue reading Google Changes Direction with Plans for Third-Party Cookies
By
Paula ParisiJuly 15, 2024
New York-based speech synthesis software startup ElevenLabs has launched its latest AI development — Voice Isolator and an API to go with it. Voice Isolator is designed to extract background noise, leaving clear dialogue for film, podcast, and interview post-production. The Voice Isolator API lets developers integrate the new product into third-party applications. To use the technology, content is uploaded and processed by the Voice Isolator model, resulting in what the company claims is speech comparable in quality to that obtained in a recording studio. The app is described as “free, with some limitations.” Continue reading ElevenLabs Voice Isolator Audio Post Tool Released with API
Meta Platforms CEO Mark Zuckerberg recently announced that the company will test a feature to create AI characters through the AI Studio on Instagram that can engage with fans and respond to messages. “Rolling out an early test in the U.S. of our AI Studio so you might start seeing AIs from your favorite creators and interest-based AIs in the coming weeks on Instagram,” he wrote. “These will primarily show up in messaging for now, and will be clearly labeled as AI.” Zuckerberg noted the beta test will help the company improve AI characters and will be made “available to more people soon.” Meta launched AI Studio last year to help businesses build custom chatbots. Continue reading Meta Testing AI Chatbots for Instagram Created by Its Users
By
Paula ParisiMay 20, 2024
The Google Home API has been opened to developers that want to use the smart home devices and automations in apps. “Building on the foundation of Matter, we’ve re-envisioned Google Home as a platform for developers — all developers, not just those that build smart home devices,” the company announced at Google I/O. The new APIs provide access to over 600 million devices with a single integration and create the possibility for Google TVs to serve as smart home hubs. Google’s established partners have access to the Home APIs, and the company is now waitlisting other interested developers. Among the first partners are ADT and Eve. Continue reading Google Reimagines Home as Platform for All App Developers