The Browser Company is Building Dia, an AI-First Web Browser

“AI won’t exist as an app, or a button… it’ll be an entirely new environment built on top of a web browser.” That is the pitch from The Browser Company, the New York-based firm behind the Arc browser that is now developing an AI-first web interface called Dia, expected to debut early next year. Dia aims to leverage AI tools to simplify common Internet tasks. The repertoire is now a familiar one, with things like writing assists and inspirational prompts becoming AI givens in a competitive field where Microsoft Copilot and Google Gemini are already established. The Browser Company is trying to distinguish Dia with a simple, user-friendly interface. Continue reading The Browser Company is Building Dia, an AI-First Web Browser

Hume AI Introduces Voice Control and Claude Interoperability

Artificial voice startup Hume AI has had a busy Q4, introducing Voice Control, a no-code artificial speech interface that gives users control over 10 voice dimensions ranging from “assertiveness” to “buoyancy” and “nasality.” The company also debuted an interface that “creates emotionally intelligent voice interactions” with Anthropic’s foundation model Claude that has prompted one observer to ponder the possibility that keyboards will become a thing of the past when it comes to controlling computers. Both advances expand on Hume’s work with its own foundation model, Empathic Voice Interface 2 (EVI 2), which adds emotional timbre to AI voices. Continue reading Hume AI Introduces Voice Control and Claude Interoperability

Qwen with Questions: Alibaba Previews New Reasoning Model

Alibaba Cloud has released the latest entry in its growing Qwen family of large language models. The new Qwen with Questions (QwQ) is an open-source competitor to OpenAI’s o1 reasoning model. As with competing large reasoning models (LRMs), QwQ can correct its own mistakes, relying on extra compute cycles during inference to assess its responses, making it well suited for reasoning tasks like math and coding. Described as an “experimental research model,” this preview version of QwQ has 32-billion-parameters and a 32,000-token context, leading to speculation that a more powerful iteration is in the offing. Continue reading Qwen with Questions: Alibaba Previews New Reasoning Model

AWS Opens Physical Locations for Fast, Secure Data Uploads

Amazon Web Services has opened AWS Data Transfer Terminals in Los Angeles and New York. These secure physical locations allow customers to bring their storage devices for fast uploads to the AWS Cloud. The enterprise service can significantly reduce data ingestion time for use cases including uploads of “large datasets from fleets of vehicles collecting data in metro areas for training machine learning models” as well as “digital audio and video files from content creators for media processing workloads” and local government organizations compiling geographical and other smart city data. Continue reading AWS Opens Physical Locations for Fast, Secure Data Uploads

Bertelsmann and ElevenLabs Team Up to Foster AI Production

German media company Bertelsmann has partnered with AI startup ElevenLabs on an effort to drive tech innovation and workflow across Bertelsmann production, marketing and distribution. Bertelsmann operations span roughly 50 countries with businesses including the publisher Penguin Random House, record label BMG and the RTL Group television unit. The objective is for ElevenLabs tools in voice and audio generation to help Bertelsmann expand productivity and reach. In August, New York-based ElevenLabs opened a European headquarters in London, expanding its international footprint for text-to-speech and other audio apps. Continue reading Bertelsmann and ElevenLabs Team Up to Foster AI Production

Couchbase Capella AI Helps Deploy Agents, Models, Services

Couchbase, the publicly traded data platform for developers, has launched Capella AI Services with the aim of simplifying the process of developing and deploying agentic AI apps for enterprise clients. Capella AI joins the company’s flagship Couchbase Capella cloud data platform. AI offerings include model hosting, automated vectorization, unstructured data preprocessing and AI agent catalog services. Couchbase’s goal is to “allow organizations to prototype, build, test and deploy AI agents” while giving developers control over data across the development lifecycle, including secure data mitigation for large language models running outside the organization. Continue reading Couchbase Capella AI Helps Deploy Agents, Models, Services

Luma AI Upgrades Its Video Generator and Adds Image Model

Anticipating what one outlet calls “the likely imminent release of OpenAI’s Sora,” generative AI video competitors are compelled to step up their game. Luma AI has released a major upgrade to its Dream Machine, speeding its already quick video generation and enabling a chat function for natural language prompts, so you can talk to it as with OpenAI’s ChatGPT. In addition to the new interface, Dream Machine is going mobile and adding a new foundation image model, Luma AI Photon, which “has been purpose built to advance the power and capabilities of Dream Machine,” according to the company. Continue reading Luma AI Upgrades Its Video Generator and Adds Image Model

Lightricks LTX Video Model Impresses with Speed and Motion

Lightricks has released an AI model called LTX Video (LTXV) it says generates five seconds of 768 x 512 resolution video (121 frames) in just four seconds, outputting in less time than it takes to watch. The model can run on consumer-grade hardware and is open source, positioning Lightricks as a mass market challenger to firms like Adobe, OpenAI, Google and their proprietary systems. “It’s time for an open-sourced video model that the global academic and developer community can build on and help shape the future of AI video,” Lightricks co-founder and CEO Zeev Farbman said. Continue reading Lightricks LTX Video Model Impresses with Speed and Motion

Google Offers Spotify Extension for Gemini Mobile Ecosystem

Google has added a Gemini extension that lets users link their Spotify accounts and leverage the AI for music search and discovery. Currently only for Android in English, the app accepts spoken and text prompts to select music by song, album, artist or playlist using “Play & Search.” Only Spotify Premium subscribers will be able to request and play specific tunes on demand. And while users will be able to use Gemini to activate existing playlists or pipe music themed to an activity or mood (like workouts or romantic meals), it cannot create a Spotify playlist or radio. Continue reading Google Offers Spotify Extension for Gemini Mobile Ecosystem

Anthropic Protocol Intends to Standardize AI Data Integration

Anthropic is releasing what it hopes will be a new standard in data integration for AI. Called the Model Context Protocol (MCP), its goal is to eliminate the need to customize each integration by having code written each time a company’s data is connected to a model. The open-source MCP tool could become a universal way to link data sources to AI. The aim is to have models querying databases directly. MCP is “a new standard for connecting AI assistants to the systems where data lives, including content repositories, business tools, and development environments,” according to Anthropic. Continue reading Anthropic Protocol Intends to Standardize AI Data Integration

Nvidia AI Model Fugatto a Breakthrough in Generative Sound

Nvidia has unveiled an AI sound model research project called Fugatto that “can create any combination of music, voices and sounds” based on text and audio inputs. Described by Nvidia as “the world’s most flexible sound machine,” many appear to agree that the new model represents an audio breakthrough, with the potential to generate a wide array of sounds that have not previously existed. While popular sound models from companies including Suno and ElevenLabs “can compose a song or modify a voice, none have the dexterity of the new offering,” Nvidia claims. Continue reading Nvidia AI Model Fugatto a Breakthrough in Generative Sound

Microsoft’s Windows 365 Link a Mini PC for Cloud Streaming

Microsoft is releasing a mini PC designed specifically to access the cloud version of Windows. The $349 Windows 365 Link is a compact, fanless system that will connect local monitors and peripherals to Windows in the cloud. Microsoft plans to bring it to market next year as a companion to the Windows 365 cloud suite, helping companies transition employees to virtual machines. The concept is described as the first move toward a new type of “boot to cloud PCs” that offer little in the way of versatility, but are cheap, easy to operate and secure. Continue reading Microsoft’s Windows 365 Link a Mini PC for Cloud Streaming

Free Streaming Video Platform Plex Redesigns User Interface

Subscription-free streaming platform Plex — which features more than 600 channels, movies and TV shows — has redesigned its user interface to emphasize discovery and personalization. The new look is available in preview on mobile, with a public rollout planned in the weeks to come. “For personal media pros, we’ve centralized media libraries into a dedicated tab,” explains Plex, noting it’s added an option to save favorite libraries and quickly access power-user features. The Watchlist now has a dedicated spot in the navigation bar. Customers are using Plex to “find any title, anytime,” then linking it across services or adding it to the Watchlist. Continue reading Free Streaming Video Platform Plex Redesigns User Interface

GitHub Promotes Open-Source Security with Funding Initiative

The GitHub Secure Open Source Fund will award financing to select applicants in a program designed to fuel security and sustainability for open-source projects. Applications are open now and close on January 7. During that time, 125 projects will be selected for a piece of the $1.25 million investment fund, made possible through the participation of American Express, the Alfred P. Sloan Foundation, Chainguard, HeroDevs, Kraken, Mayfield Fund, Microsoft, Shopify, Stripe and others. In addition to monetary support, recipients will be invited to take part in a three-week educational program. Continue reading GitHub Promotes Open-Source Security with Funding Initiative

Google DeepMind Touts AI-Powered Quantum Error Detection

Google DeepMind has come up with an error correction technique it says will make quantum computers more reliable, particularly at scale. While quantum computing holds tremendous promise — potentially able to solve in just a few hours problems it would take a conventional computer “billions of years” to figure out, Google claims — the systems are notoriously unstable, due to the delicacy of the “quantum state.” AlphaQubit is an AI-based decoder that identifies quantum computing errors with accuracy. Combining DeepMind’s machine learning expertise with Google Quantum AI error correction, the technique advances efforts to create a reliable quantum computer. Continue reading Google DeepMind Touts AI-Powered Quantum Error Detection