By
Paula ParisiApril 1, 2025
Chinese smartphone giant Vivo is entering the XR headset market with a device called the Vivo Vision that is drawing comparisons to Apple’s Vision Pro in name and looks. The headset debut coincides with the announcement of the Vivo Robotics Lab, signaling a strategic expansion beyond mobile phones. Vivo EVP and COO Hu Baishan said that AI and robotics currently represent the height of technological achievement in the digital and physical worlds, and that the mobile phone industry, with its massive consumer base and advanced infrastructure is well-positioned to bridge the two worlds, “blending digital connectivity with physical capabilities.” Continue reading Smartphone Maker Vivo Intros Vision XR and Robotics Group
By
Douglas ChanMarch 31, 2025
During Nvidia’s GTC AI Conference in San Jose earlier this month, VP and GM of Media & Entertainment Richard Kerris presented the Nvidia Media2 initiative that builds on the company’s Blackwell GPU foundation to enable real-time AI solutions for all aspects of media production workflows. His talk showcased a broad range of generative AI breakthroughs in real-time ray tracing and VFX, video search and summarization, and musically-based sound effects (SFX). Kerris also shared insights on the media industry’s reception to AI thus far and humbly implored the audience to consider using such technology as an effective new tool for storytelling. Continue reading Nvidia Forges AI Initiative to Streamline Production Workflows
By
Rob ScottMarch 31, 2025
Just prior to the start of the weekend, Elon Musk announced that his artificial intelligence company xAI is acquiring his social media platform X (formerly Twitter) “in an all-stock transaction,” valuing xAI at $80 billion and X at $33 billion ($45 billion less $12 billion in debt). The merger has the potential to create a powerful GenAI-powered content platform. The billionaire purchased Twitter in late 2022 for $44 billion, following months of legal skirmishes. According to Musk, X currently touts more than 600 million active users, while “xAI has rapidly become one of the leading AI labs in the world, building models and data centers at unprecedented speed and scale.” Continue reading Elon Musk Announces xAI Corporation Will Purchase X Social
By
Paula ParisiMarch 31, 2025
Tech firm Infinite Reality — which specializes in AI-powered 3D immersive experiences — has agreed to pay $207 million for Napster, the 26-year-old music streaming service. The sellers are crypto investment firm Hivemind Capital Partners and blockchain firm Algorand, that acquired the platform in 2022. Infinite Reality is privately held, listing among its investors Liberty Media, Live Nation, MGM, T-Mobile and Barry Diller’s IAC. The company plans to steer Napster to superfan experiences, making it “more immersive, more social, and more shoppable.” Napster CEO Jon Vlassopulos, former global music chief at Roblox, will continue in his current post. Continue reading Infinite Reality Agrees to Acquire Napster in $207 Million Deal
By
Paula ParisiMarch 28, 2025
China’s Ant Group is using local semiconductors to train AI at a cost that is 20 percent less than companies typically spend, according to reports. Ant used domestic chips — from companies including Alibaba, an investor in Ant, and Huawei — to launch a unique Mixture of Experts (MoE) training approach that produced results commensurate to training with Nvidia H800 chips. Ant is the latest Chinese company to focus on low cost training, joining a competition triggered by DeepSeek, which in January announced it could build AI comparable to the models released by U.S. companies like OpenAI, Anthropic and Google for billions less. Continue reading Ant Group Stacks Chips to Reduce Development Costs for AI
By
Paula ParisiMarch 28, 2025
Alibaba Cloud has released Qwen2.5-Omni-7B, a new AI model the company claims is efficient enough to run on edge devices like mobile phones and laptops. Boasting a relatively light 7-billion parameter footprint, Qwen2.5-Omni-7B understands text, images, audio and video and generates real-time responses in text and natural speech. Alibaba says its combination of compact size and multimodal capabilities is “unique,” offering “the perfect foundation for developing agile, cost-effective AI agents that deliver tangible value, especially intelligent voice applications.” One example would be using a phone’s camera to help a vision impaired-person navigate their environment. Continue reading Alibaba’s Powerful Multimodal Qwen Model Is Built for Mobile
By
Paula ParisiMarch 27, 2025
OpenAI has activated the multimodal image generation capabilities of GPT-4o, making it available to ChatGPT users on the Plus, Pro, Team and Free tiers. It replaces DALL-E 3 as the default image generator for the popular chatbot. GPT-4o’s accuracy with text, understanding of symbols and precision with prompts combined with well multimodal capabilities that allow the model to take cues from visual material have transformed its image capabilities from largely unpredictable to “consistent and context-aware,” resulting in “a practical tool with precision and power,” claims OpenAI. Continue reading OpenAI Delivers Native GPT-4o Image Generator to ChatGPT
By
Paula ParisiMarch 27, 2025
Google has released what it calls its most intelligent AI model yet, Gemini 2.5. The first 2.5 model release, an experimental version of Gemini 2.5 Pro, is a next-gen reasoning model that Google says outperformed OpenAI o3-mini and Claude 3.7 Sonnet from Anthropic on common benchmarks “by meaningful margins.” Gemini 2.5 models “are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy,” according to Google. The new model comes just three months after Google released Gemini 2.0 with reasoning and agentic capabilities. Continue reading Google Debuts Next-Gen Reasoning Models with Gemini 2.5
By
Paula ParisiMarch 27, 2025
Microsoft is debuting a suite of security agents for Copilot that will take over repetitive and rote tasks burdening cybersecurity teams. This next evolution of Security Copilot with AI agents is designed to autonomously assist in critical areas such as phishing, data security, and identity management. “The relentless pace and complexity of cyberattacks have surpassed human capacity and establishing AI agents is a necessity for modern security,” notes the company. Microsoft Threat Intelligence is processing 84 trillion signals per day, indicating exponential growth in cyberattacks, including 7,000 password attacks per second, the company says. Continue reading Microsoft Is Combating Security Threats with Copilot Agents
By
Paula ParisiMarch 25, 2025
Google has added a Canvas feature to its Gemini AI chatbot that provides users with a real-time collaborative space where writing and coding projects can be refined and other ideas iterated and shared. “Canvas is designed for seamless collaboration with Gemini,” according to Gemini Product Director Dave Citron, who notes that Canvas makes it “an even more effective collaborator” in helping bring ideas to life. The move marks a trend whereby AI companies are trying to turn chatbot platforms into turnkey productivity suites. Google is launching a limited release of Gemini Live Video in addition to bringing its Audio Overview feature of NotebookLM to Gemini. Continue reading Canvas and Live Video Add Productivity Features to Gemini AI
By
Paula ParisiMarch 25, 2025
Anthropic’s Claude can now search the Internet in real time, allowing it to provide timely and relevant responses that are also more accurate than what the chatbot previously offered, according to the company. Claude incorporates direct citations for its Web-retrieved material, so users can fact-check its sources. “Instead of finding search results yourself, Claude processes and delivers relevant sources in a conversational format.” While this is not exactly groundbreaking — ChatGPT, Grok 3, Copilot, Perplexity and Gemini all have real-time Web retrieval and most include citations — Claude takes a slightly different approach. Continue reading Real-Time Web Access Informs Claude 3.7 Sonnet Responses
By
Paula ParisiMarch 25, 2025
Search firm Perplexity AI has renewed its push to acquire TikTok, outlining its vision for “Rebuilding TikTok in America.” As ByteDance approaches its extended deadline of April 5 to sell TikTok or see it banned here in the U.S., Oracle and its cohort of investors have emerged the frontrunners. While the three-year-old Perplexity is a longshot — with observers saying it does not have the cash on hand to purchase the social powerhouse — with deep-pocketed investors including Nvidia, Databricks and Amazon founder Jeff Bezos, it likely has access to funding should its offer be accepted. Continue reading Perplexity AI Outlines Pitch to Acquire TikTok, Rebuild for U.S.
By
Paula ParisiMarch 24, 2025
Japanese tech investment firm Softbank has agreed to acquire Silicon Valley chip startup Ampere for $6.5 billion, indicating that technology originating in smartphones will eventually become integral to global data centers and the future of artificial intelligence. The eight-year-old Ampere sells chips based on Arm technology, the processor type used in virtually all mobile phones. SoftBank purchased Arm in 2016 and has since been working to ensure the technology becomes used more broadly. Softbank says it will allow Ampere to retain its own name, operating it as a wholly-owned subsidiary. Continue reading Softbank Agrees to Acquire Chipmaker Ampere for $6.5 Billion
By
Paula ParisiMarch 24, 2025
OpenAI has debuted three new models for transcription and voice generation — gpt-4o-transcribe, gpt-4o-mini-transcribe and gpt-4o-mini-tts. The text-to-speech and speech-to-text AI models are designed to help developers create AI agents with highly customizable voices. OpenAI claims these models will power natural and responsive voice agents, moving AI out of the text-based communications stage and into intuitive spoken conversations. The suite outperforms existing solutions in accuracy and reliability, OpenAI says, especially with “accents, noisy environments, and varying speech speeds,” making them well-suited for customer call centers and meeting notes. Continue reading OpenAI Pushes Conversational Agents with Three New Models
By
Paula ParisiMarch 24, 2025
Chat interfaces powered by generative AI are impacting online shopping, according to an Adobe Analytics study that found that AI-influenced visits to U.S. retail website increased by 1,200 percent from July 2024 to February 2025. Adobe says this “significant surge” demonstrates an emerging retail AI economy. GenAI chat interfaces are “becoming a helpful assistant for compiling research before making a purchase,” influencing how consumers behave online, according to Adobe. While paid search and email continue to be the dominant traffic drivers, the past year’s growth in AI-aided shopping signals a shift. Continue reading Adobe Analytics: AI-Powered Online Shopping Surges in U.S.