By
Paula ParisiJanuary 14, 2025
Nvidia Cosmos, a platform of generative world foundation models (WFMs) and related tools to advance the development of physical AI systems like autonomous vehicles and robots, was introduced at CES 2025. Cosmos WFMs are designed to provide developers a way to generate massive amounts of photo-real, physics-based synthetic data to train and evaluate their existing models. The goal is to reduce costs by streamlining real-world testing with a ready data pipeline. Developers can also build custom models by fine-tuning Cosmos WFMs. Cosmos integrates Nvidia Omniverse, a physics simulation tool used for entertainment world-building. Continue reading CES: Nvidia’s Cosmos Models Teach AI About Physical World
By
Yves BergquistJanuary 14, 2025
CES has regularly featured robots over the years, but we’ve never really seen anything pivotal. CES 2025 marked a change in this area. “The ChatGPT moment for robotics is just around the corner,” said Nvidia CEO Jensen Huang in his keynote, and we couldn’t agree more. And while attention was focused on LLMs, the field of industrial robotics has been unleashed like never before. According to World Robotics 2024, the International Federation of Robotics’ recent report, 4.3 million units were deployed in factories worldwide as of Q3 2024, a number that’s increasing at a clip of half a million units per year. This is double from 7 years ago, and the trend is accelerating. Continue reading CES: Is the ChatGPT Moment for Robotics Around the Corner?
By
Paula ParisiJanuary 10, 2025
OpenAI has unveiled a new frontier model, OpenAI o3, which it claims can “reason” through challenges involving math, science and computer programming. Available to safety and research testers, it is expected to be available to individuals and businesses this year. OpenAI o3 is said to be over 20 percent more efficient at common programming tasks than its predecessor OpenAI o1 and beat a company scientist on a programming test. Model o3 is part of a broader effort to create AI systems that can reason through complex problems. In late December Google debuted a similar platform, the experimental Gemini 2.0 Flash Thinking Mode. Continue reading OpenAI Previews Two New Reasoning Models: o3 and o3-Mini
By
Debra KaufmanJanuary 9, 2025
Brands Mastercard and MGM Resorts International, the Ad Council and advertising technology company XR Extreme Reach (XR) gathered for a CES panel discussion on how real-time AI metrics can help increase representation in ads, thus boosting greater marketing ROI and audience trust. It was moderated by The Female Quotient Chief Executive Shelley Zalis, whose company collaborated with XR to unveil, in October, the Representation Index (RX) to measure inclusivity in global advertising. XR’s SVP of Enterprise Solutions Kristin Wnuk was also there to describe her company’s work in the space. Continue reading CES: Utilizing Real-Time AI to Measure Representation in Ads
By
Yves BergquistJanuary 9, 2025
In the never-ending smorgasbord of AI hype, “agents” represent practical and worthwhile potential. AI agents are autonomous AI programs that can understand some context and take action in that context. Agents can autonomously perform a task that involves mapping a goal to its context and parameters (even if they’re not explicitly laid out), process data across multiple formats and ontologies to understand the goal and work through the task, call multiple functions across multiple apps, and take some action to achieve the goal. Unfortunately, however, while many are talking about AI agents, few are promoting actual products at CES. Continue reading CES: Show Features a Surprisingly Small Number of AI Agents
By
Paula ParisiJanuary 9, 2025
BMW has revealed an upcoming release of its iDrive operating system that essentially turns the entire windshield into a 3D heads-up display. The “close-to-production” version of BMW Panoramic Vision showcased at CES 2025 integrates augmented reality to layer navigational directions and driver assistance tips onto the windshield. It also does away with the conventional dashboard “gauge cluster,” projecting digital equivalents onto the windshield that can be customized. The setup is powered by the new BMW Operating System X and will be introduced in all new BMW models from the end of 2025. Continue reading CES: BMW iDrive Turns the Car Windshield into an AR Display
By
Paula ParisiDecember 10, 2024
Meta Platforms has packed more artificial intelligence into a smaller package with Llama 3.3, which the company released last week. The open-source large language model (LLM) “improves core performance at a significantly lower cost, making it even more accessible to the entire open-source community,” Meta VP of Generative AI Ahmad Al-Dahle wrote on X social. The 70 billion parameter text-only Llama 3.3 is said to perform on par with the 405 billion parameter model that was part of Meta’s Llama 3.1 release in July, with less computing power required, significantly lowering its operational costs. Continue reading Meta’s Llama 3.3 Delivers More Processing for Less Compute
By
Paula ParisiDecember 4, 2024
Alibaba Cloud has released the latest entry in its growing Qwen family of large language models. The new Qwen with Questions (QwQ) is an open-source competitor to OpenAI’s o1 reasoning model. As with competing large reasoning models (LRMs), QwQ can correct its own mistakes, relying on extra compute cycles during inference to assess its responses, making it well suited for reasoning tasks like math and coding. Described as an “experimental research model,” this preview version of QwQ has 32-billion-parameters and a 32,000-token context, leading to speculation that a more powerful iteration is in the offing. Continue reading Qwen with Questions: Alibaba Previews New Reasoning Model
By
Paula ParisiDecember 3, 2024
Couchbase, the publicly traded data platform for developers, has launched Capella AI Services with the aim of simplifying the process of developing and deploying agentic AI apps for enterprise clients. Capella AI joins the company’s flagship Couchbase Capella cloud data platform. AI offerings include model hosting, automated vectorization, unstructured data preprocessing and AI agent catalog services. Couchbase’s goal is to “allow organizations to prototype, build, test and deploy AI agents” while giving developers control over data across the development lifecycle, including secure data mitigation for large language models running outside the organization. Continue reading Couchbase Capella AI Helps Deploy Agents, Models, Services
By
Paula ParisiNovember 27, 2024
Nvidia has unveiled an AI sound model research project called Fugatto that “can create any combination of music, voices and sounds” based on text and audio inputs. Described by Nvidia as “the world’s most flexible sound machine,” many appear to agree that the new model represents an audio breakthrough, with the potential to generate a wide array of sounds that have not previously existed. While popular sound models from companies including Suno and ElevenLabs “can compose a song or modify a voice, none have the dexterity of the new offering,” Nvidia claims. Continue reading Nvidia AI Model Fugatto a Breakthrough in Generative Sound
By
Paula ParisiNovember 14, 2024
Ernie, the foundation model for Baidu’s generative AI, has been updated with iRAG technology to mitigate visual hallucinations and a no-code tool called Miaoda that creates apps using natural language. The company behind China’s largest search engine says Ernie now handles 1.5 billion daily user queries, up from 50 million circa its March 2023 launch (a 30x increase). Baidu also debuted Ernie-powered smart glasses from its Xiaodu Technology hardware unit. The Xiaodu AI Glasses features built-in voice activation and cameras for taking photos and video. The news was shared at this week’s Baidu World 2024 in Shanghai. Continue reading Baidu’s Ernie AI Gets Improved Text-to-Image and App Builder
By
Paula ParisiNovember 5, 2024
The Massachusetts Institute of Technology has come up what it thinks is a better way to teach robots general purpose skills. Derived from LLM techniques, the method provides robot intelligence access to an enormous amount of data at once, rather than exposing it to individual programs for specific tasks. Faster and more cost efficient, the approach has been referred to as a “brute force” approach to problem-solving, and machine learners have taken to it in lieu of individualized, task-specific “imitation learning.” Early tests show it outperforming traditional training by more than 20 percent under simulation and real-world conditions. Continue reading MIT Intros LLM-Inspired Teacher for General Purpose Robots
By
Paula ParisiOctober 23, 2024
Microsoft next month moves to public preview with a Copilot Studio feature that lets users create autonomous AI agents. The agents had been in private preview since the spring, and the tech giant’s move to take them public comes after Salesforce launched its own agentic program in September. Microsoft also has plans to add 10 autonomous agents to Dynamics 365, an enterprise suite geared toward resource planning and customer relationship management. Microsoft announced the news this week at its “AI Tour” event in London. Copilot is Microsoft’s branded AI assistant, while Copilot Studio lets people customize their Copilot assistants. Continue reading Microsoft Widens Copilot AI Agent Preview, Adds Templates
By
Paula ParisiOctober 18, 2024
A new LLM framework evaluates how well generative AI models are meeting the challenge of compliance with the legal parameters of the European Union’s AI Act. The free and open-source software is the product of a collaboration between ETH Zurich; Bulgaria’s Institute for Computer Science, Artificial Intelligence and Technology (INSAIT); and Swiss startup LatticeFlow AI. It is being billed as “the first evaluation framework of the EU AI Act for Generative AI models.” Already, it has found that some of the top AI foundation models are falling short of European regulatory goals in areas including cybersecurity resilience and discriminatory output. Continue reading ‘EU AI Act Checker’ Holds Big AI Accountable for Compliance
By
Paula ParisiOctober 18, 2024
Anthropic, maker of the the popular Claude AI chatbot, has updated its Responsible Scaling Policy (RSP), designed and implemented to mitigate the risks of advanced AI systems. The policy was introduced last year and has since been improved, with new protocols added to ensure AI models are developed and deployed safely as they grow more powerful. This latest update offers “a more flexible and nuanced approach to assessing and managing AI risks while maintaining our commitment not to train or deploy models unless we have implemented adequate safeguards,” according to Anthropic. Continue reading Anthropic Updates ‘Responsible Scaling’ to Minimize AI Risks