By Paula Parisi, November 6, 2024
Nvidia’s growing AI arsenal now includes video search and summarization tool AI Blueprint, which helps developers build visual AI agents that analyze video and image content. The agents can answer user questions, generate summaries and even enable alerts for specific scenarios. The new feature is part of Metropolis, Nvidia’s developer toolkit for building computer vision applications using generative AI. Globally, enterprises and public organizations increasingly rely on visual information. Cameras, IoT sensors and autonomous vehicles are ingesting visual data at high rates, and visual agents can help monitor and make sense of that workflow. Continue reading Nvidia’s AI Blueprint Develops Agents to Analyze Visual Data
By Paula Parisi, October 1, 2024
The Allen Institute for AI (also known as Ai2, founded by Paul Allen and led by Ali Farhadi) has launched Molmo, a family of four open-source multimodal models. While advanced models “can perceive the world and communicate with us, Molmo goes beyond that to enable one to act in their worlds, unlocking a whole new generation of capabilities, everything from sophisticated web agents to robotics,” according to Ai2. On some third-party benchmark tests, Molmo’s 72 billion parameter model outperforms other open AI offerings and “performs favorably” against proprietary rivals like OpenAI’s GPT-4o, Google’s Gemini 1.5 and Anthropic’s Claude 3.5 Sonnet, Ai2 says. Continue reading Allen Institute Announces Vision-Optimized Molmo AI Models
By Paula Parisi, August 9, 2024
Robotics startup Figure AI — with investors including OpenAI, Nvidia and Microsoft — has released its next-gen humanoid, Figure 02. Its predecessor made a splash earlier this year with a demo that captured it conversing with an interlocutor as it organized household items and prepared a snack. Compared to the Figure 01 prototype, with exposed wiring and limited range of motion, Figure 02 is more polished. The latest iteration boasts skeletal improvements for heavier lifting as well as enhanced visual reasoning to assist with machine learning. The result is characterized as “a major leap” in AI-powered robotics, a category in which players include Tesla and 1X Technologies. Continue reading Humanoid Robot Figure 02 Touts Better Strength, Reasoning
By ETCentric Staff, March 18, 2024
Robotics firm Figure AI is getting a lot of attention for its humanoid robot, Figure 01, which the company unveiled along with news that it has raised $675 million, at a $2.6 billion valuation, from investors including OpenAI, Nvidia, Microsoft and Amazon founder Jeff Bezos. Pronounced “Figure One,” the general-purpose robot looks and moves like a human, and can perform mundane tasks like serving food as well as undesirable jobs like picking up trash. It “sees” using “onboard cameras that feed into a large vision-language model (VLM) trained by OpenAI,” according to Figure co-founder and CEO Brett Adcock. Continue reading Figure Unveils Humanoid Robot, Draws Notable Investments