OpenAI Previews Two New Reasoning Models: o3 and o3-Mini

OpenAI has unveiled a new frontier model, OpenAI o3, which it claims can “reason” through challenges involving math, science and computer programming. Currently open to safety and research testers, it is expected to become available to individuals and businesses this year. OpenAI o3 is said to be over 20 percent more efficient at common programming tasks than its predecessor OpenAI o1 and beat a company scientist on a programming test. The o3 model is part of a broader effort to create AI systems that can reason through complex problems. In late December Google debuted a similar platform, the experimental Gemini 2.0 Flash Thinking Mode.

Hume AI Introduces Voice Control and Claude Interoperability

Artificial voice startup Hume AI has had a busy Q4, introducing Voice Control, a no-code artificial speech interface that gives users control over 10 voice dimensions ranging from “assertiveness” to “buoyancy” and “nasality.” The company also debuted an interface that “creates emotionally intelligent voice interactions” with Anthropic’s foundation model Claude, prompting one observer to ponder whether keyboards will become a thing of the past for controlling computers. Both advances expand on Hume’s work with its own foundation model, Empathic Voice Interface 2 (EVI 2), which adds emotional timbre to AI voices.

Qwen with Questions: Alibaba Previews New Reasoning Model

Alibaba Cloud has released the latest entry in its growing Qwen family of large language models. The new Qwen with Questions (QwQ) is an open-source competitor to OpenAI’s o1 reasoning model. As with competing large reasoning models (LRMs), QwQ can correct its own mistakes, relying on extra compute cycles during inference to assess its responses, making it well suited for reasoning tasks like math and coding. Described as an “experimental research model,” this preview version of QwQ has 32 billion parameters and a 32,000-token context, leading to speculation that a more powerful iteration is in the offing.

Baidu’s Ernie AI Gets Improved Text-to-Image and App Builder

Ernie, the foundation model for Baidu’s generative AI, has been updated with iRAG technology to mitigate visual hallucinations and a no-code tool called Miaoda that creates apps using natural language. The company behind China’s largest search engine says Ernie now handles 1.5 billion daily user queries, up from 50 million circa its March 2023 launch (a 30x increase). Baidu also debuted Ernie-powered smart glasses from its Xiaodu Technology hardware unit. The Xiaodu AI Glasses feature built-in voice activation and cameras for taking photos and video. The news was shared at this week’s Baidu World 2024 in Shanghai.

Databricks Previews Toolkit for Internal Data, AI App Creation

Databricks Apps is a new platform designed to make building internal data and AI applications something that can be done in a few clicks. Available now in public preview on AWS and Azure, the template-based system lets users weave data and frameworks of choice into full-featured apps that can run in the Databricks environment. The company says the system can code and deploy a secure data app with AI integration in five minutes. “Ideal use cases include data visualization, AI applications, self-service analytics and data quality monitoring,” according to the San Francisco-based company.

Alibaba Cloud Ups Its AI Game with 100 Open-Source Models

Alibaba Cloud last week released more than 100 new open-source variants of its large language foundation model, Qwen 2.5, to the global open-source community. The company has also revamped its proprietary offering as a full-stack AI-computing infrastructure across cloud products, networking and data center architecture, all aimed at supporting the growing demands of AI computing. The release was announced at the Apsara Conference, the annual flagship event held by the cloud division of China’s e-retail giant, often referred to as the Chinese Amazon.

New AI Coding App Cursor Gains Following and $60M in Funds

An AI-powered coding app called Cursor is building a fanbase, with everyone from hobbyists to engineers subscribing to the service. The platform reportedly has 30,000 paying customers, among them employees at OpenAI, Midjourney and Perplexity. Referred to as “the ChatGPT of coding,” Cursor uses popular models including GPT-4o and Claude 3.5 Sonnet to automate building apps and other coding tasks. Cursor was launched by two-year-old startup Anysphere, which has raised more than $60 million in Series A funding led by Andreessen Horowitz and Thrive Capital.

Airtable Enters No-Code Enterprise App Space with Cobuilder

Airtable, a 10-year-old firm focused on customized apps, is launching Cobuilder, which uses AI to turn a concept into a customizable application “in seconds,” without the need for human coding. The debut adds to a rapidly expanding field of no-code platforms that help non-technical types develop software suitable for enterprise use. “Within the next five years, teams will build the vast majority of applications in-house, customizing them to transform their most critical workflows,” predicts Airtable co-founder and CEO Howie Liu. “To get there, knowledge workers who are closest to the work need to be empowered to build.”

Mistral, Nvidia Bring Enterprise AI to Desktop with NeMo 12B

Nvidia and French startup Mistral AI are jointly releasing a new language model called Mistral NeMo 12B that brings enterprise AI capabilities to the desktop without the need for major cloud resources. Developers can easily customize and deploy the new LLM for applications supporting chatbots, multilingual tasks, coding and summarization, according to Nvidia. “NeMo 12B offers a large context window of up to 128k tokens,” explains Mistral, adding that “its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category.” Available under the Apache 2.0 license, it is easy to implement as a drop-in replacement for Mistral 7B.

Figma Redesigns Its User Interface and Adds New AI Features

Figma is rolling out its third redesigned user interface, UI3, aimed at making the company even more competitive with Adobe. New are native AI features that accelerate workflows, letting teams build high-quality software. Available in limited beta, Figma AI adds the ability to generate design drafts with a single prompt, enabling rapid experimentation and prototyping. The move advances Figma’s goal of growing from a design tool into a full-blown product development platform, making the service intuitive and friendly enough for novices while maintaining the full features demanded by Figma’s professional users.

Mistral Development Tool Knows 80 Programming Languages

French startup Mistral AI has released its first large language model for coding. Codestral gives developers looking for a code-native AI tool an alternative to Meta’s Code Llama, Microsoft’s GitHub Copilot and Amazon Q. Fluent in 80 programming languages, including Python, C++ and JavaScript, Codestral can complete code, write tests, and augment partial code “using a fill-in-the-middle mechanism,” while reducing “the risk of errors and bugs,” according to the company. The new LLM is described as open, but its license prohibits commercial use of both Codestral and its outputs.

AWS CodeWhisperer Is Rebranded as Part of Amazon Q Suite

Amazon has pulled the plug on CodeWhisperer, which has been incorporated into its Q Developer product, announced at November’s re:Invent as part of an AI-powered AWS enterprise suite called Amazon Q, which also includes Q Business. Both Q Developer, which enables natural language coding, and Q Business, a data-driven productivity tool, are now in general release, and Q Apps has just been added in preview, letting employees “build generative AI-powered apps from their company’s data, without any prior coding experience.” The move comes as Amazon seeks to gain ground on the Microsoft-owned GitHub’s AI coding products.

GitHub Puts Copilot Workspace Developer Platform in Preview

GitHub has introduced Copilot Workspace, a Copilot-native developer environment for artificial intelligence, in technical preview. Developers are invited to sign up for a waitlist for the service, which allows the use of natural language to plan, build, test and run code. The Microsoft-owned company has introduced various aspects of Copilot over the past few years, adding an autocomplete pair programmer in 2022, and in 2023 Copilot Chat for natural language coding, debugging and testing, “allowing developers to converse with their code in real time.” The “task-centric” Copilot Workspace leverages different agents for a “start-to-finish experience.”

Google Introduces Faster, More Efficient JPEG Coding Library

Google is attacking slow-loading web pages with the new JPEG image encoder/decoder Jpegli, which offers a 35 percent compression ratio improvement at high-quality compression settings, the Alphabet company says. The Jpegli JPEG coding library offers backward compatibility via “a fully interoperable encoder and decoder complying with the original JPEG standard and its most conventional 8-bit formalism, and API/ABI compatibility with libjpeg-turbo and MozJPEG,” Google says. Images compressed using Jpegli are “more precise and psychovisually effective” thanks to computations that make images “look clearer” with “fewer observable artifacts.”

Startup Cognition Launches AI Software Coding Engine Devin

Months-old startup Cognition AI has emerged from stealth mode with Devin, a generative platform it is calling “the world’s first fully autonomous AI software engineer.” Although Cognition has yet to make Devin widely available, much less allow independent testing, if its claims are true it would mark a turning point in the AI coding space, moving it from a field of AI assistants to a full-fledged AI engineer. Based on natural language instruction, Devin could potentially take a project from concept to execution rather than simply suggesting code snippets or offering barebones frameworks.