OpenAI’s Affordable GPT-4.1 Models Place Focus on Coding

OpenAI has launched a new series of multimodal models dubbed GPT-4.1 that represent what the company says is a leap in small model performance, including longer context windows and improvements in coding and instruction following. Geared to developers and available exclusively via API (not through ChatGPT), the 4.1 series comes in three variations: in addition to the flagship GPT‑4.1, GPT‑4.1 mini and GPT‑4.1 nano, OpenAI’s first nano model. Unlike Web-connected models (which have “retrieval-augmented generation,” or RAG) and can access up-to-date information, they are static knowledge models. Continue reading OpenAI’s Affordable GPT-4.1 Models Place Focus on Coding

Deep Cogito Is Out of Stealth with Hybrid Reasoning Models

San Francisco-based AI startup Deep Cogito has released five AI models in preview, making them available under an open-source license agreement. The models come in sizes 3B, 8B, 14B, 32B and 70B, with plans to release 109B, 400B and 671B versions in the weeks and months ahead. As for the current models, “each outperforms the best available open models of the same size, including counterparts from Meta, DeepSeek and Alibaba, across most standard benchmarks,” Deep Cogito claims, noting that the 70B model in particular “outperforms the newly released Llama 4 109B MoE model.” Continue reading Deep Cogito Is Out of Stealth with Hybrid Reasoning Models

Google Debuts Next-Gen Reasoning Models with Gemini 2.5

Google has released what it calls its most intelligent AI model yet, Gemini 2.5. The first 2.5 model release, an experimental version of Gemini 2.5 Pro, is a next-gen reasoning model that Google says outperformed OpenAI o3-mini and Claude 3.7 Sonnet from Anthropic on common benchmarks “by meaningful margins.” Gemini 2.5 models “are thinking models, capable of reasoning through their thoughts before responding, resulting in enhanced performance and improved accuracy,” according to Google. The new model comes just three months after Google released Gemini 2.0 with reasoning and agentic capabilities. Continue reading Google Debuts Next-Gen Reasoning Models with Gemini 2.5

Real-Time Web Access Informs Claude 3.7 Sonnet Responses

Anthropic’s Claude can now search the Internet in real time, allowing it to provide timely and relevant responses that are also more accurate than what the chatbot previously offered, according to the company. Claude incorporates direct citations for its Web-retrieved material, so users can fact-check its sources. “Instead of finding search results yourself, Claude processes and delivers relevant sources in a conversational format.” While this is not exactly groundbreaking — ChatGPT, Grok 3, Copilot, Perplexity and Gemini all have real-time Web retrieval and most include citations — Claude takes a slightly different approach. Continue reading Real-Time Web Access Informs Claude 3.7 Sonnet Responses

Amazon Plans an AI Push with Nova Reasoning Model, Agents

Amazon is ramping up its AI activity, reportedly planning to release its own advanced reasoning model as part of the company’s Nova family. The Nova line was introduced in December at re:Invent and the new addition could debut as early as June. Its reasoning prowess is being compared to the abilities of OpenAI’s o3-mini and DeepSeek-R1. But reports say Amazon is taking the hybrid reasoning approach embraced by Anthropic’s Claude 3.7 Sonnet (Amazon has a 10 percent stake in Anthropic). The e-retail giant is also preparing for an agentic AI push, having established a dedicated unit, reports say. Continue reading Amazon Plans an AI Push with Nova Reasoning Model, Agents

Anthropic Introduces a New Claude Hybrid Reasoning Model

Anthropic has released a new frontier model, Claude 3.7 Sonnet, described as the industry’s first “hybrid AI reasoning model.” The new Claude is different in that it can both respond to questions in real time or, alternatively, “think” about a problem for a prolonged period of time — basically as long as a user would like. Users can choose between “near-instant responses or extended, step-by-step thinking that is made visible to the user” by selecting the appropriate “reasoning” capability for Claude, Anthropic says. Along with the new model, Anthropic is also debuting a command line tool for agentic coding, Claude Code. Continue reading Anthropic Introduces a New Claude Hybrid Reasoning Model