Alibaba’s Latest Vision Model Has Advanced Video Capability

China’s largest cloud computing company, Alibaba Cloud, has released a new vision-language model, Qwen2-VL, which the company says improves on its predecessor in visual understanding, including video comprehension and reading text within images in English, Japanese, French, Spanish, Chinese and other languages. The company says the model can analyze videos more than 20 minutes long and respond appropriately to questions about their content. Third-party benchmark tests compare Qwen2-VL favorably to leading competitors, and the company is releasing two open-source versions, with a larger private model to come.

Mistral, Nvidia Bring Enterprise AI to Desktop with NeMo 12B

Nvidia and French startup Mistral AI are jointly releasing a new language model called Mistral NeMo 12B that brings enterprise AI capabilities to the desktop without the need for major cloud resources. Developers can easily customize and deploy the new LLM for applications supporting chatbots, multilingual tasks, coding and summarization, according to Nvidia. “NeMo 12B offers a large context window of up to 128k tokens,” explains Mistral, adding that “its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category.” The model is available under the Apache 2.0 license and, according to Mistral, is easy to adopt as a drop-in replacement for Mistral 7B.

Firebase Genkit: Developer Framework for AI-Powered Apps

Google is offering developers a toolkit for incorporating generative AI features into mobile and web applications. Firebase Genkit, an open-source framework, is available now in beta. By combining models, cloud services, agents, data sources and more in the “code-centric approach” developers are used to, Genkit makes building and debugging AI features easier, according to Google. The first release targets JavaScript and TypeScript developers, opening AI-powered app development to professionals who specialize in server-side applications on the Node.js JavaScript runtime.