GTC: Nvidia Unveils Blackwell GPU for Trillion-Parameter LLMs

Nvidia unveiled what it is calling the world’s most powerful AI processing system, the Blackwell GPU, purpose built to power real-time generative AI on trillion-parameter large language models at what the company says will be up to 25x less cost and energy consumption than its predecessors. Blackwell’s capabilities will usher in what the company promises will be a new era in generative AI computing. News from Nvidia’s GTC 2024 developer conference included the NIM software platform, purpose built to streamline the setup of custom and pre-trained AI models in a production environment, and the DGX SuperPOD server, powered by Blackwell.

Apple Unveils Progress in Multimodal Large Language Models

Apple researchers have gone public with new multimodal methods for training large language models using both text and images. The results are said to enable AI systems that are more powerful and flexible, which could have significant ramifications for future Apple products. These new models, which Apple calls MM1, support up to 30 billion parameters. The researchers identify multimodal large language models (MLLMs) as “the next frontier in foundation models,” which exceed the performance of LLMs and “excel at tasks like image captioning, visual question answering and natural language inference.”

Grok-1 Architecture Open-Sourced for General Release by xAI

Elon Musk’s xAI has released its Grok chatbot and open-sourced part of the underlying Grok-1 model architecture for any developer or entrepreneur to use for purposes including commercial applications. Musk unveiled Grok in November and announced that it would be publicly released this month. The chatbot itself is available to X social premium members, who can ask the cheeky AI questions and get answers with a snarky attitude inspired by “The Hitchhiker’s Guide to the Galaxy” sci-fi novel. The training for Grok’s foundation LLM is said to include X social posts.

Meta Building Giant AI Model to Power Entire Video Ecosystem

Facebook chief Tom Alison says parent company Meta Platforms is building a giant AI model that will eventually “power our entire video ecosystem.” Speaking at the Morgan Stanley Technology, Media & Telecom Conference this week, Alison said the model will drive the company’s video recommendation engine across all platforms that host long-form video as well as the short-form Reels, which are limited to 90 seconds. Alison said the company began experimenting with the new, super-sized AI model last year and found that it helped improve Facebook’s Reels watch time by 8 to 10 percent.

Anthropic’s Claude 3 AI Is Said to Have ‘Near-Human’ Abilities

Anthropic has released Claude 3, claiming new industry benchmarks that see the family of three new large language models approaching “near-human” cognitive capability in some instances. Accessible via Anthropic’s website, the three new models (Claude 3 Haiku, Claude 3 Sonnet and Claude 3 Opus) represent successively increased complexity and parameter count. Sonnet is powering the current Claude.ai chatbot and is free, for now, requiring only an email sign-in. Opus comes with the $20 monthly subscription for Claude Pro. Both are generally available from the Anthropic website and via API in 159 countries, with Haiku coming soon.

France’s Mistral AI Makes Its Global Debut on Microsoft Azure

Paris-based startup Mistral AI has made an immediate splash in the world of artificial intelligence, securing partnerships with IBM, Microsoft and others nine months after its launch. The company is offering natural language processing models, including its flagship Mistral Large, which becomes only the second LLM (after OpenAI) to land a commercial berth on Microsoft’s Azure cloud, where Meta Platforms’ Llama 2 is available in preview. Boasting “top-tier reasoning capacities” and sophisticated conversational capabilities, Mistral Large specializes in “reasoning, analysis and generation (RAG), is multilingual and supports up to 32,000 tokens.”

MWC: Qualcomm Unveils AI Hub and Promotes 5G, 6G Tech

Qualcomm raised the curtain on a variety of artificial intelligence, 5G, and Wi-Fi technologies at Mobile World Congress Barcelona, which runs through Thursday. The San Diego-based chip designer unveiled an AI Hub it says will help developers create voice-, text- and image-based applications using pre-optimized AI models. Qualcomm’s flagship AI chips, the mobile Snapdragon 8 Gen 3 processor and the PC-centric Snapdragon X Elite, were announced last year. With the first splash of products now heading to market, the company is promising to push the boundaries of 5G and 6G.

Reddit Announces IPO on Heels of Expanded Deal with Google

Community message board and social news aggregator Reddit, founded in 2005, has filed to go public on the New York Stock Exchange in an IPO observers say may be complete in a matter of weeks. It is the first social media company to go public in many years, with Snap Inc.’s 2017 offering cited as the most recent stock market splash. Reddit’s bankers are reportedly seeking a $5 billion valuation, about half the $10 billion it was valued at for a 2021 private funding round. Reddit filed with the SEC the same day it announced an “expanded partnership” with Google to use Vertex AI.

Google Targets Global Security with AI Cyber Defense Initiative

Google has unveiled a new policy, the AI Cyber Defense Initiative, designed to harness the power of artificial intelligence to improve global cybersecurity defenses. The proposed policy aims to counteract rapidly evolving threats by using AI to improve threat detection, automate vulnerability management and enhance incident response effectiveness. The Alphabet company introduced its new plan at the Munich Security Conference, where it also announced it has a pool of $2 million to award businesses and academic institutions for research initiatives involving large language models, code verification and other AI uses for cyber offense and defense.

Amazon Claims ‘Emergent Abilities’ for Text-to-Speech Model

Researchers at Amazon have trained what they are calling the largest text-to-speech model ever created, which they claim is exhibiting “emergent” qualities: the ability to inherently improve itself at speaking complex sentences naturally. Called BASE TTS, for Big Adaptive Streamable TTS with Emergent abilities, the new model could pave the way for more human-like interactions with AI, reports suggest. Trained on 100,000 hours of public domain speech data, BASE TTS offers “state-of-the-art naturalness” in English as well as some German, Dutch and Spanish. Text-to-speech models are used in developing voice assistants for smart devices and apps, and for accessibility.

Apple’s Keyframer AI Tool Uses LLMs to Prototype Animation

Apple has taken a novel approach to animation with Keyframer, using large language models to add motion to static images through natural language prompts. “The application of LLMs to animation is underexplored,” Apple researchers say in a paper that describes Keyframer as an “animation prototyping tool.” Based on input from animators and engineers, Keyframer lets users refine their work through “a combination of prompting and direct editing,” the paper explains. The LLM can generate CSS animation code. Users can also use natural language to request design variations.

Slack AI Brings Generative Features to Channels and Threads

Slack AI is a new paid add-on for enterprise clients that want to boost productivity using artificial intelligence. Generative capabilities in the initial release include personalized responses to questions, channel recaps and thread summaries that promise to “catch you up on long conversations in one click.” Slack says pilot data indicated customers including Uber and Anthropic “could save an average of 97 minutes per user each week using Slack AI to find answers, distill knowledge and spark ideas.” Slack AI is backward compatible, generating information based on the history built over time on the platform.

Apple Launches Open-Source Language-Based Image Editor

Apple has released MGIE, an open-source AI model that edits images using natural language instructions. MGIE, short for MLLM-Guided Image Editing, can also modify and optimize images. Developed in conjunction with University of California Santa Barbara, MGIE is Apple’s first AI model. The multimodal MGIE, which understands text and image input, also crops, resizes, flips, and adds filters based on text instructions using what Apple says is an easier instruction set than other AI editing programs, and is simpler and faster than learning a traditional program, like Apple’s own Final Cut Pro.

Yelp Adds 20 Features Plus AI to Help Users and Businesses

Yelp is introducing more than 20 new updates to improve the experience for community members and business owners. Included are AI-powered summaries that make it easier to find businesses, an updated Yelp Elite badge for reviewers who are passionate about specific subjects, and a new visual home feed and search experience geared toward discovery. For those seeking services, the new “Request a Quote” and “Projects” features are available. Artificial intelligence will also power market and competitive insights for business owners, while AI-powered smart budgets provide recommendations to optimize ad spend, “helping local businesses grow.”

Conversational Chatbot Optimizes Google Ads, Search Results

Google’s multimodal Gemini large language model will offer chat capabilities that help advertisers build and scale Search campaigns within the Google Ads platform using natural language prompts. “We’ve been actively testing Gemini to further enhance our ads solutions, and, we’re pleased to share that Gemini is now powering the conversational experience,” Google said, explaining that the functionality is now available in beta to English language advertisers in the U.S. and UK and will roll out globally to all English language advertisers over the next few weeks, with additional languages offered in the months ahead.