Microsoft Small Language Models Are Ideal for Smartphones

Microsoft, which has been developing small language models (SLMs) for some time, has announced its most capable SLM family, Phi-3. SLMs can accomplish some of the same functions as large language models (LLMs), but are smaller and trained on less data. That smaller footprint makes them well suited to run in a local environment, which means they’re ideal for smartphones, where in theory they would not even need an Internet connection to run. Microsoft claims the Phi-3 open models can outperform “models of the same size and next size up across a variety of benchmarks that evaluate language, coding and math capabilities.”

Meta AI Assistant Is Launching Across Platforms with Llama 3

Pursuant to his goal of “building the world’s leading AI,” Meta Platforms CEO Mark Zuckerberg announced Friday that Meta AI is upgrading to Llama 3 concurrent with a rollout of its open-source chatbot across the company’s social platforms, integrating it into the search boxes atop WhatsApp, Instagram, Facebook and Messenger. There is also a website, meta.ai, for those who prefer browser access. Reports of Meta upgrading its social AI capabilities began leaking out early last week, albeit on a more limited test scale than what Zuckerberg announced, which, excepting Threads, is cross-platform.

Databricks DBRX Model Offers High Performance at Low Cost

Databricks, a San Francisco-based company focused on cloud data and artificial intelligence, has released a generative AI model called DBRX that it says sets new standards for performance and efficiency in the open source category. The model’s mixture-of-experts (MoE) architecture contains 132 billion parameters and was pre-trained on 12 trillion tokens of text and code data. Databricks says DBRX provides the open community and enterprises that want to build their own LLMs with capabilities previously limited to closed model APIs. Databricks claims DBRX outperforms other open models, including Llama 2-70B and Mixtral, on certain benchmarks.

Grok-1 Architecture Open-Sourced for General Release by xAI

Elon Musk’s xAI has released its Grok chatbot and open-sourced part of the underlying Grok-1 model architecture for any developer or entrepreneur to use for purposes including commercial applications. Musk unveiled Grok in November and announced that it would be publicly released this month. The chatbot itself is available to X social premium members, who can ask the cheeky AI questions and get answers with a snarky attitude inspired by “The Hitchhiker’s Guide to the Galaxy” sci-fi novel. The training for Grok’s foundation LLM is said to include X social posts.

Stability AI Advances Image Generation with Stable Cascade

Stability AI, purveyor of the popular Stable Diffusion image generator, has introduced a completely new model called Stable Cascade. Now in preview, Stable Cascade uses a different architecture than Stable Diffusion’s SDXL that the UK company’s researchers say is more efficient. Cascade builds on a compression architecture called Würstchen (German for “sausage”) that Stability began sharing in research papers early last year. Würstchen is a three-stage process that includes two-step encoding. It uses fewer parameters, meaning less data to train on, greater speed and reduced costs.

Apple Launches Open-Source Language-Based Image Editor

Apple has released MGIE, an open-source AI model that edits images using natural language instructions. MGIE, short for MLLM-Guided Image Editing, can also modify and optimize images. Developed in conjunction with the University of California, Santa Barbara, MGIE is Apple’s first AI model. The multimodal MGIE, which understands text and image input, also crops, resizes, flips and adds filters based on text instructions. Apple says its instruction set is easier than those of other AI editing programs, and that using it is simpler and faster than learning a traditional program like Apple’s own Final Cut Pro.

IBM and Meta Debut AI Alliance for Safe Artificial Intelligence

IBM and Meta Platforms have launched the AI Alliance, a coalition of companies and educational institutions committed to responsible, transparent development of artificial intelligence. The group launched this week with more than 50 global founding participants from industry, startup, academia, research and government. Among the members and collaborators: AMD, CERN, Cerebras, Cornell University, Dell Technologies, Hugging Face, Intel, Linux Foundation, NASA, Oracle, Red Hat, Sony Group, Stability AI, the University of Tokyo and Yale Engineering. The group’s stated purpose is “to support open innovation and open science in AI.”

Stability AI Intros Real-Time Text-to-Image Generation Model

Stability AI, developer of Stable Diffusion (one of the leading visual content generators, alongside Midjourney and DALL-E), has introduced SDXL Turbo, a new AI model that demonstrates more of the latent possibilities of the common diffusion generation approach: images that update in real time as the user’s prompt updates. This feature was always a possibility even with previous diffusion models, given that text and images are comprehended differently across linear time, but the increased efficiency of generation algorithms and the steady accretion of GPUs and TPUs in a developer’s data center make the experience more magical.

Google, Microsoft, Sony Tapped for UN AI Governance Board

The United Nations has formed an advisory board on artificial intelligence composed of 39 members from government, academia and industry who will “undertake analysis and advance recommendations for the international governance of AI.” The move comes as U.S. legislators and tech industry players are also prioritizing model governance. “Globally coordinated AI governance is the only way to harness AI for humanity while addressing its risks and uncertainties,” the UN announced in unveiling the initiative, co-chaired by Carme Artigas, Spain’s secretary of state for digitalization and AI, and James Manyika, SVP of research, technology and society at Google.

Big Tech Firms Propel Hugging Face to $4.5 Billion Valuation

Hugging Face has collected $235 million in an investment round that includes contributions from Amazon, IBM, Google, Nvidia, Salesforce, AMD, Intel and Qualcomm. The New York-based startup creates and distributes open-source tools for artificial intelligence development, carving an AI-centric niche similar to the more general programming approach taken by the Microsoft-owned GitHub. The incoming cash infusion, earmarked for talent recruitment, gives Hugging Face a lofty $4.5 billion valuation that experts say indicates momentum for open source in what has to date been a highly competitive AI sector.

Meta Unveils Llama 2 LLM with Microsoft as Preferred Partner

This week, Meta Platforms released Llama 2, the next generation of its open-source large language model that is free for research and commercial use. Llama 2’s pretrained and fine-tuned language models are available in sizes ranging from 7 to 70 billion parameters. Meta also named Microsoft Azure its “preferred partner for Llama 2,” offering it through the Azure AI model catalog for use with cloud-native tools that leverage content filtering and safety features. Meta says Llama 2 is “also optimized to run locally on Windows,” providing developers a seamless workflow across enterprise and consumer platforms.

Meta’s MusicGen AI Works with Language and Song Prompts

Meta Platforms has debuted what’s being called “ChatGPT for audio.” MusicGen is an AI music generator that can create tunes from natural language or song snippets. The company says MusicGen was trained on 20,000 hours of music, including 10,000 hours of “high-quality” licensed songs and 390,000 instrumental tracks. Meta released MusicGen on GitHub this past weekend, and is currently demoing the app on Facebook’s Hugging Face page. Visitors can generate tunes by describing the sound they want. Among Meta’s prompts: “80s driving pop song with heavy drums and synth pads in the background.”

IBM Bows Watsonx Suite of Enterprise AI Products, Services

With artificial intelligence development dating back to the 1950s, IBM was clearly ahead of its time. The company has quietly built a commercial portfolio, with more than 100 million customers across 20 industries using its Watson suite, the company says. At its annual Think conference, the company unboxed IBM Watsonx, a next-generation platform that leverages the scale and scope of foundation models to provide custom solutions for data-driven clients. Described as an “enterprise studio for AI builders,” Watsonx is an end-to-end framework that combines the tools, infrastructure and consulting expertise corporations can use to onboard AI.

Stability AI Debuts Open Source StableLM Foundation Model

Stability AI has released StableLM, an open source language model that will compete with OpenAI’s GPT-4 to create apps like ChatGPT. The Alpha version of StableLM is available in 3 billion and 7 billion parameter versions, and the company promises 15 billion to 65 billion parameter models to come. “With the launch of the StableLM suite of models, Stability AI is continuing to make foundational AI technology accessible to all,” the London-based company said. The StableLM models can generate text and code to power various downstream applications with appropriate training.

Report: Enterprise Supplants Academia as Driving Force of AI

After many years of academia leading the way in the development of artificial intelligence, the tide has shifted and industry has taken over, according to the 2023 AI Index, a report created by Stanford University with help from companies including Google, Anthropic and Hugging Face. “In 2022, there were 32 significant industry-produced machine learning models compared to just three produced by academia,” the report says. The shift in influence is attributed mainly to the large resource demands (in staff, computing power and training data) required to create state-of-the-art AI systems.