Microsoft’s Copilot AI Assistant Update Adds Voice and Vision

Microsoft announced that its Copilot AI assistant has received a major overhaul, gaining voice and vision capabilities. Copilot also now has a virtual news reader mode to present headlines, as well as the ability to see what you see and to interact in a more conversational manner. Before a general release, these tools will be trialed among a subset of Copilot Pro users “to gather feedback” and make them “better and safer.” Microsoft AI Executive VP and CEO Mustafa Suleyman says the changes herald “a calmer, more helpful and supportive era of technology, quite unlike anything we’ve seen before.”

OpenAI Showcases Latest Updates for Voice, Picture and More

OpenAI unveiled major updates at its DevDay conference with the focus largely on making AI more accessible, efficient and affordable. Included were four innovations: Vision Fine-Tuning in the API, Model Distillation, Prompt Caching and the public beta of Realtime API. The approach underscores OpenAI’s effort to empower its developer ecosystem even as it continues to compete for end-users in the enterprise space. The Realtime API gives developers the option of building “nearly real-time” speech-to-speech app experiences, selecting from among six OpenAI voices. Vision Fine-Tuning for GPT-4o enables customization of the model’s visual understanding of images and text.

Accenture Has Plans for Scaling Enterprise AI with Nvidia Unit

Accenture is forming an internal Nvidia Business Group staffed with 30,000 global employees trained to help clients “reinvent processes and scale enterprise AI adoption with AI agents,” the consulting firm announced. Accenture will also use its AI Refinery platform to help companies customize AI models and agents using the full Nvidia AI stack including AI Foundry, AI Enterprise and Omniverse. “With generative AI demand driving $3 billion in Accenture bookings in its recently closed fiscal year, the new group will help clients lay the foundation for agentic AI functionality,” Accenture said.

MIT Spinoff Liquid Eschews GPTs for Its Fluid Approach to AI

AI startup Liquid, founded by alums of MIT’s Computer Science and Artificial Intelligence Laboratory (CSAIL), has released its first models. Called Liquid Foundation Models, or LFMs, the multimodal family approaches “intelligence” differently than the pre-trained transformer models that dominate the field. Instead, the LFMs take a path of “first principles,” which MIT describes as “the same way engineers build engines, cars, and airplanes,” explaining that the models are large neural networks with computational units “steeped in theories of dynamic systems, signal processing and numeric linear algebra.”

Snapchat: My AI Goes Multimodal with Google Cloud, Gemini

Snap Inc. is leveraging its relationship with Google Cloud to use Gemini for powering generative AI experiences within Snapchat’s My AI chatbot. The multimodal capabilities of Gemini on Vertex AI will greatly increase the My AI chatbot’s ability to understand and operate across different types of information such as text, audio, image, video and code. Snapchatters can use My AI to take advantage of Google Lens-like features, including asking the chatbot “to translate a photo of a street sign while traveling abroad, or take a video of different snack offerings to ask which one is the healthiest option.”

Allen Institute Announces Vision-Optimized Molmo AI Models

The Allen Institute for AI (also known as Ai2, founded by Paul Allen and led by Ali Farhadi) has launched Molmo, a family of four open-source multimodal models. While advanced models “can perceive the world and communicate with us, Molmo goes beyond that to enable one to act in their worlds, unlocking a whole new generation of capabilities, everything from sophisticated web agents to robotics,” according to Ai2. On some third-party benchmark tests, Molmo’s 72 billion parameter model outperforms other open AI offerings and “performs favorably” against proprietary rivals like OpenAI’s GPT-4o, Google’s Gemini 1.5 and Anthropic’s Claude 3.5 Sonnet, Ai2 says.

Google Unveils New Updates to Its AI-Powered NotebookLM

Google has updated its AI assistant, NotebookLM, allowing the AI note-taking and research tool to summarize audio files and YouTube videos. First released at the Google I/O developer conference in 2023, NotebookLM even creates sharable AI-generated audio discussions and podcasts. It allows users to upload file formats including PDFs, Google Docs, Google Slides and websites. The items, including text, can be stored in shareable “notebooks,” organizing material in a central location, and users can ask Google’s Gemini AI questions about the notebook material. Initially embraced by students and educators, it has become equally popular among business users.

Runway Launches $5M AI Film Fund as Open Call to Creators

Artificial intelligence platform Runway has launched The Hundred Film Fund to help finance 100 projects that use its AI to tell stories. Created by the company through its Runway Studios, the Fund is starting with $5 million, “with the potential to grow to $10 million.” Runway is presenting the Fund as “an open call to all creative professionals who have AI-augmented film projects in the pre- or post-production phases and are in need of funding.” Directors, producers and screenwriters are among those invited to apply. The program will consider all formats, from features to shorts, documentaries, experimental projects, music videos and more.

Digital Domain Leverages AWS for Its Virtual Human Initiative

Visual effects studio Digital Domain has brought its Autonomous Virtual Human project to Amazon Web Services, which will provide generative AI and machine learning tools and give Digital Domain’s creations and processes a home in the global cloud. The collaboration “aims to propel the evolution and global reach of Digital Domain’s AVH technology and expand its use for multiple industries, including entertainment, gaming, healthcare, hospitality, and commercial applications,” Amazon said in a statement that emphasizes “AWS cloud services, particularly Amazon Bedrock,” as providing the infrastructure and adaptability “to drive AVH’s growth.”

Amazon, Google, Microsoft and OpenAI Join the EU’s AI Pact

The European Commission has released a list of more than 100 companies that have become signatories to the EU’s AI Pact. While Amazon, Google, Microsoft and OpenAI are among them, Apple and Meta are not. The voluntary AI Pact is aimed at eliciting policies on AI deployment during the period before the legally binding AI Act takes full effect. The EU AI Pact focuses on transparency in three core areas: internal AI governance, high-risk AI systems mapping and promoting AI literacy and awareness among staff to support ethical development. It is aimed at “relevant stakeholders,” across industry, civil society and academia.

Meta Unveils New Open-Source Multimodal Model Llama 3.2

Meta’s Llama 3.2 release includes two new multimodal LLMs, one with 11 billion parameters and one with 90 billion (considered small- and medium-sized), and two lightweight, text-only models (1B and 3B) that fit onto edge and mobile devices. Included are pre-trained and instruction-tuned versions. In addition to text, the multimodal models can interpret images, supporting apps that require visual understanding. Meta says the models are free and open source. Alongside them, the company is releasing “the first official Llama Stack distributions,” enabling “turnkey deployment” with integrated safety.

OpenAI Rolls Out Advanced Voice Mode Feature for ChatGPT

As OpenAI gears up to become a for-profit company next year, it is releasing ChatGPT Advanced Voice Mode, which brings a humanlike conversation mode to ChatGPT 4o. All U.S. subscribers to ChatGPT Plus and Team plans will gain access to the new feature, which will also be made available to those paying for ChatGPT Edu and Enterprise plans in the coming weeks. The firm is also adding five new voices and allowing customers to save personalized instructions for the voice assistant, including memory behaviors. Concurrently, executives including CTO Mira Murati have resigned as the company pivots to commerciality.

New Microsoft Safety Tools Fix AI Flubs, Detect Proprietary IP

Microsoft has released a suite of “Trustworthy AI” features that address concerns about AI security and reliability. The four new capabilities include Correction, a content detection upgrade in Microsoft Azure that “helps fix hallucination issues in real time before users see them.” Embedded Content Safety allows customers to embed Azure AI Content Safety on devices where cloud connectivity is intermittent or unavailable, while two new filters flag AI output of protected material. Additionally, a transparency safeguard providing the company’s AI assistant, Microsoft 365 Copilot, with specific “web search query citations” is coming soon.

Cloudflare Tool Can Prevent AI Bots from Scraping Websites

Cloudflare has released AI Audit, a free set of new tools designed to help websites analyze and control how their content is used by artificial intelligence models. Cloudflare describes the tools as offering “one-click blocking” to prevent unauthorized AI scraping, and says they will also make it easier for site owners to identify the content bots scan most, so that owners can wall it off and negotiate payment in exchange for access. Helping its clients toward a sustainable future, Cloudflare is also creating a marketplace for sites to negotiate fees based on AI audits that trace cyber footprints on server files.

Alibaba Cloud Ups Its AI Game with 100 Open-Source Models

Alibaba Cloud last week released more than 100 new open-source variants of its large language foundation model, Qwen 2.5, to the global open-source community. The company has also revamped its proprietary offering as a full-stack AI-computing infrastructure across cloud products, networking and data center architecture, all aimed at supporting the growing demands of AI computing. Alibaba Cloud’s significant contribution was revealed at the Apsara Conference, the annual flagship event held by the cloud division of China’s e-retail giant, often referred to as the Chinese Amazon.