Google Serving Ads in AI Overviews and Lens Search Results

Having demonstrated how advertisements in its AI Overviews would work back in May at its Google Marketing Live event, the search giant is now adding the feature for U.S. mobile users and plans to include Google Lens shopping ads “above and alongside visual search results by the end of the year.” “The ways people ask questions today have expanded beyond the search box,” notes Google, explaining the move as a response to that evolution, as artificial intelligence technology has helped consumers use their voice and cameras “to explore the world around them.”

OpenAI Showcases Latest Updates for Voice, Picture and More

OpenAI unveiled major updates at its DevDay conference with the focus largely on making AI more accessible, efficient and affordable. Included were four innovations: Vision Fine-Tuning in the API, Model Distillation, Prompt Caching and the public beta of Realtime API. The approach underscores OpenAI’s effort to empower its developer ecosystem even as it continues to compete for end-users in the enterprise space. The Realtime API gives developers the option of building “nearly real-time” speech-to-speech app experiences, selecting from among six OpenAI voices. Vision Fine-Tuning for GPT-4o enables customization of the model’s visual understanding of images and text.

Snapchat: My AI Goes Multimodal with Google Cloud, Gemini

Snap Inc. is leveraging its relationship with Google Cloud to use Gemini for powering generative AI experiences within Snapchat’s My AI chatbot. The multimodal capabilities of Gemini on Vertex AI will greatly increase the My AI chatbot’s ability to understand and operate across different types of information such as text, audio, image, video and code. Snapchatters can use My AI to take advantage of Google Lens-like features, including asking the chatbot “to translate a photo of a street sign while traveling abroad, or take a video of different snack offerings to ask which one is the healthiest option.”

OpenAI Rolls Out Advanced Voice Mode Feature for ChatGPT

As OpenAI gears up to become a for-profit company next year, it is releasing ChatGPT Advanced Voice Mode, which brings a humanlike conversation mode to ChatGPT 4o. All U.S. subscribers to ChatGPT Plus and Team plans will gain access to the new feature, which will also be made available to those paying for ChatGPT Edu and Enterprise plans in the coming weeks. The firm is also adding five new voices and allowing customers to save personalized instructions for the voice assistant, including memory behaviors. Concurrently, executives including CTO Mira Murati have resigned as the company pivots toward a more commercial focus.

BlackRock Teams with Microsoft to Advance AI Infrastructure

BlackRock has joined forces with Microsoft to launch what will initially be a $30 billion investment fund to finance AI infrastructure, concentrating primarily on building data centers and developing energy projects. The amount could quickly scale to about $100 billion. Abu Dhabi-based tech investment firm MGX is also participating, as is Global Infrastructure Partners (GIP), which owns, operates and invests across energy, transport, digital and waste management. BlackRock announced it is in the process of acquiring GIP, and says the deal is expected to close next month. The new fund is called Global AI Infrastructure Investment Partnership (GAIIP).

Snapchat Is Getting a Redesign and Generative Text-to-Video

A newly redesigned Snapchat experience is built around a three-tab user interface called Simple Snapchat. As part of that effort, the social platform is launching more generative video features, including text-to-video as part of the app’s Lens Studio AR authoring tool. Easy Lens allows the quick generation of Lenses by typing text prompts, making it possible to do things like experiment with Halloween costumes or explore looks for back to school. Snap says the new features, which are launching in beta for select creators, are designed for all ability levels. The company is also updating its GenAI Suite and adding an Animation Library of “hundreds of high-quality movements.”

Google Begins Rolling Out Gemini Live Free to Android Users

Google announced the company is making its new AI assistant Gemini Live available free to all Android users. The move follows the feature’s release last month to Gemini Advanced subscribers. This general release will occur gradually, and only in English for the time being. Gemini Live lets users have a more natural, free-flowing conversation with their phones than was available through Google Assistant via the “Hey, Google” prompt. Gemini inquiries are meant to be conversational, eliciting a back and forth that queriers can interrupt, adding more detail or veering to another topic entirely.

OpenAI Previews New LLMs Capable of Complex Reasoning

OpenAI is previewing a new series of AI models that can reason and correct complex coding mistakes, providing a more efficient solution for developers. Powered by OpenAI o1, the new models are “designed to spend more time thinking before they respond, much like a person would,” and as a result can “solve harder problems than previous models in science, coding, and math,” OpenAI claims, noting that “through training, they learn to refine their thinking process, try different strategies, and recognize their mistakes.” The first model in the series is being released in preview in OpenAI’s popular ChatGPT and in the company’s API.

New AI Coding App Cursor Gains Following and $60M in Funds

An AI-powered coding app called Cursor is building a fanbase, with everyone from hobbyists to engineers subscribing to the service. The platform reportedly has 30,000 paying customers, among them employees at OpenAI, Midjourney and Perplexity. Referred to as “the ChatGPT of coding,” Cursor uses popular models including GPT-4o and Claude 3.5 Sonnet to automate building apps and other coding tasks. Cursor was launched by two-year-old startup Anysphere, which has raised more than $60 million in Series A funding led by Andreessen Horowitz and Thrive Capital.

Gemini Gets Custom Gems AI Assistants and Adds Imagen 3

Google is giving Gemini Advanced, Enterprise and Business subscribers the ability to create personalized AI assistants, which the company calls “Gems.” “Create your own personal AI experts on any topic you want,” the Alphabet company says. The search giant is also reintroducing Gemini’s image generation capabilities with its latest Imagen 3 model, which will be available to everyone. Gemini, which is Google’s ChatGPT competitor, will again have the ability to generate images of people, something Google disabled in February after controversy over some of the images. The company announced it has implemented new guardrails.

Anthropic Publishes Claude Prompts, Sharing How AI ‘Thinks’

In a move toward increased transparency, San Francisco-based AI startup Anthropic has published the system prompts for three of its most recent large language models: Claude 3 Opus, Claude 3.5 Sonnet and Claude 3 Haiku. The information is now available on the web and in the Claude iOS and Android apps. The prompts are instruction sets that reveal what the models can and cannot do. Anthropic says it will regularly update the information, emphasizing that evolving system prompts do not affect the API. Examples of Claude’s prompts include “Claude cannot open URLs, links, or videos” and, when dealing with images, “avoid identifying or naming any humans.”

OpenAI Pushes GPT-4o Customization with Free Token Offer

OpenAI announced its newest model, GPT-4o, can now be customized. The company said that the ability to fine-tune the multimodal GPT-4o has been “one of the most requested features from developers.” Customization can move the model toward more specific structure and tone of responses or allow it to follow specific instruction sets geared toward individual use cases. Developers can now implement custom datasets, aiming for better performance at a lower cost. The ChatGPT maker is rolling out the welcome mat by offering 1 million training tokens per day “for free for every organization” through September 23.

Story Raises $80M to Create Blockchain-Based IP Protection

Palo Alto-based startup PIP Labs announced an $80 million funding round for Story Protocol, a blockchain platform to track intellectual property rights in the era of artificial intelligence and the data scraping that enables model training. CEO and co-founder Seung Yoon “SY” Lee says the company aims to create a more sustainable IP environment for digital consumers and builders. The raise, led by Andreessen Horowitz (a16z) and Polychain Capital, values the startup at $2.25 billion. The move comes after Sahara AI announced it raised $43 million this month to fund a blockchain-based IP tracking system.

AMD Buying ZT Systems to Expand Data Center Capabilities

California-based semiconductor manufacturer AMD is looking to take on Nvidia for a bigger share of business from the artificial intelligence boom. AMD plans to purchase data center equipment maker ZT Systems in a cash and stock deal that values the company at $4.9 billion. The deal, which is subject to regulatory approval, is part of AMD’s goal of offering a wider selection of chips, software and system designs to big data enterprise clients such as Microsoft, Google, Meta Platforms and Apple. Privately held ZT Systems, based in New Jersey, makes gear and server solutions for cloud computing and related infrastructure.

Google Rolls Out Its Gemini Live, Challenging ChatGPT Voice

Google has released its AI assistant, Gemini Live, and is positioning it to replace Google Assistant on mobile. Gemini Live is rolling out on Android to subscribers of Gemini Advanced, which is part of the $20 monthly Google One AI Premium plan. Consumers who purchase the new Pixel 9 Pro, which begins shipping this week, will get the assistant as part of a year of free access to Gemini Advanced, a $240 value, according to the company. Google claims that Gemini Live technology enables natural, flowing conversations with the AI assistant, putting “a sidekick in your pocket.”