Gemini Gets Custom Gems AI Assistants and Adds Imagen 3

Google is giving Gemini Advanced, Enterprise and Business subscribers the ability to create personalized AI assistants, which the company calls “Gems.” “Create your own personal AI experts on any topic you want,” the Alphabet company says. The search giant is also reintroducing Gemini’s image generation capabilities with its latest Imagen 3 model, which will be available to everyone. Gemini, which is Google’s ChatGPT competitor, will again have the ability to generate images of people, something Google disabled in February after controversy over some of the images. The company announced it has implemented new guardrails.

Anthropic Publishes Claude Prompts, Sharing How AI ‘Thinks’

In a move toward increased transparency, San Francisco-based AI startup Anthropic has published the system prompts for three of its most recent large language models: Claude 3 Opus, Claude 3.5 Sonnet and Claude 3 Haiku. The information is now available on the web and in the Claude iOS and Android apps. The prompts are instruction sets that reveal what the models can and cannot do. Anthropic says it will regularly update the information, emphasizing that evolving system prompts do not affect the API. Examples of Claude’s prompts include “Claude cannot open URLs, links, or videos” and, when dealing with images, “avoid identifying or naming any humans.”

ElevenLabs Reader App Is Available Globally in 32 Languages

New York-based ElevenLabs is going global with its generative AI text-to-speech reader app, which can narrate writings in 32 languages with thousands of voices from which to choose. The audio startup promises “high quality, human-like” AI voices that are “emotionally and contextually aware,” adapting delivery of written cues “to achieve a high emotional range.” ElevenLabs has focused on “creative workflow,” with a voice isolator and audio effects generator tools. Its catalog includes the voices of celebrities Judy Garland, Laurence Olivier, James Dean and Burt Reynolds. Custom models for translation and voiceover work using contemporary actors are a future possibility.

Bill Mandating GenAI Watermarks Gains Support in California

Adobe, OpenAI and Microsoft are among the major firms backing a California bill that would require tech companies to label AI-generated content with watermarks embedded in the metadata. Such data is easily accessible via browser for material circulated on the Internet, and the initiative would likely involve a campaign to educate the general public on how to find it. The proposed law encompasses video and audio as well as images. The three companies currently supporting the bill initially opposed it, using terms like “unworkable” and “overly burdensome.”

Dropbox Acquires Productivity and Scheduling App Reclaim.ai

Dropbox has purchased Reclaim.ai, a scheduling tool that uses artificial intelligence to boost productivity and is popular with Google Calendar users. The privately held Reclaim announced the deal in a blog post that claims a global user base of over 43,000 companies and more than 320,000 people. Launched in 2019, Reclaim counts Index Ventures and Calendly among its investors and has raised more than $9.5 million to date. File-sharing app Dropbox has been publicly traded since 2018 and has a current market cap of $7.92 billion. Financial terms of the deal have yet to be disclosed.

SAG-AFTRA Strikes a Deal with Narrativ for AI Voice Replicas

SAG-AFTRA announced it is teaming with online talent marketplace Narrativ to provide the guild’s 160,000 members with the option of working with the New York-based AI startup to license their voice replicas for use in digital audio advertising. The deal would make it easy for voice actors to be considered for replicant work and get compensated, according to SAG-AFTRA, which emphasizes that performers will control the particulars, including whether to make their voices available, brand approval and fees. Narrativ also represents visual likenesses, but the SAG-AFTRA announcement is limited to voice work.

D-ID Employs AI to Translate Videos into Multiple Languages

D-ID, a platform that uses AI to generate digital humans, has announced D-ID Video Translate in general availability. The tool lets businesses and content creators automatically re-voice videos in multiple languages, “cloning the speaker’s voice and adapting their lip movements from a single upload.” D-ID is making the Video Translate tool, which accommodates 30 different languages, free to D-ID subscribers for a limited time, available through the D-ID Studio or the company’s API. Languages include Arabic, Mandarin, Japanese, Hindi and Ukrainian, in addition to Spanish, German, French and Italian. Users can simultaneously translate content using bulk translation.

Google DeepMind Releases Imagen 3 for Free to U.S. Users

Google DeepMind has made its latest AI image generator, Imagen 3, free for use in the U.S. via the company’s ImageFX platform. Imagen 3 will be available in multiple versions, “each optimized for different types of tasks, from generating quick sketches to high-resolution images.” Google announced Imagen 3 at Google I/O in May, and in June made it available to enterprise users through Vertex. Using simplified natural language text input rather than “complex prompt engineering,” Google says Imagen 3 generates high-quality images in a range of styles, from photorealistic, painterly and textured to whimsically cartoony.

ByteDance Intros Jimeng AI Text-to-Video Generator in China

ByteDance has debuted a text-to-video mobile app in its native China that is available on the company’s TikTok equivalent there, Douyin. There is speculation that the app, called Jimeng AI, will be coming to North America and Europe soon via TikTok or ByteDance’s CapCut editing tool, possibly beating competing U.S. technologies like OpenAI’s Sora to market. Jimeng (translation: “dream”) uses text prompts to generate short videos. For now, its responsiveness is limited to prompts written in Chinese. In addition to entertainment, the app is described as applicable to education, marketing and other purposes.

xAI’s Grok-2 Generates Realistic Images with Few Guardrails

Grok-2 and Grok-2 mini, the latest generative chatbots from Elon Musk’s xAI, create images with seemingly few guardrails. Early pictures of notable personalities such as Bill Gates, Donald Trump and Kamala Harris in questionable or compromising settings may not appear photorealistic to a trained eye, but they are still described in many cases as quite realistic. Powered by the FLUX.1 AI model from Black Forest Labs, Grok-2 and Grok-2 mini are available in beta on the X social platform for Premium and Premium+ subscribers and will be coming to xAI’s enterprise API later this month, according to the company.

WordPress Introduces AI Assistant to Help Users with Writing

WordPress parent Automattic has launched Write Brief with AI to help make documents more concise. Available for free to WordPress.com users, Write Brief with AI measures “readability,” suggests edits and will even make them for you. It identifies complex words, offers alternatives and focuses on simplifying convoluted sentences, all from within the editor function in the WordPress dashboard. Write Brief with AI is now built into Jetpack for those who host through WordPress.com. It is available only in English, though the company says it is working to expand language support.

FTC Rule Takes Aim at Fake Reviews, Influence Manipulators

In a unanimous vote, the Federal Trade Commission has banned the use of fake reviews, such as those generated by artificial intelligence, and has also prohibited reviews or testimonials that are paid for, even if written by humans. The new rule, finalized Wednesday, also reins in other deceptive practices, like paying for fake social media followers, in an effort to stem misleading practices that are increasingly used by marketers. Generative AI has made manufactured reviews easily available, though the agency’s readiness to seek fines against knowing violators may make fabulists think twice before using them.

Meta, Oxford Advance 3D Object Generation with VFusion3D

VFusion3D is the latest AI model unveiled by Meta Platforms, which developed it in conjunction with the University of Oxford. The powerful model, which uses single-perspective images or text prompts to generate high-quality 3D objects, is being hailed as a breakthrough in scalable 3D AI that can potentially transform sectors including VR, gaming and digital design. The platform tackles the challenge of scarce 3D training data in a world teeming with 2D images and text descriptions. The VFusion3D approach leverages what the developers call “a novel method for building scalable 3D generative models utilizing pre-trained video diffusion models.”

YouTube Tests Expanded Community Fact-Checking for Video

YouTube, which began testing crowdsourced fact-checking in June, is now expanding the experiment by inviting users to try the feature. Likened to the Community Notes accountability method introduced by Twitter and continued under X, YouTube’s as yet unnamed feature lets users provide context and corrections to posts that might be misleading or false. “You can sign up to submit notes on videos you find inaccurate or unclear,” YouTube explains, adding that “after submission, your note is reviewed and rated by others.” Notes widely rated as helpful “may be published and appear below the video.”

YouTube Invites Content Creators to ‘Brainstorm with Gemini’

YouTube is testing an integration with parent company Google’s Gemini AI. Called Brainstorm with Gemini, it invites creators to ideate on video ideas, titles and thumbnails. The limited test makes the feature available to a handful of creators whose feedback will be used in strategizing how and whether to introduce the feature more broadly. In May, YouTube began testing another AI tool, renaming its “Research” tab “Inspiration.” The Inspiration tool provides topics that its algorithm detects a creator’s audience might find interesting, supplying an outline and talking points. Brainstorm is similar but supports Google’s AI branding.