By
Paula ParisiAugust 30, 2024
Google is giving Gemini Advanced, Enterprise and Business subscribers the ability to create personalized AI assistants, which the company calls “Gems.” “Create your own personal AI experts on any topic you want,” the Alphabet company says. The search giant is also reintroducing Gemini’s image generation capabilities with its latest Imagen 3 model, which will be available to everyone. Gemini, which is Google’s ChatGPT competitor, will again have the ability to generate images of people, something Google disabled in February after controversy over some of the images. The company announced it has implemented new guardrails. Continue reading Gemini Gets Custom Gems AI Assistants and Adds Imagen 3
By
Paula ParisiAugust 29, 2024
In a move toward increased transparency, San Francisco-based AI startup Anthropic has published the system prompts for three of its most recent large language models: Claude 3 Opus, Claude 3.5 Sonnet and Claude 3 Haiku. The information is now available on the web and in the Claude iOS and Android apps. The prompts are instruction sets that reveal what the models can and cannot do. Anthropic says it will regularly update the information, emphasizing that evolving system prompts do not affect the API. Examples of Claude’s prompts include “Claude cannot open URLs, links, or videos” and, when dealing with images, “avoid identifying or naming any humans.” Continue reading Anthropic Publishes Claude Prompts, Sharing How AI ‘Thinks’
By
Paula ParisiAugust 29, 2024
New York-based ElevenLabs is going global with its generative AI text-to-speech reader app, which can narrate writings in 32 languages with thousands of voices from which to choose. The audio startup promises “high quality, human-like” AI voices that are “emotionally and contextually aware,” adapting delivery of written cues “to achieve a high emotional range.” ElevenLabs has focused on “creative workflow,” with a voice isolator and audio effects generator tools. Its catalog includes the voices of celebrities Judy Garland, Laurence Olivier, James Dean and Burt Reynolds. Custom models for translation and voiceover work using contemporary actors is a future possibility. Continue reading ElevenLabs Reader App Is Available Globally in 32 Languages
By
Paula ParisiAugust 29, 2024
Canadian generative video startup Viggle AI, which specializes in character motion, has raised $19 million in Series A funding. Viggle was founded in 2022 on the premise of providing a simplified process “to create lifelike animations using simple text-to-video or image-to-video prompts.” The result has been robust adoption among meme creators, with many viral videos circulating among social media platforms powered by Viggle, including one featuring Joaquin Phoenix as the Joker mimicking the movements of rapper Lil Yachty. Viggle’s Discord community has four million members including “both novice and experienced animators,” according to the company. Continue reading Viggle AI Raises $19 Million on the Power of Memes and More
By
Paula ParisiAugust 28, 2024
Adobe, OpenAI and Microsoft are among the major firms backing a California bill that would require tech companies to label AI-generated content with watermarks embedded in the metadata. Such data is easily accessible via browser for material circulated on the Internet, and the initiative would likely involve a campaign to educate the general public on how to find it. The proposed law encompasses video and audio as well as images. The three companies currently supporting the bill initially opposed it, using terms like “unworkable” and “overly burdensome.” Continue reading Bill Mandating GenAI Watermarks Gains Support in California
By
Paula ParisiAugust 28, 2024
Tidal — the music streaming service owned by Jack Dorsey’s Block payment processing company — is launching a royalty-tracking toolkit for songwriters. The new feature lets authors organize disparate publisher information in one place. “Songwriters juggle a mix of collection societies, publishing platforms, royalty management services, streaming services, and single-purpose apps to manage their royalties, careers, and catalog,” explains the company, which claims to be the first platform to serve songwriters “throughout the full writing career cycle.” Tidal has partnered with performing rights organization AllTrack to handle the backend. Continue reading Tidal, AllTrack Team to Provide Songwriter Royalty Snapshots
By
Paula ParisiAugust 27, 2024
Last year Sony entered into a joint venture with Singapore-based startup Startale Labs. Now the first fruits of that collaboration have come to light, with the launch of Soneium, an Ethereum layer 2 blockchain, from Startale and Sony Block Solutions Labs. The platform is first being made available to developers, with plans for an eventual public launch, the goal being “to create new services by leveraging the various businesses and IP within the Sony Group so that Soneium becomes an infrastructure that everyone can use on a daily basis,” according to Sony. Continue reading Sony and Startale Labs Launch Soneium Blockchain for Web3
By
Paula ParisiAugust 27, 2024
OpenAI announced its newest model, GPT-4o, can now be customized. The company said that the ability to fine-tune the multimodal GPT-4o has been “one of the most requested features from developers.” Customization can move the model toward more specific structure and tone of responses or allow it to follow specific instruction sets geared toward individual use cases. Developers can now implement custom datasets, aiming for better performance at a lower cost. The ChatGPT maker is rolling out the welcome mat by offering 1 million training tokens per day “for free for every organization” through September 23. Continue reading OpenAI Pushes GPT-4o Customization with Free Token Offer
By
Paula ParisiAugust 27, 2024
Dropbox has purchased Reclaim.ai, a scheduling tool that uses artificial intelligence to boost productivity, popular with Google Calendar users. The privately held Reclaim announced the deal in a blog post that claims a global user base of over 43,000 companies and more than 320,000 people. Launched in 2019, Reclaim investors include Index Ventures and Calendly contributing to cash raise of more than $9.5 million to date. File-sharing app Drobox has been publicly traded since 2018 and has a current market cap of $7.92 billion. Financial terms of the deal have yet to be disclosed. Continue reading Dropbox Acquires Productivity and Scheduling App Reclaim.ai
By
Paula ParisiAugust 26, 2024
Creating a universal definition of “open source AI” has generated a fair amount of debate and confusion, with many outfits using elastic parameters in order to achieve a fit. Now the Open Source Initiative (OSI) — “the authority that defines Open Source” — has issued what it hopes will become the baseline definition. That definition, which includes the ability to “use the system for any purpose and without having to ask for permission,” excludes a lot of AI platforms that currently describe themselves as “open,” many freely available only for non-commercial use. OSI’s remaining three parameters involve the ability to inspect the system and modify and share it. Continue reading OSI Aims for Industry Standard by Defining ‘Open Source AI’
By
Paula ParisiAugust 26, 2024
Meta Platforms CEO Mark Zuckerberg and Spotify CEO Daniel Ek have joined forces to express displeasure with the European Union’s regulations on artificial intelligence, claiming they are suppressing innovation. That is the opposite of the stated goals of EU lawmakers in passing the regulations. In a joint statement first published in The Economist and then on the Meta and Spotify websites Friday, the duo took aim at alleged EU obstruction to the development of open source AI, suggesting that Europe’s “fragmented regulatory structure, riddled with inconsistent implementation, is hampering innovation and holding back developers.” Continue reading Meta, Spotify Issue Statement Criticizing EU’s AI Regulations
By
Paula ParisiAugust 26, 2024
Samsung Electronics, which teased a glasses-free 3D gaming monitor at CES in January, officially announced the scheduled release of two versions at Gamescom last week. Both sizes employ light field display (LFD) technology to create what Samsung calls “lifelike 3D images” from 2D content by using a lenticular lens on the front panel. “Combined with Eye Tracking and View Mapping technology, Odyssey 3D ensures an optimized 3D experience without the need for separate 3D glasses,” according to Samsung. A built-in stereo camera monitors the movement of both eyes while proprietary View Mapping continuously adjusts the image to fuel depth perception. Continue reading Samsung Set to Release Glasses-Free Odyssey 3D Monitors
By
Paula ParisiAugust 23, 2024
SAG-AFTRA announced it is teaming with online talent marketplace Narrativ to provide the guild’s 160,000 members with the option of working with the New York-based AI startup to license their voice replicas for use in digital audio advertising. The deal would make it easy for voice actors to be considered for replicant work and get compensated, according to SAG-AFTRA, which emphasizes that performers will control the particulars, including whether to make their voices available, brand approval and fees. Narrativ also represents visual likenesses, but the SAG-AFTRA announcement is limited to voice work. Continue reading SAG-AFTRA Strikes a Deal with Narrativ for AI Voice Replicas
By
Paula ParisiAugust 23, 2024
D-ID, a platform that uses AI to generate digital humans, has announced D-ID Video Translate in general availability. The tool lets businesses and content creators automatically re-voice videos in multiple languages, “cloning the speaker’s voice and adapting their lip movements from a single upload.” D-ID is making the Video Translate tool, which accommodates 30 different languages, free to D-ID subscribers for a limited time, available through the D-ID Studio or the company’s API. Languages include Arabic, Mandarin, Japanese, Hindi and Ukrainian, in addition to Spanish, German, French and Italian. Users can simultaneously translate content using bulk translation. Continue reading D-ID Employs AI to Translate Videos into Multiple Languages
By
Paula ParisiAugust 23, 2024
Palo Alto-based startup PIP Labs announced an $80 million funding round for Story Protocol, a blockchain platform to track intellectual property rights in the era of artificial intelligence and the data scraping that enables model training. CEO and co-founder Seung Yoon “SY” Lee says the company aims to create a more sustainable IP environment for digital consumers and builders. The raise, led by Andreessen Horowitz (a16z) and Polychain Capital, values the startup at $2.25 billion. The move comes after Sahara AI announced it raised $43 million this month to fund a blockchain-based IP tracking system. Continue reading Story Raises $80M to Create Blockchain-Based IP Protection