Google Launches Agentspace in the UK and Promotes Chirp 3

Google is expanding its AI presence in the UK market, hosting a splashy launch event there for Agentspace, the AI agent hub it launched in December to make it easy for enterprises to build, manage and deploy custom agents using Gemini. The gathering was hosted by Google DeepMind CEO Demis Hassabis and Google Cloud CEO Thomas Kurian, and included participation by local customers BT Group and advertising powerhouse WPP. Google invited UK businesses to store cloud data locally using its $1 billion data center, opening there this year. The company also promoted its new Chirp 3 audio generator, which offers HD voice synthesis.

Baidu Releases New LLMs that Undercut Competition’s Price

Baidu has launched two new AI systems, the native multimodal foundation model Ernie 4.5 and deep-thinking reasoning model Ernie X1. The latter supports features like generative imaging, advanced search and webpage content comprehension. Baidu is touting Ernie X1 as comparable in performance to another Chinese model, DeepSeek-R1, but says it is half the price. Both Baidu models are available to the public, including individual users, through the Ernie website. Baidu, the dominant search engine in China, says its new models mark a milestone in both reasoning and multimodal AI, “offering advanced capabilities at a more accessible price point.”

$7.5M Funds NYU’s Sony Audio Institute, Opening This Spring

Sony Corporation has launched the Sony Audio Institute at NYU’s Steinhardt School of Culture, Education and Human Development, focusing on innovation in the business and technology of music. Opening this spring, the Sony Audio Institute will serve as an interdisciplinary collaboration that brings together the expertise of Sony’s professional and consumer audio businesses and their leading-edge technologies with NYU students, facilities and faculty. The institute opens with NYU Steinhardt Music Business Program Director Larry Miller at the helm. Miller will focus on the new outfit’s operations full time beginning this fall.

Authors Can Use ElevenLabs Audiobook Narration for Spotify

Spotify is boosting its audiobook content by agreeing to accept material narrated using ElevenLabs’ AI voice app. Given that ElevenLabs is currently among the most recognized AI audio providers, this new partnership is expected to boost the quantity of AI-narrated audiobooks on the platform. ElevenLabs content can be distributed to Spotify (and “select other audiobook retailers”) via Spotify’s Findaway Voices platform for indie authors. For $99 per month, authors can generate up to 500 minutes of narration from AI audio startup ElevenLabs in 29 languages with what Spotify says is “complete control over voice and intonation.”

Adobe Firefly Video Now in Public Beta Starting at $10 a Month

Adobe’s Firefly video is now in public beta as part of Firefly AI, now multimodal with video, image and vector generation. Available for $10 per month for Firefly Standard or $30 per month for Firefly Pro, the Firefly app offers additional tiers for premium video and audio features, allowing a degree of customization based on project needs. Adobe continues to position Firefly as “the only generative AI model that is IP-friendly and commercially safe,” offering the option of contractual IP indemnification to protect against infringement lawsuits “in the unlikely event of a claim involving a Firefly output.”

ByteDance’s AI Model Can Generate Video from Single Image

ByteDance has developed a generative model that can use a single photo to generate photorealistic video of humans in motion. Called OmniHuman-1, the multimodal system supports various visual and audio styles and can generate people doing things like singing, dancing, speaking and moving in a natural fashion. ByteDance says its new technology clears hurdles that hinder existing human video generators, obstacles like short play times and over-reliance on high-quality training data. The diffusion transformer-based OmniHuman addresses those challenges by mixing motion-related conditions into the training phase, a solution ByteDance researchers claim is new.

YouTube Premium Offers Speed Controls and Improved Audio

YouTube is rolling out new experimental features for Premium users and letting those paid-plan subscribers access more than one test feature at a time. Among the exploratory features now available to YouTube Premium users are high-quality 256kbps audio on music videos and the ability to “jump ahead” on the web, something previously available only on mobile devices. For iOS users, picture-in-picture and smart downloads for YouTube Shorts are also among the new features. In addition, the company announced bundled pricing for those users who subscribe to both YouTube Premium and Google One Premium.

CES: LG Wireless OLED TVs Boost Brightness, Include AI Tech

Extreme brightness, advanced AI and a 165Hz refresh rate for gaming are among the features of LG’s 2025 OLED evo lineup. Powering the OLED evo M5 and OLED evo G5 series is LG’s freshly minted Alpha 11 Gen 2 processor, with improved power and AI capabilities to take it beyond last year’s G4 series in picture and sound. LG calls the line the world’s first wireless OLEDs, with the ability to transmit throughout the home. LG’s Brightness Booster Ultimate, which offers “brightness three times higher than conventional OLEDs,” rounds out the package.

CES: Fraunhofer Demonstrates Dynamic Lossless Audio Codec

German research organization Fraunhofer IIS has unveiled LC3plus Lossless, an audio codec that promises to streamline wireless audio transmission by introducing dynamic lossless capabilities to its established LC3plus technology. The new codec represents a complete solution for high-resolution wireless audio, automatically switching between lossless and lossy compression based on available bandwidth. This adaptive approach maintains perfect audio quality when possible while seamlessly falling back to high-quality compression when needed, all while preserving LC3plus’s core benefits of low latency and robust transmission.

CES: Samsung and Google Team on Spatial Audio Standard

Samsung Electronics has teamed with Google on a new spatial sound standard, Eclipsa Audio, that could emerge as a free alternative to Dolby Atmos. On display at CES 2025 in Las Vegas this week, the format is rolling out across Samsung’s line of 2025 TVs and soundbars, and Google will support it on the content side by enabling Eclipsa 3D audio on some YouTube videos this year. Samsung has been a notable holdout on Dolby Vision HDR, embracing instead the competing HDR10+. Now the South Korean electronics giant seems to be staking out its own turf in 3D audio, advocating for open source.

CES Unveiled: Preview of Tech to Be Featured at Trade Show

CES Unveiled 2025 offered a preview of new technologies two days ahead of the official opening of the massive CES show floor in Las Vegas on January 7. From AI-powered tools and robotics to energy-saving innovations and immersive displays, the event showcased a spectrum of advancements. Among the more notable highlights were cognitive AI demonstrated by Neural Lab, the latest brain-computer interface tech from Naqi Logix, AR and smart glasses developed by companies such as Rokid and Mustard, and a variety of interesting video- and audio-related offerings to be showcased at CES.

YouTube Expands Access to Improved AI-Powered Dubbing

Hundreds of thousands more YouTube channels are gaining access to its AI-powered auto-dubbing feature, which generates audio translation tracks for YouTube videos, helping to make the platform’s content more accessible to viewers around the world. The expanded rollout targets informational channels in the Partner Program, such as tutorials on cooking, sewing, tourism and home improvement. Availability “will expand to other types of content soon,” according to the video streamer, which began testing the feature with select creators last year. Based on technology developed by Aloud, YouTube’s auto-dubbing emerged from the Area 120 internal incubator program.

AWS Opens Physical Locations for Fast, Secure Data Uploads

Amazon Web Services has opened AWS Data Transfer Terminals in Los Angeles and New York. These secure physical locations allow customers to bring their storage devices for fast uploads to the AWS Cloud. The enterprise service can significantly reduce data ingestion time for use cases including uploads of “large datasets from fleets of vehicles collecting data in metro areas for training machine learning models” and “digital audio and video files from content creators for media processing workloads,” as well as geographical and other smart city data compiled by local government organizations.

Nvidia AI Model Fugatto a Breakthrough in Generative Sound

Nvidia has unveiled an AI sound model research project called Fugatto that “can create any combination of music, voices and sounds” based on text and audio inputs. Nvidia describes it as “the world’s most flexible sound machine,” and many appear to agree that the new model represents an audio breakthrough, with the potential to generate a wide array of sounds that have not previously existed. While popular sound models from companies including Suno and ElevenLabs “can compose a song or modify a voice, none have the dexterity of the new offering,” Nvidia claims.

Microsoft Pushes Copilot Studio Agents, Adds Azure Models

Microsoft’s expansion of AI agents within the Copilot Studio ecosystem was a central focus of the company’s Ignite conference. Since the launch of Copilot Studio, more than 100,000 enterprise organizations have created or edited AI agents using the platform. Copilot Studio is getting new features to increase productivity, including multimodal capabilities that take agents beyond text, and Retrieval Augmented Generation (RAG) enhancements that give agents real-time knowledge from multiple third-party sources, such as Salesforce, ServiceNow, and Zendesk. Integration with Azure is expanded as 1,800 large language models in the Azure catalog are made available.