Adobe Firefly Video Now in Public Beta Starting at $10 Month

Adobe’s Firefly video is now in public beta as part of Firefly AI, which is now multimodal with video, image and vector generation. Available for $10 for Firefly Standard or $30 for Firefly Pro, the Firefly app also has additional tiers for premium video and audio features, allowing a degree of customization based on project needs. Adobe continues to position Firefly as “the only generative AI model that is IP-friendly and commercially safe,” offering the option of contractual IP indemnification to protect against infringement lawsuits “in the unlikely event of a claim involving a Firefly output.”

ByteDance’s AI Model Can Generate Video from Single Image

ByteDance has developed a generative model that can use a single photo to generate photorealistic video of humans in motion. Called OmniHuman-1, the multimodal system supports various visual and audio styles and can generate people doing things like singing, dancing, speaking and moving in a natural fashion. ByteDance says its new technology clears hurdles that hinder existing human video generators, such as short play times and over-reliance on high-quality training data. The diffusion transformer-based OmniHuman addresses those challenges by mixing motion-related conditions into the training phase, a solution ByteDance researchers claim is new.

YouTube Premium Offers Speed Controls and Improved Audio

YouTube is rolling out new experimental features for Premium users and letting those paid-plan subscribers access more than one test feature at a time. Among the exploratory features now available to YouTube Premium users are high-quality 256kbps audio on music videos and the ability to “jump ahead” on the web, something previously available only on mobile devices. For iOS users, picture-in-picture and smart downloads for YouTube Shorts are also among the new features. In addition, the company announced bundled pricing for those users who subscribe to both YouTube Premium and Google One Premium.

CES: LG Wireless OLED TVs Boost Brightness, Include AI Tech

Extreme brightness, advanced AI and a 165Hz refresh rate for gaming are among the features of LG’s 2025 OLED evo lineup. Powering the OLED evo M5 and OLED evo G5 series is LG’s freshly minted Alpha 11 Gen 2 processor, with improved power and AI capabilities to take it beyond last year’s G4 series in picture and sound. LG calls the line the world’s first wireless OLEDs, with the ability to transmit throughout the home. LG’s Brightness Booster Ultimate, offering “brightness three times higher than conventional OLEDs,” and the Alpha 11 Gen 2 processor enhance the package.

CES: Fraunhofer Demonstrates Dynamic Lossless Audio Codec

German research organization Fraunhofer IIS has unveiled LC3plus Lossless, an audio codec that promises to streamline wireless audio transmission by introducing dynamic lossless capabilities to its established LC3plus technology. The new codec represents a complete solution for high-resolution wireless audio, automatically switching between lossless and lossy compression based on available bandwidth. This adaptive approach maintains perfect audio quality when possible while seamlessly falling back to high-quality compression when needed, all while preserving LC3plus’s core benefits of low latency and robust transmission.

CES: Samsung and Google Team on Spatial Audio Standard

Samsung Electronics has teamed with Google on a new spatial sound standard, Eclipsa Audio, that could emerge as a free alternative to Dolby Atmos. On display at CES 2025 in Las Vegas this week, the format is rolling out across Samsung’s line of 2025 TVs and soundbars, and Google will support it on the content side by enabling Eclipsa 3D audio on some YouTube videos this year. Samsung has been a notable holdout on Dolby Vision HDR, embracing instead the competing HDR10+. Now the South Korean electronics giant seems to be staking out its own turf in 3D audio, advocating for open source.

CES Unveiled: Preview of Tech to Be Featured at Trade Show

CES Unveiled 2025 offered a preview of new technologies two days ahead of the official opening of the massive CES show floor in Las Vegas on January 7. From AI-powered tools and robotics to energy-saving innovations and immersive displays, the event showcased a spectrum of advancements. Notable highlights included cognitive AI demonstrated by Neural Lab, the latest brain-computer interface tech from Naqi Logix, AR and smart glasses developed by companies such as Rokid and Mustard, and a variety of interesting video- and audio-related offerings to be showcased at CES.

YouTube Expands Access to Improved AI-Powered Dubbing

Hundreds of thousands more YouTube channels are gaining access to its AI-powered auto-dubbing feature, which generates audio translation tracks for YouTube videos, helping to make the platform’s content more accessible to viewers around the world. The expanded rollout targets informational channels in the Partner Program, such as tutorials on cooking, sewing, tourism and home improvement. Availability “will expand to other types of content soon,” according to the video streamer, which began testing the feature with select creators last year. Based on technology developed by Aloud, YouTube’s auto-dubbing emerged from the Area 120 internal incubator program.

AWS Opens Physical Locations for Fast, Secure Data Uploads

Amazon Web Services has opened AWS Data Transfer Terminals in Los Angeles and New York. These secure physical locations allow customers to bring their storage devices for fast uploads to the AWS Cloud. The enterprise service can significantly reduce data ingestion time for use cases including uploads of “large datasets from fleets of vehicles collecting data in metro areas for training machine learning models,” “digital audio and video files from content creators for media processing workloads,” and geographical and other smart city data compiled by local government organizations.

Nvidia AI Model Fugatto a Breakthrough in Generative Sound

Nvidia has unveiled an AI sound model research project called Fugatto that “can create any combination of music, voices and sounds” based on text and audio inputs. Nvidia describes it as “the world’s most flexible sound machine,” and many appear to agree that the new model represents an audio breakthrough, with the potential to generate a wide array of sounds that have not previously existed. While popular sound models from companies including Suno and ElevenLabs “can compose a song or modify a voice, none have the dexterity of the new offering,” Nvidia claims.

Microsoft Pushes Copilot Studio Agents, Adds Azure Models

Microsoft’s expansion of AI agents within the Copilot Studio ecosystem was a central focus of the company’s Ignite conference. Since the launch of Copilot Studio, more than 100,000 enterprise organizations have created or edited AI agents using the platform. Copilot Studio is getting new features to increase productivity, including multimodal capabilities that take agents beyond text, and Retrieval Augmented Generation (RAG) enhancements that give agents real-time knowledge from multiple third-party sources, such as Salesforce, ServiceNow, and Zendesk. Integration with Azure is also expanding, with 1,800 large language models in the Azure catalog being made available.

YouTube Dream Track Toolset Introduces an AI Remix Feature

YouTube has added a new feature to its Dream Track toolset, which lets select U.S. creators use AI to generate songs using the vocals of artists including John Legend, Demi Lovato, Charli XCX, Charlie Puth and others. Now users can remix Dream Track songs using natural language to describe the changes they would like, stylistic and otherwise. Selecting the “restyle a track” option will steer users toward creating a 30-second generative snippet for use in YouTube Shorts. The remixed snippets will credit the original song with “clear attribution” through the Short itself and the Shorts audio pivot page. They will also clearly indicate that the track was restyled with AI, according to Google.

TikTok Introduces New Feature to Share and Promote Music

In a move that may appeal to music fans as well as marketing professionals, ByteDance-owned video platform TikTok just announced the availability of a new promotional feature called “Share to TikTok” that enables users to share music, podcasts and audiobooks from Apple Music and Spotify to TikTok, directly from the share menus of the streaming services. Content shared to TikTok will feature links directing users to the original sources to foster discovery and engagement on the partnering services. The new feature follows the launch of “Add to Music App,” a tool for users to save songs they discover on the TikTok app to their streaming service of choice.

Yahoo Using McAfee’s Modified Image Detector to Flag Fakes

Yahoo News has signed up to use San Jose-based cybersecurity company McAfee’s deepfake image detection technology. The scalable McAfee system can “quickly identify images that may have been produced or modified using AI, including deepfake images,” flagging them for the Yahoo News editorial standards team for human review. The standards team then “determines whether the flagged images meet the platform’s editorial guidelines.” The partnership provides news aggregator Yahoo with an extra layer of protection as it deals with a large network of global publishers in addition to policing its original content.

OpenAI: sCM Generates Media 50x Faster Than Other Models

OpenAI is taking a new approach to generating media that it says is 50 times faster than the models commonly used today. Called sCM, the approach is a “consistency model,” a variation on the diffusion method used by many leading systems. OpenAI claims its new model is ideal for training on large-scale datasets and generating video, audio and images that are of “comparable sample quality to leading diffusion models.” Diffusion models often require hundreds of steps, creating challenges when it comes to real-time applications. OpenAI aims to change this with a faster system that requires less power.