BodyTalk Dubs into 29 Languages with Facial Moves to Match

Panjaya is an AI startup that aims to disrupt the world of video dubbing with a way to generate “hyperrealistic” recreations of a person’s voice speaking a new language. The system also automatically modifies the imagery so that lip and other physical movements match the new speech patterns. Called BodyTalk, the technique is the launch point for Panjaya as it emerges from the stealth in which it conducted its R&D over the past three years, backed by $9.5 million from venture funds and angel backers. The startup describes BodyTalk as “AI dubbing that looks and feels as natural as the original.”

IBM Cloud Is First to Widely Implement Intel Gaudi 3 AI Chips

IBM is the first cloud customer for Intel’s Gaudi 3 AI accelerator chip, which it will make available in early 2025. The Gaudi 3 will be available for hybrid and on-site environments via the IBM Cloud, as part of Watsonx AI and on IBM data platforms. Gaudi 3, which began shipping in Q2 and is expected to go into mass production later this year, is Intel’s AI challenger to GPU accelerators from Nvidia and AMD, the latter having begun shipping its own HPC solution, the MI300X, in January. Unlike that chip and Nvidia’s Hopper H100 and more recent Blackwell B200, the Gaudi 3 is not a GPU but is built on an architecture designed specifically for inference and deep learning.

Apple Launches Open-Source Language-Based Image Editor

Apple has released MGIE, an open-source AI model that edits images using natural language instructions. MGIE, short for MLLM-Guided Image Editing, can also modify and optimize images. Developed in conjunction with the University of California, Santa Barbara, MGIE is Apple’s first AI model. The multimodal MGIE, which understands text and image input, also crops, resizes, flips, and adds filters based on text instructions. Apple says its instruction set is easier than those of other AI editing programs, and that using it is simpler and faster than learning a traditional program like Apple’s own Final Cut Pro.

LG Plans to Demo Its New OLED and QNED TV Tech at CES

LG Electronics will showcase its latest television technologies at CES in Las Vegas next week, including its 2024 lineup of QNED and QNED Mini LED TVs with models up to 98 inches, and the company’s top-line M4 and G4 OLED TVs (more on those sets in tomorrow’s ETCentric). LG says the advanced graphics capabilities of faster AI processing will provide viewers with a brighter picture, smoother motion and superior, vibrant colors. The company also announced an upcoming soundbar lineup, featuring premium surround-sound devices specifically designed for its new OLED and QNED TVs for what LG describes as an “elevated home cinema experience.”

Netflix Uses Deep Learning to Optimize Streaming in 4K HDR

Netflix has completed a worldwide technology upgrade that improves video quality for Premium subscribers viewing 4K HDR titles. The move is being hailed as welcome news in the wake of a price hike to $22.99 from $19.99 for U.S. Premium customers. Netflix used the “dynamic optimization” video encoding method to implement an HDR variant of the company’s VMAF (Video Multimethod Assessment Fusion) quality metric. The new HDR-VMAF is the result of a collaboration between Netflix and Dolby Laboratories that employs “subjective tests with 4K HDR content using high-end OLED panels,” according to Netflix.

Musk Staffs xAI with Execs from Top Technology Companies

Elon Musk is sharing additional details about his latest endeavor, an artificial intelligence company called xAI. The CEO of Tesla and SpaceX, who is also owner, executive chairman and CTO of Twitter, says his new company aspires to “understand the true nature of the universe.” While word began leaking out in February about Musk’s AI plans, he went public with his team on Wednesday (featuring executives from several notable tech firms), communicating via a newly minted website that also includes a recruitment message. Musk plans to release more information today live on Twitter Spaces.

Mixed Reactions to ‘Pause’ on AI Models Larger than GPT-4

Respected members of the advanced tech community are going on record opposing the faction calling for a “pause” in large-model artificial intelligence development. Meta Platforms chief AI scientist Yann LeCun and DeepLearning.AI founder and CEO Andrew Ng, formerly at Alphabet where he helped launch Google Brain, were joined this past week by Bill Gates and former Google CEO Eric Schmidt in opposing the proposed six-month halt to development of AI models more advanced than OpenAI’s GPT-4, which is said to have a trillion parameters, more than five times as many as GPT-3.

CES: Generative AI Is Having Its ‘War of the Worlds’ Moment

ChatGPT came too late (end of November) to make a significant impact on CES this year, but the cacophony of opinions about the generative AI model definitely made its way to Vegas. The timing was perfect. Just as the crypto crash left the hype industry paralyzed, OpenAI launched ChatGPT in what now feels like a nerdy and frustrating tech version of the Rolling Stones’ Altamont concert in ’69 (with computer scientists as the Hells Angels). Make no mistake: this is a landmark achievement in machine learning, perhaps the single greatest since the 2006 paper by Hinton, Salakhutdinov, Osindero and Teh on backpropagation in deep neural networks. However, it’s critical that industries, including M&E, distinguish between hype and reality.

ChatGPT: OpenAI’s New Chatbot Draws Praise and Criticism

OpenAI’s new AI chatbot, ChatGPT, is taking the world by storm. “Quite simply, the best artificial intelligence chatbot ever released to the general public,” is how The New York Times describes ChatGPT, which more than a million people signed up for when it opened for testing last week. Screenshots of ChatGPT conversations blew up Twitter. “Something big is happening,” tweeted one fan. “I just had a 20-minute conversation with ChatGPT about the history of physics … OMG,” offered another. The “GPT” in the name stands for “generative pre-trained transformer,” a language model that leverages deep learning to respond to text-based input with human-like responses.

Chinese Game Company Appoints AI CEO and Invests in AR

Online game company and mobile app developer NetDragon Websoft has invested $40 million in Rokid, maker of 5G-friendly AR glasses for business applications. Both companies are based in China. NetDragon was in the news this past month when it became the first company to appoint an AI as its “rotating CEO.” Following the Rokid announcement, it appears the firm may be interested in developing lifelike AI characters to inhabit its games, augment teaching and enhance its AR initiatives, though to hear NetDragon’s actual CEO, Liu Dejian, tell it, the company can learn a lot from its new C-suite addition, Tang Yu.

Nvidia, Intel and ARM Publish New FP8 AI Interchange Format

Nvidia, Intel and ARM have published a draft specification for a common AI interchange format aimed at faster and more efficient system development. The proposed “8-bit floating point” standard, known as FP8, will potentially accelerate both training and operating the systems by reducing memory usage and optimizing interconnect bandwidth. The lower-precision number format is a key factor in driving efficiency. Transformer networks, in particular, benefit from 8-bit floating point precision, and having a common interchange format should facilitate interoperability advances for both hardware and software platforms.
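To make the idea of an 8-bit floating point value concrete, here is a minimal Python sketch of how a single byte could be decoded into a number. The excerpt above does not describe the bit layout, so the sketch assumes the two encodings generally associated with FP8: E5M2 (5 exponent bits, 2 mantissa bits) and E4M3 (4 exponent bits, 3 mantissa bits). The function name `decode_fp8` is hypothetical, and the code handles finite values only; it does not model the infinity and NaN bit patterns.

```python
def decode_fp8(byte: int, exp_bits: int, man_bits: int) -> float:
    """Decode one byte laid out as sign | exponent | mantissa.

    Illustrative sketch only: uses the usual IEEE-style bias and
    subnormal rules, and ignores infinity/NaN bit patterns.
    """
    if not 0 <= byte <= 0xFF:
        raise ValueError("byte must be in the range 0..255")
    if 1 + exp_bits + man_bits != 8:
        raise ValueError("sign + exponent + mantissa must total 8 bits")

    sign = -1.0 if (byte >> 7) & 1 else 1.0
    exponent = (byte >> man_bits) & ((1 << exp_bits) - 1)
    mantissa = byte & ((1 << man_bits) - 1)
    bias = (1 << (exp_bits - 1)) - 1       # 15 for E5M2, 7 for E4M3
    fraction = mantissa / (1 << man_bits)  # mantissa as a fraction of 1

    if exponent == 0:
        # Subnormal: no implicit leading 1, fixed minimum exponent.
        return sign * fraction * 2.0 ** (1 - bias)
    # Normal: implicit leading 1.
    return sign * (1.0 + fraction) * 2.0 ** (exponent - bias)


if __name__ == "__main__":
    print(decode_fp8(0x3C, exp_bits=5, man_bits=2))  # 1.0 in E5M2
    print(decode_fp8(0x7E, exp_bits=4, man_bits=3))  # 448.0 in E4M3
```

Each value occupies one byte, compared with two bytes for a 16-bit float and four for a 32-bit float, which is where the memory and bandwidth savings described above come from. The trade-off between the two layouts is range versus precision: more exponent bits cover a wider range of magnitudes, while more mantissa bits give finer steps between neighboring values.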

Humanloop Raises $2.6 Million as Interest in NLP Tech Grows

Interest in natural language processing (NLP) as an AI training tool is exploding, with analysts predicting a bumper crop of new startups. One such startup, Humanloop, is already gaining attention, having just pulled in $2.6 million in seed funding led by Index Ventures with participation by Y Combinator, LocalGlobe and AlbionVC. Founded in 2020 by computer scientists from the University of Cambridge with alumni from Google and Amazon, the company says its technology makes it “significantly” easier for companies to leverage NLP that helps humans “teach” AI algorithms.

Nvidia Turbo Charges NeMo Megatron Large Training Model

Nvidia has issued a software update for its formidable NeMo Megatron framework for training giant language models, increasing efficiency and speed. Barely a year since Nvidia unveiled Megatron, this latest improvement further leverages the transformer architecture that has become synonymous with deep learning since Google introduced the concept in 2017. New features result in what Nvidia says is a 5x reduction in memory requirements and up to a 30 percent gain in speed for models as large as 1 trillion parameters, making NeMo Megatron better at handling transformer tasks across the entire stack.

Nvidia Touts New H100 GPU and Grace CPU Superchip for AI

Nvidia has begun previewing its latest H100 Tensor Core GPU, promising “an order-of-magnitude performance leap for large-scale AI and HPC” over previous iterations, according to the company. Nvidia founder and CEO Jensen Huang announced the Hopper earlier this year, and IT professionals’ website ServeTheHome recently had a chance to see an H100 SXM5 module demonstrated. Consuming up to 700W in an effort to deliver 60 FP64 Tensor teraflops, the module (which features 80 billion transistors and has 8448/16896 FP64/FP32 cores in addition to 528 Tensor cores) is described as “monstrous” in the best way.

Advances by OpenAI and DeepMind Boost AI Language Skills

Advances in language comprehension for artificial intelligence are issuing from San Francisco’s OpenAI and London-based DeepMind. OpenAI, which has been working on large language models, says it now lets customers fine-tune its GPT-3 models using their own custom data, while the Alphabet-owned DeepMind is talking up Gopher, a 280-billion-parameter deep-learning language model that has scored impressively on tests. Sophisticated language models can comprehend natural language as well as predict and generate text, all requirements for creating advanced AI systems that can dispense information and advice or that must follow instructions.