By Paula Parisi, December 2, 2024
Anticipating what one outlet calls “the likely imminent release of OpenAI’s Sora,” generative AI video competitors are compelled to step up their game. Luma AI has released a major upgrade to its Dream Machine, speeding its already quick video generation and enabling a chat function for natural language prompts, so you can talk to it as with OpenAI’s ChatGPT. In addition to the new interface, Dream Machine is going mobile and adding a new foundation image model, Luma AI Photon, which “has been purpose built to advance the power and capabilities of Dream Machine,” according to the company. Continue reading Luma AI Upgrades Its Video Generator and Adds Image Model
By Paula Parisi, November 14, 2024
Ernie, the foundation model for Baidu’s generative AI, has been updated with iRAG technology to mitigate visual hallucinations and a no-code tool called Miaoda that creates apps using natural language. The company behind China’s largest search engine says Ernie now handles 1.5 billion daily user queries, up from 50 million circa its March 2023 launch (a 30x increase). Baidu also debuted Ernie-powered smart glasses from its Xiaodu Technology hardware unit. The Xiaodu AI Glasses feature built-in voice activation and cameras for taking photos and video. The news was shared at this week’s Baidu World 2024 in Shanghai. Continue reading Baidu’s Ernie AI Gets Improved Text-to-Image and App Builder
By Paula Parisi, November 6, 2024
Amazon Prime Video has begun offering X-Ray Recaps, summaries of favorite TV shows that catch you up without risk of spoilers. The generative AI-powered feature can create snapshots of any requested view — episodes, pieces of episodes or full seasons of TV shows. “Whether you’re a few minutes into a new episode, halfway through a season” or took a break to get popcorn and need a quick refresher, X-Ray Recaps will catch you up “personalized down to the exact minute of where you are watching,” according to Amazon, which assures “guardrails are applied” to ensure the generation of spoiler-free summaries. Continue reading Amazon Prime Video Offers AI-Powered Recaps of TV Shows
By Rob Scott, November 4, 2024
Amazon reported major revenue and profit increases during its third quarter, beating Wall Street’s forecasts, based largely on the company’s e-commerce sales and increasing demand for its cloud services. Capital expenditure, which reached a record amount following Amazon’s recent investments in artificial intelligence, will maintain its momentum as the company plans $75 billion in capex for developing generative AI services over 2024-2025. “The faster we grow demand, the faster we have to invest capital in data centers, network gear and hardware,” explained CEO Andy Jassy. “We invest in all that upfront in advance of when we can monetize it.” Continue reading Amazon Pushes AI, Records Growth in Q3 Revenue and Profit
By Rob Scott, November 4, 2024
Revenue reached an all-time high for Apple’s most recent quarter as iPhone sales experienced an uptick due in part to consumer excitement for the arrival of Apple Intelligence, the company’s heavily advertised set of AI tools. Total sales reached $94.9 billion for the quarter, up 6 percent year-over-year and exceeding the $94.5 billion that financial analysts had predicted. The company’s iPhone business reported sales of $46.2 billion, following disappointing consecutive quarters in the first half of the year. The AI boom resulted in strong quarters for other Big Tech leaders including Alphabet, Amazon, Meta Platforms and Microsoft. Continue reading Jump in iPhone Business Results in Record Quarter for Apple
By Paula Parisi, October 16, 2024
OpenAI has announced Swarm, an experimental framework that coordinates networks of AI agents, and true to its name the news has kicked over a hornet’s nest of contentious debate about the ethics of artificial intelligence and the future of enterprise automation. OpenAI emphasizes that Swarm is not an official product and says though it has shared the code publicly it has no intention of maintaining it. “Think of it more like a cookbook,” OpenAI engineer Shyamal Anadkat said in a social media post, calling it “code for building simple agents.” Continue reading OpenAI Tests Open-Source Framework for Autonomous Agents
By Paula Parisi, October 10, 2024
OpenAI has added publishing powerhouse Hearst to its formidable list of media partners. The force behind outlets including Cosmopolitan, Elle, Esquire, Car & Driver, Popular Mechanics, San Francisco Chronicle and Houston Chronicle will allow more than 20 magazine brands and over 40 newspapers to provide “a vast array of lifestyle content” as well as local news and niche insights to ChatGPT’s professed 200 million weekly users and, presumably, to the prototype SearchGPT that launched in July (with a planned ChatGPT integration). Continue reading Hearst Agrees to Content Deal with OpenAI to Fuel ChatGPT
By Paula Parisi, October 9, 2024
Samsung heralded the world of personalized AI at its 10th annual developer conference, where Samsung Electronics Vice Chairman, CEO and Head of Device eXperience Jong-Hee Han said those who own the company’s top of the line TVs will soon have generative AI, ChatGPT and a more responsive relationship with Bixby, Samsung’s smart assistant. The company introduced AI Cast, making it simpler to get intelligence from Galaxy phones to Samsung TVs. The Galaxy S24 series, released early this year, has native AI that will soon generate content that can be beamed to a sprawling TV screen. Continue reading Samsung Developer Conference Emphasizes AI, One UI 7 UX
By Paula Parisi, October 8, 2024
On the heels of announcing a $6.6 billion funding round, OpenAI is getting busy with new products including the launch of the latest iteration of ChatGPT. The chatbot will now extend beyond simple questions and answers with Canvas, a new interface that opens in a separate window, allowing collaborative engagement with ChatGPT on writing and coding projects. Launching in beta, Canvas was built with GPT-4o and can be manually selected in the model picker. Canvas is being made available first to global ChatGPT Plus and Team users with Enterprise and Edu users next. The company says it will be available to all free ChatGPT users when it’s out of beta. Continue reading ChatGPT Enhances Collaborative Ability with Canvas Interface
By Paula Parisi, October 7, 2024
Having demonstrated how advertisements in its AI Overviews would work back in May at its Google Marketing Live event, the search giant is now adding the feature for U.S. mobile users and plans to include Google Lens shopping ads “above and alongside visual search results by the end of the year.” “The ways people ask questions today have expanded beyond the search box,” notes Google, explaining the move as a response to that evolution, as artificial intelligence technology has helped consumers use their voice and cameras “to explore the world around them.” Continue reading Google Serving Ads in AI Overviews and Lens Search Results
By Paula Parisi, October 3, 2024
OpenAI unveiled major updates at its DevDay conference with the focus largely on making AI more accessible, efficient and affordable. Included were four innovations: Vision Fine-Tuning in the API, Model Distillation, Prompt Caching and the public beta of Realtime API. The approach underscores OpenAI’s effort to empower its developer ecosystem even as it continues to compete for end-users in the enterprise space. The Realtime API gives developers the option of building “nearly real-time” speech-to-speech app experiences, selecting from among six OpenAI voices. Vision Fine-Tuning for GPT-4o enables customization of the model’s visual understanding of images and text. Continue reading OpenAI Showcases Latest Updates for Voice, Picture and More
By Paula Parisi, October 2, 2024
Snap Inc. is leveraging its relationship with Google Cloud to use Gemini for powering generative AI experiences within Snapchat’s My AI chatbot. The multimodal capabilities of Gemini on Vertex AI will greatly increase the My AI chatbot’s ability to understand and operate across different types of information such as text, audio, image, video and code. Snapchatters can use My AI to take advantage of Google Lens-like features, including asking the chatbot “to translate a photo of a street sign while traveling abroad, or take a video of different snack offerings to ask which one is the healthiest option.” Continue reading Snapchat: My AI Goes Multimodal with Google Cloud, Gemini
By Paula Parisi, September 26, 2024
As OpenAI gears up to become a for-profit company next year, it is releasing ChatGPT Advanced Voice Mode, which brings a humanlike conversation mode to ChatGPT 4o. All U.S. subscribers to ChatGPT Plus and Team plans will gain access to the new feature, which will also be made available to those paying for ChatGPT Edu and Enterprise plans in the coming weeks. The firm is also adding five new voices and allowing customers to save personalized instructions for the voice assistant, including memory behaviors. Concurrently, executives including CTO Mira Murati have resigned as the company pivots to commerciality. Continue reading OpenAI Rolls Out Advanced Voice Mode Feature for ChatGPT
By Paula Parisi, September 23, 2024
BlackRock has joined forces with Microsoft to launch what will initially be a $30 billion investment fund to finance AI infrastructure, concentrating primarily on building data centers and developing energy projects. The amount could quickly scale to about $100 billion. Abu Dhabi-based tech investment firm MGX is also participating, as is Global Infrastructure Partners (GIP), which owns, operates and invests across energy, transport, digital and waste management. BlackRock announced it is in the process of acquiring GIP, and says the deal is expected to close next month. The new fund is called Global AI Infrastructure Investment Partnership (GAIIP). Continue reading BlackRock Teams with Microsoft to Advance AI Infrastructure
By Paula Parisi, September 20, 2024
A newly redesigned Snapchat experience is built around a three-tab user interface called Simple Snapchat. As part of that effort, the social platform is launching more generative video features, including text-to-video as part of the app’s Lens Studio AR authoring tool. Easy Lens allows the quick generation of Lenses by typing text prompts, making it possible to do things like experiment with Halloween costumes or explore looks for back to school. Launching in beta for select creators, Snap says the new features are designed for all ability levels. The company is also updating its GenAI Suite and adding an Animation Library of “hundreds of high-quality movements.” Continue reading Snapchat Is Getting a Redesign and Generative Text-to-Video