By Paula Parisi, October 27, 2023
The University of Science and Technology of China (USTC) and Tencent YouTu Lab have released a research paper on a new framework called Woodpecker, designed to correct hallucinations in multimodal large language models (MLLMs). “Hallucination is a big shadow hanging over the rapidly evolving MLLMs,” writes the group, describing the phenomenon as when MLLMs “output descriptions that are inconsistent with the input image.” Solutions to date focus mainly on “instruction-tuning,” a form of retraining that is data and computation intensive. Woodpecker takes a training-free approach that aims to correct hallucinations directly in the generated text. Continue reading Woodpecker: Chinese Researchers Combat AI Hallucinations
By Paula Parisi, October 25, 2023
Nvidia Research has debuted Eureka, an AI agent that autonomously teaches robots complex motor skills. Powered by OpenAI’s GPT-4, Eureka has successfully trained a robotic hand to handle a pen with the dexterity of a human, which Nvidia says is a first. Eureka has also enabled robots to do things like open drawers, manipulate scissors and toss and catch balls, along with dozens of other tasks. “Eureka is a first step toward developing new algorithms that integrate generative and reinforcement learning methods to solve hard tasks,” Nvidia Senior Director of AI Research Anima Anandkumar said. Continue reading Nvidia Leverages OpenAI’s GPT-4 to Train Dexterous Robots
By Paula Parisi, October 11, 2023
OpenAI began previewing vision capabilities for GPT-4 in March, and the company is now starting to roll out the image input and output to users of its popular ChatGPT. The multimodal expansion also includes audio functionality, with OpenAI proclaiming late last month that “ChatGPT can now see, hear and speak.” The upgrade vaults GPT-4 into the multimodal category with what OpenAI is apparently calling GPT-4V (for “Vision,” though equally applicable to “Voice”). “We’re rolling out voice and images in ChatGPT to Plus and Enterprise users,” OpenAI announced. Continue reading ChatGPT Goes Multimodal: OpenAI Adds Vision, Voice Ability
By Paula Parisi, October 9, 2023
Likewise, a startup discovery platform backed by Bill Gates, is launching its own free chatbot named Pix. Billed as “the world’s first personal entertainment companion,” Pix helps users find TV shows, movies, books and podcasts, drawing from 600 million consumer data points. Trained on OpenAI models, Pix uses natural-language processing to answer user questions submitted by text, email or on the web at Likewise.com. Responses are promised “within seconds,” and Pix will learn users’ preferences over time. Likewise claims to have more than six million registered users. Continue reading Likewise: Startup Backed by Bill Gates Launches Pix Chatbot
By Paula Parisi, October 5, 2023
LinkedIn is unveiling new AI features to improve job hunting, marketing and sales tools for its nearly 1 billion users. The Recruiter talent sourcing platform, LinkedIn Learning and more are all getting AI assists. A central use of AI is “to take on some of workers’ day-to-day drudgery, freeing extra time for the more people-centric, strategic aspects of their job,” according to the social business platform, which just wrapped its 12th annual Talent Connect Summit. The proliferation of evolving generative AI tools is triggering new workflows for recruiters, job hunters and employees. Continue reading LinkedIn Taps OpenAI to Upgrade Business Marketing Tools
By Paula Parisi, September 27, 2023
OpenAI is experimenting with new voice and image capabilities in ChatGPT. According to the company, users can now “speak with ChatGPT and have it talk back,” thanks to an intuitive new interface that, in addition to facilitating voice conversations, will allow users to show ChatGPT an image to discuss. “Snap a picture of a landmark while traveling and have a live conversation about what’s interesting about it,” OpenAI explains, alternatively suggesting you “snap pictures of your fridge and pantry to figure out what’s for dinner” or have it help with homework based on pictures of a math problem. Continue reading OpenAI’s ChatGPT Upgraded with ‘Talk’ Tech, Image Search
By Paula Parisi, September 26, 2023
Amazon has made a strategic investment in San Francisco-based Anthropic, founded by former members of OpenAI. The AI startup will use AWS Trainium and Inferentia chips to train and deploy its future foundation models, with AWS as its primary cloud provider. In turn, Amazon says it will invest up to $4 billion in Anthropic, as it strives to compete with other technology firms in the race to develop generative AI, seeding growth for what is shaping up to be an entirely new economic and social landscape. Continue reading Amazon Plans to Invest Up to $4 Billion in AI Startup Anthropic
By Paula Parisi, September 20, 2023
Google is implementing a plan to help its Bard AI become more competitive with OpenAI’s ChatGPT. Bard Extensions will allow English-language users to expand the chatbot’s knowledge repository to data from various Google apps, including Gmail, Google Docs, Google Drive, Google Maps, YouTube, and Google Flights and hotels, or even information stored “across multiple apps and services,” Google says. The update boosts search engine capabilities with the travel features, while providing some functionalities of a personal assistant by letting it identify missed emails or summarize the relevant points in a document. Continue reading Google Links Bard AI to Apps Including YouTube, Docs, Drive
By Paula Parisi, September 7, 2023
Walmart is putting generative AI in the hands of roughly 50,000 non-store U.S. employees who will have access to My Assistant, an LLM trained on information. From speeding the drafting process to serving as a creative partner and summarizing documents, “My Assistant has the potential to change how our associates work and solve problems,” Walmart said, emphasizing the launch goes beyond productivity gains. “We believe the key to unlocking transformation lies in the creativity and innovation of our associates. Ideally, this technology will free them from monotonous, repetitive tasks, allowing more time and focus for improving the customer/member experience.” Continue reading Walmart Is ‘Empowering’ 50,000 U.S. Associates with GenAI
By Paula Parisi, August 31, 2023
Google is making many of its most powerful cloud computing tools available commercially for the first time, Google Cloud CEO Thomas Kurian shared at the company’s Cloud Next ’23 conference in San Francisco. In a bid to catch up with top AI rivals such as Amazon and Microsoft, the Google Distributed Cloud will open for general business, including at the edge, with Vertex AI and PaLM 2. Google Cloud will serve up AI from Anthropic, in which it is an investor, as well as from Meta Platforms. In addition, an AI-infused Gmail productivity suite is on the way. Continue reading Google Takes on the Competition with Cloud and AI Services
By Paula Parisi, August 10, 2023
Google has debuted Project IDX, an AI-enabled development environment for building full-stack web and multiplatform apps. Comparing app development that works across mobile, web, and desktop platforms to “building a Rube Goldberg machine” with a duct-taped tech stack, Google says Project IDX smooths the process of compiling, testing, deploying and monitoring apps. The browser-based Project IDX is built on the Google Cloud using the Codey family of AI foundation models built on PaLM 2. Currently, IDX supports the JavaScript and Dart languages, with plans for Python, Go and more. Continue reading Google’s Project IDX Offers Full-Stack Dev in a Web Browser
By Paula Parisi, August 3, 2023
Meta Platforms is amping up its AI play, with plans to launch a suite of personality-driven chatbots as soon as next month. The company has been developing the series of artificially intelligent character bots with a goal of using them to boost engagement with its social media brands by making them available to have “humanlike discussions” on platforms including Facebook, Instagram and WhatsApp. Internally dubbed “personas,” the chatbots simulate characters ranging from historical figures like Abraham Lincoln to a surfer dude that dispenses travel advice. Continue reading Meta Plans Personality-Driven Chatbots to Boost Engagement
By Paula Parisi, July 27, 2023
Microsoft Cloud drove record sales and profits for Q2, which saw a year-over-year revenue gain of 8 percent to $56.2 billion for April through June. Net income topped $20 billion, a 20 percent gain that beat analyst expectations and the company’s own estimates. Microsoft Cloud revenue for Q2 was up 21 percent, to $30.3 billion. And the company is beginning to see the results of its investments in artificial intelligence. Q2 is Microsoft’s second record-setting quarter this year, topping the three-month high of $52.9 billion in Q1. The previous profit record was $18.8 billion in Q4 2021. Continue reading Microsoft Q2 Marks a Quarterly Sales Record of $56.2 Billion
By Paula Parisi, July 21, 2023
Microsoft is launching Bing Chat Enterprise, a business-focused version of Bing Chat with data privacy and governance controls. The company is also introducing Visual Search in Bing Chat and new AI features for Azure, revealed at its Inspire 2023 conference this week. In addition, the cloud-based Copilot plan “combines the power of large language models with your data in the Microsoft Graph and Microsoft 365 apps” for a new way of working using only natural language prompts. Currently in early access, Copilot will be priced at $30 per user per month for Microsoft 365 E3, E5, Business Standard and Business Premium subscribers. Continue reading Microsoft Intros Bing Chat Enterprise, New AI Tools for Azure
By Paula Parisi, July 21, 2023
Apple is reportedly developing tools it could use to enter the artificial intelligence space, joining rivals such as Microsoft and Google, which have already released popular products. In Cupertino, the company is said to have built a framework for large language models, which power AI-based chatbot offerings similar to Google’s Bard and OpenAI’s ChatGPT. Called Ajax, the platform is the basis for what is referred to inside the company as Apple GPT. Though Apple has built automation into its products for some time, it could now be preparing to make a direct play for the generative AI market. Continue reading Apple Chatbot ‘Ajax’ Could Be Next Major Player in AI Space