By
Paula ParisiDecember 1, 2023
Amazon has launched five new capabilities to its SageMaker service, including Sagemaker HyperPod, which accelerates large language and foundation model training and tuning. Sagemaker HyperPod is said to shorten the training time by up to 40 percent using its purpose-built infrastructure designed for distributed training at scale. By optimizing acceleration, SageMaker Inference reduces foundation model deployment costs by 50 percent and latency by 20 percent on average, Amazon claims. “SageMaker HyperPod removes the undifferentiated heavy lifting involved in building and optimizing machine learning infrastructure,” said Amazon. Continue reading SageMaker HyperPod: Amazon Accelerates AI Model Training
By
Paul BennunNovember 28, 2023
Bill Gates has published his thinking about the future of computing, and fascinatingly, it’s the same as his prediction from decades ago: agents. No mere bots — and certainly not anthropomorphized paperclips — agents (to Gates) will abstract almost all HCI to a natural language conversation with systems that have our permission to take meaningful actions. Gates makes a highly specific prediction: within five years, the very idea of an app itself will seem as outdated as a rotary phone dial does next to an iPhone. A conversational UI will sit on top of a language model that has access to as much of our private data as we wish to give it. Continue reading Bill Gates Imagines Agents as the Human-Computer Interface
By
Paula ParisiNovember 27, 2023
Nvidia logged another record quarter, with Q3 revenue of $18.12 billion, up 206 percent from a year ago and a 34 percent increase from Q2 that exceeded both its own and analyst projections. The surge, attributed to increasing demand for the chips that drive artificial intelligence, logged primarily under Nvidia’s data center results a record $14.51 billion, up 279 percent from the prior year and 41 percent from Q2. Profits swelled to $9.2 billion, a stunning 1,259 percent increase from 2022’s $680 million. The results for Nvidia’s Q3 were for the three-month period ending October 31. Continue reading Nvidia Sales Surge as Rivals Circle and China Sanctions Loom
By
Rob ScottNovember 20, 2023
Rumors were running rampant over the weekend as an unanticipated executive shuffle played out at OpenAI. It began on Friday when CEO Sam Altman was pushed out by the OpenAI board. President and co-founder Greg Brockman quickly resigned in solidarity, followed by several top researchers. Reports circulated the following day that investors were pressuring the board into reconsidering its decision, but by Sunday evening, OpenAI announced that former Twitch leader Emmett Shear would serve as the new interim chief. Shortly after, Microsoft CEO Satya Nadella said Altman, Brockman and other OpenAI employees would join Microsoft to lead an advanced AI research unit. Continue reading Sam Altman Joins Microsoft After Abruptly Ousted by OpenAI
By
Paula ParisiNovember 13, 2023
Former Apple designers Imran Chaudhri and Bethany Bongiorno last week officially launched the Humane Ai Pin they’re positioning as a smartphone replacement. The $700 wearable magnetically attaches to clothing. A $24 per month T-Mobile data subscription is required for connectivity. Described as “a download device and software platform built from the ground up for AI,” it’s got an ultra-wide RGB camera, depth sensors and motion sensors, and a speaker that creates “a bubble of sound” that can be loud or soft. Preorders for the Ai Pin begin November 16, with shipments scheduled to begin in early 2024. Continue reading Humane’s $700 Ai Pin Is Positioned to Replace Smartphones
By
Paula ParisiNovember 10, 2023
Samsung Electronics has unveiled a generative AI model called Samsung Gauss designed specifically for artificial intelligence apps on mobile devices. If Samsung deploys Gauss to its smartphones anytime soon it would be among the first handset makers to natively integrate generative AI, putting it ahead of Apple. Gauss was revealed at the Samsung AI Forum 2023 in Korea, held by Samsung Research. Gauss is a large language model that facilitates tasks such as composing emails, summarizing documents and translating content. Samsung says it also enables smarter device control when integrated into products. Continue reading Samsung Is an Early Mover in Mobile LLM Space with Gauss
By
Paula ParisiNovember 10, 2023
Startup Flip AI has built a custom LLM to run its observability platform. Observability is the act of monitoring corporate IT systems, ferreting out issues or identifying potential problems before they occur. It’s a 24/7 process, and can slow down sites or apps, sometimes causing crashes. Not to be confused with the PDF reader app, Flip AI has trained an LLM specifically to monitor new and emerging challenges. Concurrently, Flip AI has announced $6.5 million in seed funding led by Factory with participation from Morgan Stanley Next Level Fund and GTM Capital. Continue reading Startup Flip AI Creates Custom LLM to Address Observability
By
Paula ParisiNovember 8, 2023
Now anyone can make their own GPT chatbot, for fun or productivity — no coding skills necessary — and soon will be able to list it on a marketplace called the GPT Store. This was among the news announcements to come out of OpenAI’s first developer conference — OpenAI DevDay in San Francisco — where a new, lower-priced model called GPT-4 Turbo with 128K context, was unveiled, along with a new Assistants API, GPT-4 Turbo with Vision and the DALL-E 3 API. Now in preview, GPT-4 Turbo “is more capable and has knowledge of world events up to April 2023,” according to OpenAI. Continue reading OpenAI Intros GPT-4 Turbo, Creator Chatbots at Dev Confab
By
Paula ParisiNovember 7, 2023
Elon Musk’s startup xAI has unveiled its first product, a large language model with chatbot capabilities named Grok, currently available via an early access waitlist with plans to go wide to Premium+ subscribers to the X social platform (formerly Twitter) following beta tests. The company says Grok has “access to search tools and real-time information” and is extremely up-to-date, but “as with all the LLMs trained on next-token prediction, our model can still generate false or contradictory information.” The chatbot is distinguished by sarcasm and wit, “so please don’t use it if you hate humor,” xAI warns. Continue reading Elon Musk’s xAI Rolling Out ‘Grok’ LLM in Early Access Beta
By
Paula ParisiNovember 2, 2023
Social question and answer platform Quora has inserted itself on the leading edge of companies helping creators monetize AI chatbots. Quora’s AI chatbot platform Poe will pay those who create prompt bots on Poe as well as developers of server bots that integrate with the Poe API. “Since this is the beginning of a new market, there are lots of opportunities to provide a valuable service for the world and make money at the same time,” said Quora CEO Adam D’Angelo, envisioning a thriving bot economy across categories from tutoring and therapy to storytelling and roleplay. Continue reading Quora Plans to Foster Chatbot Creator Economy with Poe AI
By
Paula ParisiOctober 27, 2023
The University of Science and Technology of China (USTC) and Tencent YouTu Lab have released a research paper on a new framework called Woodpecker, designed to correct hallucinations in multimodal large language AI models. “Hallucination is a big shadow hanging over the rapidly evolving MLLMs,” writes the group, describing the phenomenon as when MLLMs “output descriptions that are inconsistent with the input image.” Solutions to date focus mainly on “instruction-tuning,” a form of retraining that is data and computation intensive. Woodpecker takes a training-free approach that purports to correct hallucinations from the basis of the generated text. Continue reading Woodpecker: Chinese Researchers Combat AI Hallucinations
By
Paula ParisiOctober 19, 2023
The U.S. Department of Commerce is further curtailing the ability of American companies to sell China advanced chips for artificial intelligence. The national security objective is to avoid providing Beijing with sophisticated silicon that could potentially fuel breakthroughs, giving the nation an advantage in what’s been couched as an “AI arms race.” China is a large market for semiconductors, and the move is said to be fueling tension on both sides of the globe. The new restrictions attempt to plug loopholes in rules the Biden administration introduced in October 2022. Continue reading U.S. Tightens Export Regulations for AI Chip Sales to China
By
Paula ParisiOctober 11, 2023
OpenAI began previewing vision capabilities for GPT-4 in March, and the company is now starting to roll out the image input and output to users of its popular ChatGPT. The multimodal expansion also includes audio functionality, with OpenAI proclaiming late last month that “ChatGPT can now see, hear and speak.” The upgrade vaults GPT-4 into the multimodal category with what OpenAI is apparently calling GPT-4V (for “Vision,” though equally applicable to “Voice”). “We’re rolling out voice and images in ChatGPT to Plus and Enterprise users,” OpenAI announced. Continue reading ChatGPT Goes Multimodal: OpenAI Adds Vision, Voice Ability
By
Paula ParisiOctober 11, 2023
Startup Reka AI is releasing in preview its first artificial intelligence assistant, Yasa-1. The multimodal AI is described as “a language assistant with visual and auditory sensors.” The year-old company says it “trained Yasa-1 from scratch,” including pretraining foundation models “from ground zero,” then aligning them and optimizing to its training and server infrastructures. “Yasa-1 is not just a text assistant, it also understands images, short videos and audio (yes, sounds too),” said Reka AI co-founder and Chief Scientist Yi Tay. Yasa-1 is available via Reka’s APIs and as docker containers for on-site or virtual private cloud deployment. Continue reading Yasa-1: Startup Reka Launches New AI Multimodal Assistant
By
Paula ParisiOctober 10, 2023
Dell Technologies is expanding its Generative AI Solutions portfolio to help enterprise customers add GenAI to their workflow. The expansion includes support for advanced infrastructure and collaborative data solutions that optimize and help secure intelligence gathering and utilization. Dell takes a “validated design” approach to optimization and acceleration, testing different hardware configurations designed to fit the needs of various use cases. Dell has partnered with Nvidia for validated GenAI design for model customization, and with Starburst on data lakehouse solutions that tap multi-cloud data for AI end-use. Continue reading Dell Partnering with Nvidia and Starburst for GenAI Solutions