By
Paula ParisiSeptember 16, 2024
OpenAI is previewing a new series of AI models that can reason and correct complex coding mistakes, providing a more efficient solution for developers. Powered by OpenAI o1, the new models are “designed to spend more time thinking before they respond, much like a person would,” and as a result can “solve harder problems than previous models in science, coding, and math,” OpenAI claims, noting that “through training, they learn to refine their thinking process, try different strategies, and recognize their mistakes.” The first model in the series is being released in preview in OpenAI’s popular ChatGPT and in the company’s API. Continue reading OpenAI Previews New LLMs Capable of Complex Reasoning
By
Paula ParisiSeptember 6, 2024
Anthropic has launched the Claude Enterprise subscription plan to compete with OpenAI’s ChatGPT Enterprise business solution. Focused on security and administrative controls, Claude Enterprise is designed to help organizations securely collaborate with artificial intelligence using proprietary internal data. Pricing will vary based on the number of seats and how Claude is used but is expected to be more expensive than Claude Pro and Claude Teams ($20 and $25 per month, respectively). An expanded 500K context window, more usage capacity, and a native GitHub integration for work on entire codebases are advantages Anthropic touts for Claude Enterprise. Continue reading Anthropic Announces Enhanced Claude Enterprise Plan for AI
By
Paula ParisiAugust 15, 2024
VFusion3D is the latest AI model unveiled by Meta Platforms, which developed it in conjunction with the University of Oxford. The powerful model, which uses single-perspective images or text prompts to generate high-quality 3D objects, is being hailed as a breakthrough in scalable 3D AI that can potentially transform sectors including VR, gaming and digital design. The platform tackles the challenge of scarce 3D training data in a world teeming with 2D images and text descriptions. The VFusion3D approach leverages what the developers call “a novel method for building scalable 3D generative models utilizing pre-trained video diffusion models.” Continue reading Meta, Oxford Advance 3D Object Generation with VFusion3D
By
Paula ParisiAugust 6, 2024
A new generative AI startup called Black Forest Labs has hit the scene, debuting with a suite of text-to-image models branded FLUX.1. Based in Germany, Black Forest was founded by some of the researchers involved in developing Stable Diffusion and has raised $31 million in funding from principal investor Andreessen Horowitz and angels including CAA founder and former talent agent Michael Ovitz. The FLUX.1 suite focuses on “image detail, prompt adherence, style diversity and scene complexity,” the company says of its three initial variants: FLUX.1 [pro], FLUX.1 [dev] and FLUX.1 [schnell]. Continue reading Black Forest Labs Announces Suite of Text-to-Image Models
By
Paula ParisiJuly 31, 2024
Meta Platforms CEO Mark Zuckerberg unveiled the latest version of computer vision platform SAM 2, an update on the company’s Segment Anything Model that automates for video what the original SAM did for still images — identifying the edges of an object and isolating it in the frame. Zuckerberg demonstrated SAM 2 as part of a SIGGRAPH 2024 keynote session in which he was interviewed by Nvidia CEO Jensen Huang. “Being able to do this in video and have it be zero shot and tell it what you want, it’s pretty cool,” Zuckerberg said. Meta is sharing the code and model weights for SAM 2 with a permissive Apache 2.0 license. Continue reading Mark Zuckerberg Unveils SAM 2 AI Tech for Segmenting Video
By
Paula ParisiJuly 30, 2024
The U.S. Commerce Department has issued a large package of material designed to help AI developers and those using the systems with an approach to identifying and mitigating risks stemming from generative AI and foundation models. Prepared by the National Institute of Standards and Technology and the AI Safety Institute, the guidance includes the initial public draft of its guidelines on “Managing Misuse Risk for Dual-Use Foundation Models.” Dual-use refers to models that can be used for good or ill. The release also includes an open-source software test called Dioptra. Apple is the latest to join the government’s voluntary commitments to responsible AI innovation. Continue reading Apple Joins the Safe AI Initiative as NIST Amps Up Outreach
By
Paula ParisiJuly 1, 2024
The world’s first AI-powered movie camera has surfaced. Still in development, it aims to enable filmmakers to turn footage into AI imagery in real time while shooting. Called the CMR-M1, for camera model 1, it is the product of creative tech agency SpecialGuestX and media firm 1stAveMachine, with the goal of providing creatives with a familiar interface for AI imagemaking. It was inspired by the Cine-Kodak device, the first portable 16mm camera. “We designed a camera that serves as a physical interface to AI models,” said Miguel Espada, co-founder and executive creative technologist at SpecialGuestX, a company that does not think directors will use AI sitting at a keyboard. Continue reading New Prototype Is the World’s First AI-Powered Movie Camera
By
Paula ParisiJune 18, 2024
Zeta Labs has raised $2.9 million in pre-seed round funding and launched JACE, an AI assistant that can autonomously complete complex tasks. The LLM-powered JACE agent executes in-browser actions on command. In fact, Zeta claims JACE is so autonomous that it eliminates the need to be sitting in front of a computer while it executes requests — just tell it what you’d like it to do and let it go. London-based Zeta says it will use the money to expand its engineering team, host training models and improve JACE’s speed and reliability. Continue reading UK’s Zeta Labs Unveils JACE, a Next Generation AI Assistant
By
Paula ParisiJune 4, 2024
A year after its announcement, Fable is launching Showrunner, a platform that lets anyone make TV-style animated content by writing prompts that are turned into shows by generative AI. The San Francisco company run by CEO Edward Saatchi with recruits from Oculus, Pixar and various AI startups is launching 10 shows that let users make their own episodes “from their couch,” waiting only minutes to see the finished result, according to Saatchi, who says a 15-word prompt is enough to generate 10- to 20-minute episodes. Saatchi is hoping Fable’s shows can garner an audience by self-publishing on Amazon Prime. Continue reading Fable Launches Showrunner Animated Episodic TV Generator
By
Paula ParisiMay 20, 2024
Google is offering developers a toolkit for incorporating generative AI features into mobile and web applications. Firebase Genkit, an open-source framework, is available now in beta. Blending models, cloud services, agents, data sources and more in a “code-centric approach” developers are used to, the Genkit makes building and debugging for AI easier, according to Google. The first release is built for JavaScript and TypeScript developers, making building AI-powered apps available to professionals who specialize in building server-side applications using the Node.js JavaScript runtime. Continue reading Firebase Genkit: Developer Framework for AI-Powered Apps
By
Paula ParisiMay 17, 2024
In a move aimed at launching more accessible Android apps, Google has open-sourced code for Project Gameface, a hands-free game control feature released last year that allows users to move a computer with facial and head gestures. Developers will now have more Gameface resources with which to build Android applications for physically challenged users, “to make every Android device more accessible.” Project Gameface evolved as a collaboration with quadriplegic video game streamer Lance Carr, who has muscular dystrophy. The technology uses a smartphone’s front camera to track movement. Continue reading Google Adds Open-Source Gameface for Android Developers
By
Paula ParisiMay 15, 2024
IBM has released a family of its Granite AI models to the open-source community. The series of decoder-only Granite code models are purpose-built to write computer code for enterprise developers, with training in 116 programming languages. These Granite models range in size from 3 to 34 billion parameters in base model and instruction-tuned variants. They offer a range of uses, from modernizing older code with new languages to optimizing programs for on-device memory constraints, such as might be experienced when conforming for mobile gadgets. In addition to generation, the models can repair and explain code. Continue reading IBM Introduces Granite LLMs for Enterprise Code Developers
By
Paula ParisiMay 9, 2024
Amazon Web Services has added Bedrock Studio to the lineup of offerings at Bedrock, its cloud-based managed service for artificial intelligence. Available beginning this week in public preview, Bedrock Studio aims to greatly simplify the development of artificial intelligence apps for its subscribers, allowing them to get going “within minutes,” according to AWS. Developers can log in to the Amazon Bedrock Studio web experience using single sign-on (SSO) company credentials “and instantly start experimenting with Bedrock FMs and tools to build applications,” the cloud giant says, describing the studio as a “rapid prototyping environment” for generative AI. Continue reading AWS Simplifies AI Development with Amazon Bedrock Studio
Amazon has pulled the plug on CodeWhisperer, which has been incorporated into its Q Developer product, announced at November’s re:Invent as part of an AI-powered AWS enterprise suite called Amazon Q, which also includes Q Business. Both Q Developer, which enables natural language coding, and Q Business, a data-driven productivity tool are now in general release, and Q Apps has just been added in preview, letting employees “build generative AI-powered apps from their company’s data, without any prior coding experience.” The move comes as Amazon seeks to gain ground on the Microsoft-owned GitHub’s AI coding products. Continue reading AWS CodeWhisperer Is Rebranded as Part of Amazon Q Suite
GitHub has introduced Copilot Workspace, a Copilot-native developer environment for artificial intelligence, in technical preview. Developers are invited to sign up for a waitlist for the service, which allows the use of natural language to plan, build, test and run code. The Microsoft-owned company has introduced various aspects of Copilot over the past few years, adding an autocomplete pair programmer in 2022, and in 2023 Copilot Chat for natural language coding, debugging and testing, “allowing developers to converse with their code in real time.” The “task-centric” Copilot Workspace leverages different agents for a “start-to-finish experience.” Continue reading GitHub Puts Copilot Workspace Developer Platform in Preview