By
Paula ParisiDecember 13, 2022
Alphabet’s AI offshoot DeepMind has created an AI tool called Dramatron that can help co-write scripts, generating things like plot points, character and location descriptions and dialogue. While a human will still need to manage the process by editing and rewriting Dramatron’s suggestions, the app is designed to make the screenwriting process faster and easier. To deploy Dramatron, users will need an OpenAI API key and, ideally, a Perspective API key to minimize the risk of “offensive text.” In addition to AI researchers, DeepMind tested the tool with 15 playwrights and screenwriters who used it to co-write scripts. Continue reading DeepMind Tool Provides AI-Powered Screenplay Assistance
By
Paula ParisiDecember 7, 2022
OpenAI’s new AI chatbot, ChatGPT, is taking the world by storm. “Quite simply, the best artificial intelligence chatbot ever released to the general public,” is how The New York Times describes ChatGPT, which more than a million people signed up for when it opened for testing last week. Screenshots of ChatGPT conversations blew up Twitter. “Something big is happening,” tweeted one fan. “I just had a 20-minute conversation with ChatGPT about the history of physics … OMG,” offered another. The acronym stands for “generative pre-trained transformer,” a language model that leverages deep learning to respond to text-based input with human-like responses. Continue reading ChatGPT: OpenAI’s New Chatbot Draws Praise and Criticism
By
Paula ParisiNovember 18, 2022
Microsoft has entered into a multi-year deal with Nvidia to build what they’re calling “one of the world’s most advanced supercomputers,” powered by Microsoft Azure’s advanced supercomputing infrastructure combined with Nvidia GPUs, networking and full stack of AI software to help enterprises train, deploy and scale AI, including large, state-of-the-art models. “AI is fueling the next wave of automation across enterprises and industrial computing, enabling organizations to do more with less as they navigate economic uncertainties,” Microsoft cloud and AI group executive VP Scott Guthrie said of the alliance. Continue reading Microsoft, Nvidia Partner on Azure-Hosted AI Supercomputer
By
Paula ParisiOctober 14, 2022
Microsoft announced it is integrating OpenAI’s DALL-E 2 into its new Microsoft Designer app, as well as its Microsoft Edge browser and the Image Creator tool in its Bing search engine. Microsoft provides cloud computing services to OpenAI and has partnered with OpenAI in AI commercialization efforts including the Azure OpenAI Service, now in preview, and GitHub Copilot. The Designer web app can be used to create designs for posters, presentations, invitations and other graphics that can be printed and used for display or shared on social or business media. Continue reading Microsoft Integrates DALL-E 2 into Designer and Creator Apps
By
Paula ParisiOctober 10, 2022
AI image generators like OpenAI’s DALL-E 2 and Google’s Imagen have been generating a lot of attention recently. Now AI text-to-video generators are edging into the spotlight, with Google debuting Imagen Video on the heels of Meta AI’s Make-A-Video rollout last month. Imagen Video has been used to generate videos of up to 25-minutes at a 24 fps, 1280×768 pixel spec. Imagen Video was trained “on a combination of an internal dataset consisting of 14 million video-text pairs and 60 million image-text pairs,” resulting in some unusual functionality, according to Google Research. Continue reading Google and Meta Are Developing AI Text-to-Video Generators
By
Paula ParisiSeptember 26, 2022
OpenAI has released a new open source AI speech recognition model called Whisper that can recognize and translate audio at levels it says compare in accuracy and robustness to human abilities. Case uses include transcription of speeches, interviews, podcasts and conversations. “Moreover, it enables transcription in multiple languages, as well as translation from those languages into English,” says OpenAI, which is open-sourcing models and inference code on GitHub “to serve as a foundation for building useful applications and for further research on robust speech processing.” Continue reading OpenAI Rolls Out Open-Source Speech Recognition System
By
Paula ParisiSeptember 21, 2022
OpenAI has begun allowing users of its DALL-E 2 image-generating system to work with facial image uploads. The program previously allowed only computer-generated faces in an effort to prevent deepfakes and misuse, but OpenAI says improvements to its safety system succeeded in “minimizing the potential of harm” from things like explicit, political or violent content. OpenAI will continue to prohibit use of unauthorized photos and will seek to protect right of publicity, though it remains to be seen how effective that will be. In the past, customers have complained the company was overzealous in its policing. Continue reading OpenAI Expands DALL-E 2 Functionality with Facial Uploads
By
Paula ParisiAugust 26, 2022
Virtual character developer platform Inworld AI has raised $50 million in a Series A funding round led by Section 32 and Intel Capital. The Mountain View-based startup — one of six companies chosen to participate in the 2022 Disney Accelerator — will create virtual characters for games, the metaverse and other entertainment and marketing applications. Because it is focused on providing an interior life, or “mind,” Inworld AI is platform agnostic, with APIs that work across Unity, Unreal Engine, Omniverse and others. Another convenient feature: it lets developers build characters by describing them in natural language. Continue reading Inworld Raises $50M to Create AI-Powered Virtual Characters
By
Paula ParisiAugust 18, 2022
Stability AI is in the first stage of release of Stable Diffusion, a text-to-image generator similar in functionality to OpenAI’s DALL-E 2, with one important distinction: this open-source newcomer lacks the filters that prevent the earlier system from creating images of public figures or content deemed excessively toxic. Last week the Stable Diffusion code was made available to just over a thousand researchers and the Los Altos-based startup anticipates a public release in the coming weeks. The unfettered unleashing of a powerful imaging system has stirred controversy in the AI community, raising ethical questions. Continue reading Stability AI Releases Stable Diffusion Text-to-Image Generator
By
Paula ParisiAugust 12, 2022
OpenAI’s powerful text-to-image generator DALL-E 2 is still in beta, but businesses are already testing it for commercial use. Apparel firm Stitch Fix has been using it to visualize fabric and color personalization, while Heinz tapped the AI system for a marketing campaign. Cosmopolitan used it to design a magazine cover. Others have leveraged the image engine to generate logos and thumbnails. These early adopters are identifying technical issues that OpenAI says it is addressing as it readies DALL-E 2 for enterprise. Foremost among the complaints is the lack of a dedicated API for public use. Continue reading Businesses Experiment with DALL-E 2, Report Mixed Results
By
Paula ParisiAugust 2, 2022
Nvidia has issued a software update for its formidable NeMo Megatron giant language training model, increasing efficiency and speed. Barely a year since Nvidia unveiled Megatron, this latest improvement further leverages the transformer engine architecture that has become synonymous with deep learning since Google introduced the concept in 2017. New features result in what Nvidia says is a 5x reduction in memory requirements and up to a 30 percent gain in speed for models as large as 1 trillion parameters, making NeMo Megatron better at handling transformer tasks across the entire stack. Continue reading Nvidia Turbo Charges NeMo Megatron Large Training Model
By
Paula ParisiJuly 26, 2022
OpenAI is expanding its beta outreach for DALL-E 2 by inviting an additional one million waitlisted people to join the AI imaging platform over the coming weeks. DALL-E users will receive 50 credits during their first month of use and 15 credits every subsequent month, with each credit redeemable for an original DALL-E-prompted generation (returning four images) or an edit or variation prompt (which returns three images). Additional credits may be purchased in 115-generation increments for $15. Starting this month, users get rights to commercialize their DALL-E images. However, the move highlights the legal implications of AI and possible copyright infringement. Continue reading Legal Questions Loom as OpenAI Widens Access to DALL-E
By
Paula ParisiJune 28, 2022
New AI-powered coding tools such as Amazon’s CodeWhisperer and Copilot from GitHub and OpenAI may be giving some developers the jitters. Following splashy debuts for both programs last week, GitHub CEO Thomas Dohmke offered public assurances that Copilot is not designed to replace coders, but to speed the process, alleviating a software developer shortage. Similar to Copilot, CodeWhisperer can autocomplete Java, JavaScript and Python functions based on a comment or some keystrokes. Amazon says it trained the system using billions of lines of open source code, publicly available documentation and its own codebase. Continue reading AI Coding Tools Speed Process to Offset Developer Shortage
By
Paula ParisiJune 16, 2022
Adobe is releasing an open source developer toolkit that aims to prevent the spread of visual misinformation by including additional metadata that Adobe calls Content Credentials. The system is also designed to help content creators indelibly tag authorship to their work. Announced in 2019, the Content Authenticity Initiative (CAI) project has released a whitepaper introducing the system, which is integrated into Adobe software. The CAI has teamed with hardware manufacturers and newsrooms to help ubiquitize its vision. The Associated Press, The New York Times and The Wall Street Journal have signed aboard. Continue reading Adobe Debuts ‘Content Credentials’ to Battle Misinformation
By
Paula ParisiMay 27, 2022
Microsoft is previewing its express design in Power Apps, which can instantly generate low-code apps directly from design files and images. In a few clicks, anyone can now create web and mobile apps from inputs including paper forms, PDFs, sketches on the whiteboard or even assets designed in professional programs like Figma. As part of the Microsoft Power Platform, Power Apps uses advanced AI to accelerate design. “We’re particularly excited about our integration with Figma, the collaborative design platform, where so much software is designed today,” said Microsoft vice president of Power Apps Ryan Cunningham. Continue reading AI-Driven Microsoft Power Apps Offers Development Shortcuts