TensorRT-LLM Archives

Mistral, Nvidia Bring Enterprise AI to Desktop with NeMo 12B

By Paula Parisi
July 24, 2024

Nvidia and French startup Mistral AI are jointly releasing a new language model called Mistral NeMo 12B that brings enterprise AI capabilities to the desktop without the need for major cloud resources. Developers can easily customize and deploy the new LLM for applications supporting chatbots, multilingual tasks, coding and summarization, according to Nvidia. “NeMo 12B offers a large context window of up to 128k tokens,” explains Mistral, adding that “its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category.” Available under the Apache 2.0 license, it is easy to implement as a drop-in replacement for Mistral 7B. Continue reading Mistral, Nvidia Bring Enterprise AI to Desktop with NeMo 12B

Nvidia’s Open Models to Provide Free Training Data for LLMs

By Paula Parisi
June 18, 2024

Nvidia is expanding its substantive influence in the AI sphere with Nemotron-4 340B, a family of open models designed to generate synthetic LLM training data for commercial applications across numerous fields. Through what Nvidia is calling a “uniquely permissive” free open model license, Nemotron-4 340B provides a scalable way for developers to build LLMs. Synthetic data is artificially generated data designed to mimic the characteristics and structure of data found in the real world. The offering is being called “groundbreaking” and an important step toward the democratization of artificial intelligence. Continue reading Nvidia’s Open Models to Provide Free Training Data for LLMs