Low-latency REST API for serving text embeddings
AI-powered tool for developers, simplifying coding tasks
Developer AI Persona Search Agent
Deterministic LLM Outputs for AI Applications and AI Agents
A library for accelerating Transformer models on NVIDIA GPUs
Time-lapse Video Generation Models as Metamorphic Simulators
The open-source data curation platform for LLMs
Easily turn large sets of image URLs into an image dataset
Build portable, production-ready MLOps pipelines
Open source framework for deep learning on satellite and aerial imagery
20+ high-performance LLMs with recipes to pretrain and finetune at scale
SAPIEN Manipulation Skill Framework
The fastest way to bring multi-agent workflows to production
Fault-tolerant, highly scalable GPU orchestration
Simplifies the local serving of AI models from any source
LLM training in simple, raw C/CUDA
A fast, powerful, and simple hierarchical vision transformer
Training Large Language Models to Reason in a Continuous Latent Space
CodeGeeX4-ALL-9B, a versatile model for all AI software development
Tool for exploring and debugging transformer model behaviors
The NVIDIA AgentIQ toolkit is an open-source library
Finetune Llama 3.2, Mistral, Phi & Gemma LLMs 2-5x faster
SWE-agent takes a GitHub issue and tries to automatically fix it
Finding the Scaling Law of Agents. A multi-agent framework
Multilingual Automatic Speech Recognition with word-level timestamps