A fast TTS architecture with conditional flow matching
Foundational model for human-like, expressive TTS
A TTS model capable of generating ultra-realistic dialogue
AI discovers 520000 stable inorganic crystal structures for research
Sharp Monocular Metric Depth in Less Than a Second
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
A Powerful Native Multimodal Model for Image Generation
Educational framework exploring multi-agent orchestration
A series of math-specific large language models of our Qwen2 series
Inference framework for 1-bit LLMs
Library of self-supervised methods for visual representation
NVIDIA Federated Learning Application Runtime Environment
Detecting silent model failure. NannyML estimates performance
PyTorch extensions for fast R&D prototyping and Kaggle farming
Multilingual sentence & image embeddings with BERT
Python package built to ease deep learning on graph
A neural network that transforms a design mock-up into static websites
20+ high-performance LLMs with recipes to pretrain, finetune at scale
SAPIEN Manipulation Skill Framework
The fastest way to bring multi-agent workflows to production
CRAB: Cross-environment Agent Benchmark for Multimodal Language Model
A code-first agent framework for seamlessly planning analytics tasks
Supercharge Your LLM Application Evaluations
A Telegram bot that integrates with OpenAI's official ChatGPT APIs
GPU environment management and cluster orchestration