Recovering the Visual Space from Any Views
Contexts Optical Compression
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
GLM-4 series: Open Multilingual Multimodal Chat LMs
Sharp Monocular Metric Depth in Less Than a Second
Multimodal model achieving SOTA performance
Official implementation of DreamCraft3D
Open-source pre-training implementation of Google's LaMDA in PyTorch
Large language model developed and released by NVIDIA
Hermes 4 FP8: hybrid reasoning Llama-3.1-405B model by Nous Research
Grok-2.5: large-scale xAI model for local inference with SGLang
Efficient MoE model for million-token reasoning and coding
Quantized 675B multimodal instruct model optimized for NVFP4
Efficient 8B multimodal model tuned for advanced reasoning tasks
High-precision 14B multimodal model built for advanced reasoning tasks
Text-to-image model optimized for artistic quality and safe generation
High-compute ultra-reasoning model surpassing GPT-5
Jan-v1-edge: efficient 1.7B reasoning model optimized for edge devices
Small 3B-base multimodal model ideal for custom AI on edge hardware
Compact 3B-param multimodal model for efficient on-device reasoning
Powerful 14B-base multimodal model: flexible base for fine-tuning