Code for running inference and finetuning with SAM 3 model
AlphaFold 3 inference pipeline
Recovering the Visual Space from Any Views
Official Python inference and LoRA trainer package
Phi-3.5 for Mac: Locally-run Vision and Language Models
A Powerful Native Multimodal Model for Image Generation
Fast, Sharp & Reliable Agentic Intelligence
HY-Motion model for 3D character animation generation
Collection of Gemma 3 variants that are trained for performance
Provides convenient access to the Anthropic REST API from any Python 3
INT4/INT5/INT8 and FP16 inference on CPU for RWKV language model
Achieving 3+ generation speedup on reasoning tasks
Uncommon Objects in 3D dataset
A theoretical reconstruction of the Claude Mythos architecture
GPT4V-level open-source multi-modal model based on Llama3-8B
LLM-based Reinforcement Learning audio edit model
Runtime extension of Proximus enabling Deployment on AMD Ryzen™ AI
GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
A method to increase the speed and lower the memory footprint
An implementation of model parallel GPT-2 and GPT-3-style models
Large language model developed and released by NVIDIA
LL model providing reasoning and conversational capabilities
Open language model developed by NVIDIA as part of Nemotron-3 family
High-performance MoE model with MLA, MTP, and multilingual reasoning