Reference PyTorch implementation and models for DINOv3
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
Powerful AI language model (MoE) optimized for efficiency/performance
Let's make video diffusion practical
High-Resolution Image Synthesis with Latent Diffusion Models
The most powerful local music generation model
HY-Motion model for 3D character animation generation
PyTorch code and models for the DINOv2 self-supervised learning
Qwen3 is the large language model series developed by the Qwen team
Open-source, high-performance AI model with advanced reasoning
From Images to High-Fidelity 3D Assets
A Customizable Image-to-Video Model based on HunyuanVideo
An experimental version of DeepSeek model
A Systematic Framework for Interactive World Modeling
Release for Improved Denoising Diffusion Probabilistic Models
Implementation of "MobileCLIP" CVPR 2024
RGBD video generation model conditioned on camera input
4M: Massively Multimodal Masked Modeling
Memory-efficient and performant finetuning of Mistral's models
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Official DeiT repository
Awesome multilingual OCR toolkits based on PaddlePaddle
Official Python inference and LoRA trainer package
Language modeling in a sentence representation space
Repo for SeedVR2 & SeedVR