Powerful AI language model (MoE) optimized for efficiency/performance
Reference PyTorch implementation and models for DINOv3
MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training
PyTorch implementation of JiT
DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models
Lets make video diffusion practical
High-Resolution Image Synthesis with Latent Diffusion Models
The most powerful local music generation model
HY-Motion model for 3D character animation generation
PyTorch code and models for the DINOv2 self-supervised learning
From Images to High-Fidelity 3D Assets
A Customizable Image-to-Video Model based on HunyuanVideo
Open-source, high-performance AI model with advanced reasoning
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
An experimental version of DeepSeek model
Qwen3 is the large language model series developed by Qwen team
A Systematic Framework for Interactive World Modeling
Awesome multilingual OCR toolkits based on PaddlePaddle
Language modeling in a sentence representation space
Implementation of "MobileCLIP" CVPR 2024
Repo for SeedVR2 & SeedVR
Official Python inference and LoRA trainer package
From Vibe Coding to Agentic Engineering
RGBD video generation model conditioned on camera input
Multimodal model achieving SOTA performance