Awesome multilingual OCR toolkits based on PaddlePaddle
High-Resolution Image Synthesis with Latent Diffusion Models
Personalize Any Characters with a Scalable Diffusion Transformer
Contexts Optical Compression
Collection of Gemma 3 variants that are trained for performance
Robust Speech Recognition Across Languages, Dialects
From Images to High-Fidelity 3D Assets
A Powerful Native Multimodal Model for Image Generation
Let's make video diffusion practical
Pokee Deep Research Model Open Source Repo
Qwen3-TTS is an open-source series of TTS models
Code for running inference and finetuning with the SAM 3 model
Phi-3.5 for Mac: Locally-run Vision and Language Models
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Powerful AI language model (MoE) optimized for efficiency and performance
Advanced language and coding AI model
Z80-μLM is a 2-bit quantized language model
Inference script for Oasis 500M
HY-Motion model for 3D character animation generation
Qwen-Image is a powerful image generation foundation model
The official repo of Qwen chat & pretrained large language model
AlphaFold 3 inference pipeline
Video Object and Interaction Deletion
Agentic, Reasoning, and Coding (ARC) foundation models
Reference PyTorch implementation and models for DINOv3