Awesome multilingual OCR toolkits based on PaddlePaddle
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Agentic, Reasoning, and Coding (ARC) foundation models
Official inference repo for FLUX.1 models
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Advanced language and coding AI model
Official inference repo for FLUX.2 models
A Family of Open-Sourced Music Foundation Models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Text and image to video generation: CogVideoX and CogVideo
State-of-the-art TTS model under 25MB
Recovering the Visual Space from Any Views
Accurate × Fast × Comprehensive
Visual Causal Flow
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Qwen2.5-VL is the multimodal large language model series
Official repository for LTX-Video
The Clay Foundation Model - An open-source AI model and interface
Let's make video diffusion practical
The official repo of Qwen chat & pretrained large language model
Programmatic access to the AlphaGenome model
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Open-source deep-learning framework
Generate Any 3D Scene in Seconds