Agentic, Reasoning, and Coding (ARC) foundation models
Open-source, high-performance AI model with advanced reasoning
Powerful AI language model (MoE) optimized for efficiency and performance
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Native and Compact Structured Latents for 3D Generation
gpt-oss-120b and gpt-oss-20b are two open-weight language models
The most powerful local music generation model
Official Python inference and LoRA trainer package
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Official inference repo for FLUX.1 models
Wan2.1: Open and Advanced Large-Scale Video Generative Model
Awesome multilingual OCR toolkits based on PaddlePaddle
A theoretical reconstruction of the Claude Mythos architecture
Fast stable diffusion on CPU and AI PC
Official inference repo for FLUX.2 models
Advanced language and coding AI model
Python inference and LoRA trainer package for the LTX-2 audio–video
Open-source multi-speaker long-form text-to-speech model
Qwen3-TTS is an open-source series of TTS models
Text and image to video generation: CogVideoX and CogVideo
State-of-the-art TTS model under 25MB
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A Family of Open-Source Music Foundation Models
Reference PyTorch implementation and models for DINOv3
Qwen2.5-VL is the multimodal large language model series