A series of math-specific large language models of our Qwen2 series
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Capable of understanding text, audio, vision, video
Qwen2.5-VL is the multimodal large language model series
The most powerful local music generation model
Robust Speech Recognition Across Languages, Dialects
A Systematic Framework for Interactive World Modeling
Official inference repo for FLUX.1 models
Diffusion Transformer with Fine-Grained Chinese Understanding
Qwen3-Coder is the code version of Qwen3
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
ChatGPT interface with better UI
Wan2.2: Open and Advanced Large-Scale Video Generative Model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Open-source deep-learning framework
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
Qwen2.5-Coder is the code version of Qwen2.5, the large language model
State-of-the-art TTS model under 25MB
High-Fidelity and Controllable Generation of Textured 3D Assets
Designed for text embedding and ranking tasks
StudioOllamaUI is a local, portable interface for Ollama
AI Suite for upscaling, interpolating & restoring images/videos
A minimal PyTorch re-implementation of the OpenAI GPT
A latent text-to-image diffusion model
A mix of GAN implementations including progressive growing