Advanced language and coding AI model
Multimodal Diffusion with Representation Alignment
Fast stable diffusion on CPU and AI PC
Code for Mesh R-CNN, ICCV 2019
One-click local MCP server installation in desktop apps
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Foundation Models for Time Series
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Qwen3-Omni is a natively end-to-end, omni-modal LLM
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Audio foundation model excelling in audio understanding
MiniMax-M2, a model built for Max coding & agentic workflows
Implementation of "MobileCLIP", CVPR 2024
Pokee Deep Research Model Open Source Repo
Capable of understanding text, audio, vision, video
Extension index for stable-diffusion-webui
Ultra-Efficient LLMs on End Devices
Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation
Learning Continuous Signed Distance Functions for Shape Representation
Multimodal agent model for coding, orchestration, and autonomy
Flagship MoE model for advanced reasoning, coding, and agents