A Systematic Framework for Interactive World Modeling
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
GPT4V-level open-source multi-modal model based on Llama3-8B
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
A series of math-specific large language models of our Qwen2 series
Implementation of the Surya Foundation Model for Heliophysics
Chinese and English multimodal conversational language model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
High-Fidelity and Controllable Generation of Textured 3D Assets
Open-source framework for intelligent speech interaction
Qwen3-omni is a natively end-to-end, omni-modal LLM
A state-of-the-art open visual language model
Inference code for scalable emulation of protein equilibrium ensembles
The Clay Foundation Model - An open source AI model and interface
Audio foundation model excelling in audio understanding
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Tiny vision language model
Video Object and Interaction Deletion
Open Source Speech Language Model
Long-form streaming TTS system for multi-speaker dialogue generation
Open-source industrial-grade ASR models
A Pragmatic VLA Foundation Model
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Hunyuan Translation Model Version 1.5
Block Diffusion for Ultra-Fast Speculative Decoding