Pokee Deep Research Model Open Source Repo
Easy Docker setup for Stable Diffusion with user-friendly UI
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
GLM-4 series: Open Multilingual Multimodal Chat LMs
A Pragmatic VLA Foundation Model
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Phi-3.5 for Mac: Locally-run Vision and Language Models
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Sharp Monocular Metric Depth in Less Than a Second
Provides convenient access to the Anthropic REST API from any Python 3
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Qwen3-ASR is an open-source series of ASR models
Block Diffusion for Ultra-Fast Speculative Decoding
Tongyi Deep Research, the Leading Open-source Deep Research Agent
Stable Virtual Camera: Generative View Synthesis with Diffusion Models
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
State-of-the-art (SoTA) text-to-video pre-trained model
Qwen3-omni is a natively end-to-end, omni-modal LLM
An Efficient Agentic Model for Computer Use
Unified Multimodal Understanding and Generation Models
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
A series of math-specific large language models of our Qwen2 series
Fast-stable-diffusion + DreamBooth
Bidirectional token-classification model for identifiable info