The common language for platforms, agents and businesses.
Unified web UI for training and running open models locally
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
Parallax is a distributed model serving framework
Easy Docker setup for Stable Diffusion with user-friendly UI
Designed for training LLM/VLM agents via RL
Chat & pretrained large vision language model
A state-of-the-art open visual language model
Chinese and English multimodal conversational language model
LTX-Video Support for ComfyUI
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
The power of Claude Code / GeminiCLI / CodexCLI
Get started w/ building Fullstack Agents using Gemini 2.5 & LangGraph
A personal context-agent that learns how you work
Outcome driven agent development framework that evolves
An Efficient Agentic Model for Computer Use
State-of-the-art (SoTA) text-to-video pre-trained model
Framework for building realtime multimodal voice AI agents apps
The official Python SDK for UCP
Open-source MCP server that gives your coding agent
Sharp Monocular Metric Depth in Less Than a Second
A Customizable Image-to-Video Model based on HunyuanVideo
Accelerate local LLM inference and finetuning
AI based photo editing website for changing image background