Models for object and human mesh reconstruction
Python bindings for llama.cpp
LTX-Video Support for ComfyUI
Qwen-Image-Layered: Layered Decomposition for Inherent Editability
Let's make video diffusion practical
State-of-the-art TTS model under 25MB
Foundation model for image generation
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Industrial-level controllable zero-shot text-to-speech system
DeepSeek Coder: Let the Code Write Itself
Qwen2.5-VL is the multimodal large language model series
Stable Diffusion WebUI Forge is a platform on top of Stable Diffusion
An experimental version of the DeepSeek model
The official repo of Qwen chat & pretrained large language model
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Video Object and Interaction Deletion
Accurate × Fast × Comprehensive
Open-Source Financial Large Language Models
Contexts Optical Compression
Provides convenient access to the Anthropic REST API from any Python 3
Recovering the Visual Space from Any Views
Bidirectional token-classification model for identifiable info
Advancing Open-source World Models
A Systematic Framework for Interactive World Modeling