Fast and Universal 3D reconstruction model for versatile tasks
Python inference and LoRA trainer package for the LTX-2 audio–video
A multimodal model for brain response prediction
GLM-Image: Auto-regressive for Dense-knowledge and High-fidelity Image
Qwen2.5-VL is the multimodal large language model series
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
AI Suite for upscaling, interpolating & restoring images/videos