A Family of Open Sourced Music Foundation Models
LTX-Video Support for ComfyUI
Advancing Open-source World Models
A Pragmatic VLA Foundation Model
Fast and Universal 3D reconstruction model for versatile tasks
Per-Pixel Classification is Not All You Need for Semantic Segmentation
Qwen2.5-VL-3B-Instruct: Multimodal model for chat, vision & video