Recovering the Visual Space from Any Views
Accurate × Fast × Comprehensive
Open-Source Financial Large Language Models
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Open-source deep-learning framework
Foundation Models for Time Series
An Efficient Agentic Model for Computer Use
Diffusion Bee is the easiest way to run Stable Diffusion locally
Robust Speech Recognition Across Languages, Dialects
Advancing Open-source World Models
A Systematic Framework for Interactive World Modeling
Qwen2.5-VL is the multimodal large language model series
Bidirectional token-classification model for identifiable info
General-purpose image editing model that delivers high-fidelity
Easy Docker setup for Stable Diffusion with user-friendly UI
PyTorch code and models for the DINOv2 self-supervised learning
Open-source large language model family from Tencent Hunyuan
A 0.1B Omni model trained from scratch
Qwen3-ASR is an open-source series of ASR models
Block Diffusion for Ultra-Fast Speculative Decoding
Official repository for LTX-Video
Official implementation of Watermark Anything with Localized Messages
Multimodal Diffusion with Representation Alignment
Diffusion model (SD, Flux, Wan, Qwen Image, Z-Image, ...) inference
Phi-3.5 for Mac: Locally-run Vision and Language Models