Python SDK for Claude Agent
Bidirectional token-classification model for identifiable info
General-purpose image editing model that delivers high-fidelity
Easy Docker setup for Stable Diffusion with user-friendly UI
PyTorch code and models for the DINOv2 self-supervised learning
Open-source large language model family from Tencent Hunyuan
Code for running inference with the SAM 3D Body Model 3DB
Sharp Monocular Metric Depth in Less Than a Second
Towards self-verifiable mathematical reasoning
An Efficient Agentic Model for Computer Use
Robust Speech Recognition Across Languages, Dialects
A 0.1B Omni model trained from scratch
Qwen3-ASR is an open-source series of ASR models
Block Diffusion for Ultra-Fast Speculative Decoding
Official repository for LTX-Video
Official implementation of Watermark Anything with Localized Messages
Multimodal Diffusion with Representation Alignment
A Customizable Image-to-Video Model based on HunyuanVideo
Generating Immersive, Explorable, and Interactive 3D Worlds
Tongyi Deep Research, the Leading Open-source Deep Research Agent
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Qwen-Image is a powerful image generation foundation model
Qwen3-omni is a natively end-to-end, omni-modal LLM
Phi-3.5 for Mac: Locally-run Vision and Language Models
Revolutionizing Database Interactions with Private LLM Technology