Tongyi Deep Research, the Leading Open-source Deep Research Agent
Video Object and Interaction Deletion
Accurate × Fast × Comprehensive
Qwen-Image-Layered: Layered Decomposition for Inherent Editablity
The official repo of Qwen chat & pretrained large language model
gpt-oss-120b and gpt-oss-20b are two open-weight language models
Recovering the Visual Space from Any Views
Contexts Optical Compression
Sharp Monocular Metric Depth in Less Than a Second
Project Lyra: Open Generative 3D World Models
Open-source deep-learning framework
State-of-the-art TTS model under 25MB
Audio foundation model excelling in audio understanding
Tool for exploring and debugging transformer model behaviors
Multimodal Diffusion with Representation Alignment
PyTorch code and models for the DINOv2 self-supervised learning
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Official implementation of DreamCraft3D
A Customizable Image-to-Video Model based on HunyuanVideo
An AI-powered security review GitHub Action using Claude
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Python bindings for llama.cpp
An Efficient Agentic Model for Computer Use
Open Source Speech Language Model
Long-form streaming TTS system for multi-speaker dialogue generation