Agentic, Reasoning, and Coding (ARC) foundation models
Video Object and Interaction Deletion
Reference PyTorch implementation and models for DINOv3
Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Implementation of the Surya Foundation Model for Heliophysics
Official implementation of DreamCraft3D
A 0.1B Omni model trained from scratch
Open Source Speech Language Model
Qwen3-ASR is an open-source series of ASR models
Code for running inference with the SAM 3D Body Model 3DB
High-resolution models for human tasks
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Multimodal Diffusion with Representation Alignment
Native and Compact Structured Latents for 3D Generation
High-Resolution 3D Assets Generation with Large Scale Diffusion Models
A Family of Open Sourced Music Foundation Models
A Production-ready Reinforcement Learning AI Agent Library
Netease Youdao's open-source embedding and reranker models
A theoretical reconstruction of the Claude Mythos architecture
Diversity-driven optimization and large-model reasoning ability
Qwen3-Coder is the code version of Qwen3
Uncommon Objects in 3D dataset
Advancing Open-source World Models