Fast and Universal 3D reconstruction model for versatile tasks
Foundation Models for Time Series
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Research code artifacts for Code World Model (CWM)
State-of-the-art TTS model under 25MB
DeepMind model for tracking arbitrary points across videos & robotics
code for Mesh R-CNN, ICCV 2019
Uncommon Objects in 3D dataset
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
A series of math-specific large language models of our Qwen2 series
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
High-Fidelity and Controllable Generation of Textured 3D Assets
Qwen3-omni is a natively end-to-end, omni-modal LLM
Inference code for scalable emulation of protein equilibrium ensembles
Audio foundation model excelling in audio understanding
Revolutionizing Database Interactions with Private LLM Technology
Tiny vision language model
Repo for SeedVR2 & SeedVR
Official implementation of Watermark Anything with Localized Messages
Generate Any 3D Scene in Seconds
FAIR Sequence Modeling Toolkit 2
ICLR2024 Spotlight: curation/training code, metadata, distribution
CogView4, CogView3-Plus and CogView3(ECCV 2024)
Official implementation of DreamCraft3D