Tongyi Deep Research, the Leading Open-source Deep Research Agent
Generate Any 3D Scene in Seconds
Release for Improved Denoising Diffusion Probabilistic Models
The lightweight PyTorch wrapper for high-performance AI research
"Big Model" trains a visual multimodal VLM with 26M parameters
Research code artifacts for Code World Model (CWM)
Large Multimodal Models for Video Understanding and Editing
The official repo of Qwen chat & pretrained large language model
FAIR Sequence Modeling Toolkit 2
High-resolution models for human tasks
Global weather forecasting model using graph neural networks and JAX
Investment Research for Everyone, Everywhere
CogView4, CogView3-Plus and CogView3 (ECCV 2024)
Diversity-driven optimization and large-model reasoning ability
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Models for object and human mesh reconstruction
AlphaFold 3 inference pipeline
Reproduction of Poetiq's record-breaking submission to the ARC-AGI-1
State-of-the-art Image & Video CLIP, Multimodal Large Language Models
Deep Research framework, combining language models with tools
VMZ: Model Zoo for Video Modeling
Benchmarking Multimodal Agents for Open-Ended Tasks
Tooling for the Common Objects In 3D dataset
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environment
Code for running inference with the SAM 3D Body Model (3DB)