Generating Immersive, Explorable, and Interactive 3D Worlds
Repo for SeedVR2 & SeedVR
GLM-4 series: Open Multilingual Multimodal Chat LMs
Foundation model for image generation
Hunyuan Translation Model Version 1.5
Block Diffusion for Ultra-Fast Speculative Decoding
A Family of Open Sourced Music Foundation Models
A Powerful Native Multimodal Model for Image Generation
A series of math-specific large language models of our Qwen2 series
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Recovering the Visual Space from Any Views
A Unified Framework for Text-to-3D and Image-to-3D Generation
Multimodal-Driven Architecture for Customized Video Generation
Bidirectional token-classification model for identifiable info
Open-Source Financial Large Language Models
HY-Motion model for 3D character animation generation
A Multi-Modal World Model for Reconstructing, Generating, Simulation
Code for running inference with the SAM 3D Body Model 3DB
Pokee Deep Research Model Open Source Repo
Unified Multimodal Understanding and Generation Models
Global weather forecasting model using graph neural networks and JAX
Sharp Monocular Metric Depth in Less Than a Second
CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning
RGBD video generation model conditioned on camera input