Code for running inference with the SAM 3D Body Model 3DB
Pokee Deep Research Model Open Source Repo
GLM-4-Voice | End-to-End Chinese-English Conversational Model
A series of math-specific large language models of our Qwen2 series
1B text generation model based on the HRM architecture
Foundation model for image generation
Z80-μLM is a 2-bit quantized language model
Personalize Any Characters with a Scalable Diffusion Transformer
Open-source deep-learning framework
Fast and Universal 3D reconstruction model for versatile tasks
PyTorch code and models for the DINOv2 self-supervised learning
Advancing Open-source World Models
A Systematic Framework for Interactive World Modeling
Unified Multimodal Understanding and Generation Models
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Mixture-of-Experts Vision-Language Models for Advanced Multimodal
Renderer for the harmony response format to be used with gpt-oss
Implementation of the Surya Foundation Model for Heliophysics
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
RGBD video generation model conditioned on camera input
Qwen3-omni is a natively end-to-end, omni-modal LLM
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
The official PyTorch implementation of Google's Gemma models
Pretrained time-series foundation model developed by Google Research
Bidirectional token-classification model for identifiable info