GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Official inference repo for FLUX.2 models
Code for running inference with the SAM 3D Body Model 3DB
Qwen3 is the large language model series developed by the Qwen team
An experimental version of the DeepSeek model
MapAnything: Universal Feed-Forward Metric 3D Reconstruction
Visual Causal Flow
PyTorch code and models for the DINOv2 self-supervised learning
Inference code for scalable emulation of protein equilibrium ensembles
Accurate × Fast × Comprehensive
Advancing Open-source World Models
CLIP: Predict the most relevant text snippet given an image
GLM-4 series: Open Multilingual Multimodal Chat LMs
Let's make video diffusion practical
Diversity-driven optimization and large-model reasoning ability
Models for object and human mesh reconstruction
Recovering the Visual Space from Any Views
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
A Powerful Native Multimodal Model for Image Generation
High-Fidelity and Controllable Generation of Textured 3D Assets
OCR expert VLM powered by Hunyuan's native multimodal architecture
Ling is a MoE LLM provided and open-sourced by InclusionAI
4M: Massively Multimodal Masked Modeling
Long-form streaming TTS system for multi-speaker dialogue generation