Advancing Open-source World Models
Qwen3 is the large language model series developed by Qwen team
Models for object and human mesh reconstruction
An experimental version of DeepSeek model
Lets make video diffusion practical
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Visual Causal Flow
Recovering the Visual Space from Any Views
Z80-μLM is a 2-bit quantized language model
PyTorch code and models for the DINOv2 self-supervised learning
Ling is a MoE LLM provided and open-sourced by InclusionAI
Diversity-driven optimization and large-model reasoning ability
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Inference code for scalable emulation of protein equilibrium ensembles
ChatGLM-6B: An Open Bilingual Dialogue Language Model
A Customizable Image-to-Video Model based on HunyuanVideo
CLIP, Predict the most relevant text snippet given an image
Repo for SeedVR2 & SeedVR
A Powerful Native Multimodal Model for Image Generation
GLM-4 series: Open Multilingual Multimodal Chat LMs
4M: Massively Multimodal Masked Modeling
Collection of Gemma 3 variants that are trained for performance
High-Fidelity and Controllable Generation of Textured 3D Assets
Long-form streaming TTS system for multi-speaker dialogue generation
Block Diffusion for Ultra-Fast Speculative Decoding