Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Models for object and human mesh reconstruction
Qwen3 is the large language model series developed by the Qwen team
Let's make video diffusion practical
An experimental version of the DeepSeek model
Visual Causal Flow
Z80-μLM is a 2-bit quantized language model
Recovering the Visual Space from Any Views
PyTorch code and models for the DINOv2 self-supervised learning
One-click local MCP server installation in desktop apps
Ling is a MoE LLM provided and open-sourced by InclusionAI
Diversity-driven optimization and large-model reasoning ability
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
ChatGLM-6B: An Open Bilingual Dialogue Language Model
CLIP: Predict the most relevant text snippet given an image
A Customizable Image-to-Video Model based on HunyuanVideo
A Powerful Native Multimodal Model for Image Generation
4M: Massively Multimodal Masked Modeling
Collection of Gemma 3 variants that are trained for performance
Long-form streaming TTS system for multi-speaker dialogue generation
Block Diffusion for Ultra-Fast Speculative Decoding
LTX-Video Support for ComfyUI
Repo of Qwen2-Audio chat & pretrained large audio language model
The official PyTorch implementation of Google's Gemma models
Implementation of the Surya Foundation Model for Heliophysics