Accurate × Fast × Comprehensive
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
An experimental version of DeepSeek model
ChatGPT interface with better UI
Recovering the Visual Space from Any Views
ChatGLM-6B: An Open Bilingual Dialogue Language Model
tiktoken is a fast BPE tokeniser for use with OpenAI's models
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
PyTorch code and models for DINOv2 self-supervised learning
Ling is a MoE LLM provided and open-sourced by InclusionAI
Diversity-driven optimization and large-model reasoning ability
Long-form streaming TTS system for multi-speaker dialogue generation
LTX-Video Support for ComfyUI
Repo for SeedVR2 & SeedVR
Block Diffusion for Ultra-Fast Speculative Decoding
GLM-4 series: Open Multilingual Multimodal Chat LMs
4M: Massively Multimodal Masked Modeling
High-Fidelity and Controllable Generation of Textured 3D Assets
Collection of Gemma 3 variants that are trained for performance
A Powerful Native Multimodal Model for Image Generation
Designed for text embedding and ranking tasks
The official PyTorch implementation of Google's Gemma models
OCR expert VLM powered by Hunyuan's native multimodal architecture
Repo of Qwen2-Audio chat & pretrained large audio language model
Large Multimodal Models for Video Understanding and Editing