An experimental version of DeepSeek model
Diversity-driven optimization and large-model reasoning ability
GLM-4.5: Open-source LLM for intelligent agents by Z.ai
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Visual Causal Flow
4M: Massively Multimodal Masked Modeling
Repo of Qwen2-Audio chat & pretrained large audio language model
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Accurate × Fast × Comprehensive
Designed for text embedding and ranking tasks
Inference code for scalable emulation of protein equilibrium ensembles
A 0.1B Omni model trained from scratch
High-Fidelity and Controllable Generation of Textured 3D Assets
Ling is a MoE LLM provided and open-sourced by InclusionAI
ChatGPT interface with better UI
Recovering the Visual Space from Any Views
CLIP, Predict the most relevant text snippet given an image
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Repo for SeedVR2 & SeedVR
Large Multimodal Models for Video Understanding and Editing
MOSS‑TTS Family open‑source speech and sound generation model
Collection of Gemma 3 variants that are trained for performance
A SOTA open-source image editing model
OCR expert VLM powered by Hunyuan's native multimodal architecture
code for Mesh R-CNN, ICCV 2019