GLM-4.5: Open-source LLM for intelligent agents by Z.ai
Python inference and LoRA trainer package for the LTX-2 audio–video
Image generation model with single-stream diffusion transformer
From Vibe Coding to Agentic Engineering
Accurate × Fast × Comprehensive
Programmatic access to the AlphaGenome model
Visual Causal Flow
Diversity-driven optimization and large-model reasoning ability
Flux 2 image generation model pure C inference
ChatGLM-6B: An Open Bilingual Dialogue Language Model
State of the art LLM and coding model
Official inference repo for FLUX.2 models
Large Multimodal Models for Video Understanding and Editing
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Proxy that exposes Antigravity provided claude / gemini models
An experimental version of DeepSeek model
Ling is a MoE LLM provided and open-sourced by InclusionAI
Moonshot's most powerful AI model
PyTorch code and models for the DINOv2 self-supervised learning
Recovering the Visual Space from Any Views
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
CLIP, Predict the most relevant text snippet given an image
MOSS‑TTS Family open‑source speech and sound generation model
4M: Massively Multimodal Masked Modeling
tiktoken is a fast BPE tokeniser for use with OpenAI's models