Image generation model with single-stream diffusion transformer
Visual Causal Flow
Programmatic access to the AlphaGenome model
Moonshot's most powerful AI model
Proxy that exposes Antigravity provided claude / gemini models
Flux 2 image generation model pure C inference
ChatGLM-6B: An Open Bilingual Dialogue Language Model
Official inference repo for FLUX.2 models
Diversity-driven optimization and large-model reasoning ability
Designed for text embedding and ranking tasks
State of the art LLM and coding model
Large Multimodal Models for Video Understanding and Editing
An experimental version of DeepSeek model
Accurate × Fast × Comprehensive
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
Recovering the Visual Space from Any Views
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Ling is a MoE LLM provided and open-sourced by InclusionAI
CLIP, Predict the most relevant text snippet given an image
MOSS‑TTS Family open‑source speech and sound generation model
4M: Massively Multimodal Masked Modeling
PyTorch code and models for the DINOv2 self-supervised learning
tiktoken is a fast BPE tokeniser for use with OpenAI's models
Clean and efficient FP8 GEMM kernels with fine-grained scaling
Access to Anthropic's safety-first language model APIs