Accurate × Fast × Comprehensive
Inference script for Oasis 500M
A PyTorch library for implementing flow matching algorithms
PyTorch code and models for the DINOv2 self-supervised learning
tiktoken is a fast BPE tokeniser for use with OpenAI's models
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
New family of code large language models (LLMs)
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
CodeGeeX2: A More Powerful Multilingual Code Generation Model
Tongyi Deep Research, the Leading Open-source Deep Research Agent
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Qwen3-omni is a natively end-to-end, omni-modal LLM
Inference code for scalable emulation of protein equilibrium ensembles
The Clay Foundation Model - An open source AI model and interface
Robust Speech Recognition Across Languages, Dialects
Official code base for LeWorldModel: Stable End-to-End Joint-Embedding
The official PyTorch implementation of Google's Gemma models
Programmatic access to the AlphaGenome model
Open Source Speech Language Model
Long-form streaming TTS system for multi-speaker dialogue generation
Open-source industrial-grade ASR models
OpenTinker is an RL-as-a-Service infrastructure for foundation models
Multimodal embedding and reranking models built on Qwen3-VL
Z80-μLM is a 2-bit quantized language model