A high performance implementation of HDBSCAN clustering
Powerful AI language model (MoE) optimized for efficiency/performance
Robust Speech Recognition via Large-Scale Weak Supervision
Code for running inference with the SAM 3D Body Model 3DB
A Python vector database you just need, no more, no less
A robust, efficient, low-latency speech-to-text library
Real-time behaviour synthesis with MuJoCo, using Predictive Control
Document Image Parsing via Heterogeneous Anchor Prompting”
Models for object and human mesh reconstruction
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
The official Python SDK for Model Context Protocol servers and clients
VMZ: Model Zoo for Video Modeling
Build multi-modal Agents with memory, knowledge, tools and reasoning
Fault-tolerant, highly scalable GPU orchestration
Context data platform for building observable, self-learning AI agents
Integrating LLMs into structured NLP pipelines
Implementation of DeepLabCut
Industrial-strength Natural Language Processing (NLP)
An open phone agent model & framework
Photorealistic Synthetic Dataset for Holistic Indoor Scene
LLM abstractions that aren't obstructions
End-to-end speech processing toolkit
Omnilingual ASR Open-Source Multilingual SpeechRecognition
PyTorch code and models for V-JEPA self-supervised learning from video
[CVPR 2025 Best Paper Award] VGGT