PyTorch code and models for V-JEPA self-supervised learning from video
A PyTorch library for implementing flow matching algorithms
An implementation of a deep learning recommendation model (DLRM)
Official DeiT repository
[CVPR 2025 Best Paper Award] VGGT
Memory-efficient and performant finetuning of Mistral's models
Diffusion Transformer with Fine-Grained Chinese Understanding
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
Bailing is a voice dialogue robot similar to GPT-4o
An Open Source text-to-speech system built by inverting Whisper
MARS5 speech model (TTS) from CAMB.AI
Demo of a customer service use case implemented with the OpenAI Agents
Build Vision Agents quickly with any model or video provider
Virtual AI anchor that combines state-of-the-art technology
Unified Multimodal Understanding and Generation Models
LLM powered fuzzing via OSS-Fuzz
Beyond the Imitation Game collaborative benchmark for measuring
Expose your FastAPI endpoints as Model Context Protocol (MCP) tools
PyTorch code and models for VJEPA2 self-supervised learning from video
Language modeling in a sentence representation space
Dataset of GPT-2 outputs for research in detection, biases, and more
Code for Language models can explain neurons in language models paper
Evals is a framework for evaluating LLMs and LLM systems
The ChatGPT Retrieval Plugin lets you easily find personal documents
Scalable machine learning for time series forecasting