Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Advanced techniques for RAG systems
Fast and universal 3D reconstruction model for versatile tasks
A Model Context Protocol server for searching and analyzing arXiv
4M: Massively Multimodal Masked Modeling
This repository contains the official implementation of FastVLM
Refer and Ground Anything Anywhere at Any Granularity
Set of tools to assess and improve LLM security
MobileLLM: Optimizing Sub-billion Parameter Language Models
A Production-ready Reinforcement Learning AI Agent Library
A PyTorch library for implementing flow matching algorithms
An implementation of a deep learning recommendation model (DLRM)
[CVPR 2025 Best Paper Award] VGGT
Memory-efficient and performant finetuning of Mistral's models
Diffusion Transformer with Fine-Grained Chinese Understanding
NVIDIA Isaac GR00T N1.5 is the world's first open foundation model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
The Memory layer for AI Agents
Machine Learning Pipelines for Kubeflow
Training PyTorch models with differential privacy
JAX-based neural network library
Models and examples built with TensorFlow
Concatenate a directory full of files into a single prompt
Open source AI VTuber platform with voice chat and Live2D avatars