A Model Context Protocol server for searching and analyzing arXiv
4M: Massively Multimodal Masked Modeling
MobileLLM Optimizing Sub-billion Parameter Language Models
A Production-ready Reinforcement Learning AI Agent Library
PyTorch code and models for V-JEPA self-supervised learning from video
A PyTorch library for implementing flow matching algorithms
ImageBind One Embedding Space to Bind Them All
[CVPR 2025 Best Paper Award] VGGT
Code to accompany "A Method for Animating Children's Drawings"
Anthropic's educational courses
Repo of Qwen2-Audio chat & pretrained large audio language model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Structured outputs for llms
Low-latency REST API for serving text-embeddings
Scalable and user friendly neural forecasting algorithms.
Machine Learning Pipelines for Kubeflow
Synthetic data generators for tabular and time-series data
Easy-to-use,Modular and Extendible package of deep-learning models
JAX-based neural network library
Making large AI models cheaper, faster and more accessible
The easiest way to use deep metric learning in your application
Package of deep-learning based CTR models
OCR expert VLM powered by Hunyuan's native multimodal architecture
Prevent PyTorch's `CUDA error: out of memory` in just 1 line of code
Unifying 3D Mesh Generation with Language Models