Generate Any 3D Scene in Seconds
Sharp Monocular Metric Depth in Less Than a Second
Implementation of DeepLabCut
2D and 3D face alignment library built using PyTorch
Medical imaging toolkit for deep learning
Benchmarking Multimodal Agents for Open-Ended Tasks
DeepMind model for tracking arbitrary points across videos & robotics
State-of-the-art (SoTA) text-to-video pre-trained model
End-to-end pipeline converting generative videos
A text-to-speech, speech-to-text and speech-to-speech library
The machine learning toolkit for time series analysis in Python
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
State-of-the-art diffusion models for image and audio generation
The data structure for multimodal data
Unsupervised Learning for Image Registration
Framework for building AI-powered interactive digital humans and agents
SAPIEN Manipulation Skill Framework
An open-source package that allows video game creators
A Systematic Framework for Interactive World Modeling
Simple and easily configurable grid world environments
A Next-Generation Training Engine Built for Ultra-Large MoE Models
Framework for building neural networks
Graph Neural Network Library for PyTorch
Geometric deep learning extension library for PyTorch
Deep learning optimization library: makes distributed training easy