Implementation of Make-A-Video, new SOTA text to video generator
PyTorch code and models for VJEPA2 self-supervised learning from video
Build cross-modal and multimodal applications on the cloud
Interactive video and image annotation tool for computer vision
C++ library for high performance inference on NVIDIA GPUs
PyTorch3D is FAIR's library of reusable components for deep learning
Techniques for deep learning with satellite & aerial imagery
The data structure for multimodal data
Deep Learning API and Server in C++14 support for Caffe, PyTorch
A GPU-accelerated library containing highly optimized building blocks
Pre-trained Deep Learning models and demos
MNN is a blazing fast, lightweight deep learning framework
PyTorch code and models for V-JEPA self-supervised learning from video
Making large AI models cheaper, faster and more accessible
Toolkit for making machine learning and data analysis applications
A Python library for audio data augmentation
MII makes low-latency and high-throughput inference possible
A distributed system for embedding-based vector retrieval
Open Source Computer Vision Library
A computer vision framework to create and deploy apps in minutes
Distributed training framework for TensorFlow, Keras, PyTorch, etc.
The fastai book, published as Jupyter Notebooks
A flexible and efficient library for deep learning
Face Mask Detection system based on computer vision and deep learning
Gluon CV Toolkit