Audiocraft is a library for audio processing and generation
Segmentation models with pretrained backbones. PyTorch
Towards Studio-Grade Character Animation via In-Context Learning of 3D
Diagram as Code for prototyping cloud system architectures
Software that uses AI to perform real-time voice conversion
VMZ: Model Zoo for Video Modeling
The easiest way to use deep metric learning in your application
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
A Python library for audio data augmentation
Plug-n-play module turning text-to-image models into animation
OpenMMLab Model Deployment Framework
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
The PyTorch-based audio source separation toolkit for researchers
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
One-click face swap
OpenMMLab Image Classification Toolbox and Benchmark
Learning to Act by Watching Unlabeled Online Videos
Image Restoration Toolbox (PyTorch). Training and testing codes
A data augmentations library for audio, image, text, and video
Robust Video Matting in PyTorch, TensorFlow, TensorFlow.js, ONNX
The official pytorch implementation of our paper
Easy-OCR solution and Tesseract trainer for GNU/Linux
Super resolution using a CNN, based on the work of the DGtal team
Super-scale your images and run experiments with Residual Dense
Starter code for working with the YouTube-8M dataset