Speech recognition module for Python
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
OpenMMLab Model Deployment Framework
3D reconstruction software
Machine learning, conversational dialog engine for creating chat bots
Image processing in Python
A Python library for audio data augmentation
Embed images and sentences into fixed-length vectors
Stable Diffusion built-in to Blender
VMZ: Model Zoo for Video Modeling
Audiocraft is a library for audio processing and generation
Image inpainting tool powered by SOTA AI Model
MMEditing is a low-level vision toolbox based on PyTorch
Open Source Differentiable Computer Vision Library
code for Mesh R-CNN, ICCV 2019
The easiest way to use deep metric learning in your application
Image/video AI upscaler app (BSRGAN)
Code to accompany "A Method for Animating Children's Drawings"
VGGSfM: Visual Geometry Grounded Deep Structure From Motion
Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis
Open Source Computer Vision Library
OpenMMLab's Next Generation Video Understanding Toolbox and Benchmark
The PyTorch-based audio source separation toolkit for researchers
Run the Stable Diffusion releases in a Docker container
Suite with Real-ESRGAN, BSRGAN , IRCNN, GFPGAN & RIFE. v4.3