The most powerful and modular diffusion model GUI, api and backend
The data structure for multimodal data
Build cross-modal and multimodal applications on the cloud
PyExe: YouTube thumbnail downloader (type-b) [I.S.A]
An end-to-end PyTorch framework for image and video classification
Real-ESRGAN aims at developing Practical Algorithms
GFPGAN aims at developing Practical Algorithms
A data augmentations library for audio, image, text, and video
Gluon CV Toolkit
We estimate dense, flicker-free, geometrically consistent depth
Identification codes
World's simplest facial recognition api for Python & the command line