A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
The data structure for multimodal data
The Triton Inference Server provides an optimized cloud
Build cross-modal and multimodal applications on the cloud
A computer vision framework to create and deploy apps in minutes
Gluon CV Toolkit
We estimate dense, flicker-free, geometrically consistent depth
Mathematical derivations for Deep Learning (the "Flower Book")
The world's simplest facial recognition API for Python and the command line