A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
The data structure for multimodal data
Chatbot daemon that connects to your favorite chat services
Build cross-modal and multimodal applications on the cloud
Build AI-powered semantic search applications
The Triton Inference Server provides an optimized cloud
GFPGAN aims at developing Practical Algorithms
A version of the AI art creation software based on Disco Diffusion
A computer vision framework to create and deploy apps in minutes
Official implementation for UniVL video and language training models
Gluon CV Toolkit
Software tool that converts text to video for a more engaging experience
The leading software for creating deepfakes
Deep Learning (Flower Book) mathematical derivation
Easy-OCR solution and Tesseract trainer for GNU/Linux
Basic Utilities for PyTorch Natural Language Processing (NLP)
Identification codes
World's simplest facial recognition API for Python & the command line
Just Another Speech Recognition and Text to Speech software
Pattern recognition for ADL events