Pre-trained Deep Learning models and demos
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Library for OCR-related tasks powered by Deep Learning
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Han Language Processing
ImageBind: One Embedding Space to Bind Them All
AutoGluon: AutoML for Image, Text, and Tabular Data
The Petastorm library enables single-machine or distributed training
Improve your resumes with Resume Matcher
Machine learning software for extracting information
The data structure for multimodal data
Audiocraft is a library for audio processing and generation
MII makes low-latency and high-throughput inference possible
A distributed system for embedding-based vector retrieval
ktrain is a Python library that makes deep learning and AI more accessible
Build cross-modal and multimodal applications on the cloud
Deep Learning API and Server in C++14 with support for Caffe, PyTorch
Implementation of Make-A-Video, a new SOTA text-to-video generator
A collection of various deep learning architectures, models, and tips
An Embedded Computer Vision & Machine Learning Library
Applications of Deep Neural Networks
The deep learning toolkit for speech-to-text
Simple command-line tool for text-to-image generation
Free and open source text-to-speech software
Deep learning for text-to-speech