A GUI tool for extracting hard-coded subtitle (hardsub) from videos
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Library for OCR-related tasks powered by Deep Learning
Audiocraft is a library for audio processing and generation
Implementation of Make-A-Video, new SOTA text to video generator
Industrial-strength Natural Language Processing (NLP)
AutoGluon: AutoML for Image, Text, and Tabular Data
MII makes low-latency and high-throughput inference possible
Han Language Processing
ktrain is a Python library that makes deep learning AI more accessible
ImageBind One Embedding Space to Bind Them All
The data structure for multimodal data
Build cross-modal and multimodal applications on the cloud
A Unified Toolkit for Deep Learning Based Document Image Analysis
Simple command line tool for text to image generation
Pre-trained and Reproduced Deep Learning Models
Deep learning based natural language and speech processing platform
Library of deep learning models and datasets
Deal or No Deal? End-to-End Learning for Negotiation Dialogues
Natural Language Processing Best Practices & Examples
Stuttering Chinese word segmentation
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
Named-entity recognition using neural networks
Implementation of research papers on Deep Learning+ NLP+ CV in Python
PyTorch tutorials and fun projects including neural talk