A GUI tool for extracting hard-coded subtitle (hardsub) from videos
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Library for OCR-related tasks powered by Deep Learning
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
Audiocraft is a library for audio processing and generation
Implementation of Make-A-Video, new SOTA text to video generator
Han Language Processing
ImageBind One Embedding Space to Bind Them All
AutoGluon: AutoML for Image, Text, and Tabular Data
A machine learning software for extracting information
The data structure for multimodal data
Improve your resumes with Resume Matcher
Pre-trained Deep Learning models and demos
Interview guide for machine learning, mathematics, and deep learning
A collection of various deep learning architectures, models, and tips
A distributed system for embedding-based vector retrieval
ktrain is a Python library that makes deep learning AI more accessible
MII makes low-latency and high-throughput inference possible
Deep Learning API and Server in C++14 support for Caffe, PyTorch
Build cross-modal and multimodal applications on the cloud
An Embedded Computer Vision & Machine Learning Library
Applications of Deep Neural Networks
The deep learning toolkit for speech-to-text
Simple command line tool for text to image generation
Free and open source text-to-speech software