A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Audiocraft is a library for audio processing and generation
Library for OCR-related tasks powered by Deep Learning
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions
Implementation of Make-A-Video, a new SOTA text-to-video generator
AutoGluon: AutoML for Image, Text, and Tabular Data
Han Language Processing
ImageBind: One Embedding Space to Bind Them All
Improve your resumes with Resume Matcher
MII makes low-latency and high-throughput inference possible
The data structure for multimodal data
A distributed system for embedding-based vector retrieval
Deep Learning API and Server in C++14 with support for Caffe, PyTorch
Pre-trained Deep Learning models and demos
Machine learning software for extracting information
ktrain is a Python library that makes deep learning and AI more accessible
Interview guide for machine learning, mathematics, and deep learning
Build cross-modal and multimodal applications on the cloud
A collection of various deep learning architectures, models, and tips
An Embedded Computer Vision & Machine Learning Library
Applications of Deep Neural Networks
The deep learning toolkit for speech-to-text
Simple command-line tool for text-to-image generation
Free and open source text-to-speech software