Python binding to the Apache Tika™ REST services
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Ready-to-use OCR with 80+ supported languages
Label Studio is a multi-type data labeling and annotation tool
Models for the spaCy Natural Language Processing (NLP) library
Library for OCR-related tasks powered by Deep Learning
Open source machine learning framework to automate text conversations
Toolkit for conversational AI
An open source implementation of CLIP
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
A community-supported supercharged version of paperless
ONNX-TensorRT: TensorRT backend for ONNX
Machine learning, conversational dialog engine for creating chat bots
AutoGluon: AutoML for Image, Text, and Tabular Data
Tool for visualizing and tracking your machine learning experiments
Evaluate and monitor ML models from validation to production
Python package for AutoML on Tabular Data with Feature Engineering
Embed images and sentences into fixed-length vectors
Python implementation of TextRank algorithms
Standalone, small, language-neutral
Implementation of Imagen, Google's Text-to-Image Neural Network
Create videos with Stable Diffusion
A very simple framework for state-of-the-art NLP
Algorithms for outlier, adversarial and drift detection
Python framework for adversarial attacks, and data augmentation