Python binding to the Apache Tika™ REST services
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Ready-to-use OCR with 80+ supported languages
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Label Studio is a multi-type data labeling and annotation tool
Models for the spaCy Natural Language Processing (NLP) library
A community-supported supercharged version of paperless
Machine-learning-based conversational dialog engine for creating chatbots
Evaluate and monitor ML models from validation to production
Open source machine learning framework to automate text conversations
Library for OCR-related tasks powered by Deep Learning
Toolkit for conversational AI
Tool for visualizing and tracking your machine learning experiments
An open source implementation of CLIP
Python implementation of TextRank algorithms
AutoGluon: AutoML for Image, Text, and Tabular Data
Python package for AutoML on Tabular Data with Feature Engineering
A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
Build cross-modal and multimodal applications on the cloud
Implementation of Imagen, Google's Text-to-Image Neural Network
Create videos with Stable Diffusion
A very simple framework for state-of-the-art NLP
Algorithms for outlier, adversarial and drift detection
Python framework for adversarial attacks and data augmentation
Data loaders and abstractions for text and NLP