Python binding to the Apache Tika™ REST services
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
Label Studio is a multi-type data labeling and annotation tool
Ready-to-use OCR with 80+ supported languages
Models for the spaCy Natural Language Processing (NLP) library
Library for OCR-related tasks powered by Deep Learning
Open source machine learning framework to automate text conversations
A community-supported supercharged version of paperless
State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX
Toolkit for conversational AI
Machine-learning-based conversational dialog engine for creating chatbots
An open source implementation of CLIP
MMEditing is a low-level vision toolbox based on PyTorch
Python implementation of TextRank algorithms
Implementation of Imagen, Google's Text-to-Image Neural Network
Elyra extends JupyterLab with an AI-centric approach
Create videos with Stable Diffusion
A very simple framework for state-of-the-art NLP
Algorithms for outlier, adversarial and drift detection
Evaluate and monitor ML models from validation to production
Python framework for adversarial attacks and data augmentation
AutoGluon: AutoML for Image, Text, and Tabular Data
Python package for AutoML on Tabular Data with Feature Engineering
Data loaders and abstractions for text and NLP
Solve end-to-end problems using the Llama model family