Enhances Tesseract OCR output using LLMs (local or API)
Open source libraries and APIs to build custom preprocessing pipelines
A simple tool for reading in poorly redacted documents
Easily compute clip embeddings and build a clip retrieval system
OCR software, free and offline
Cut videos with a text editor
Automatic Speech Recognition with Word-level Timestamps
Python binding to the Apache Tika™ REST services
Agent harness to make your slop code well-engineered and beautiful
Advanced NLP with spaCy: A free online course
A very simple framework for state-of-the-art NLP
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Audiocraft is a library for audio processing and generation
End-to-end speech processing toolkit
Faster Whisper transcription with CTranslate2
Public opinion analysis system
Use Microsoft Edge's online text-to-speech service from Python
Stable Diffusion web UI
Pretrained model hub for Keras 3
Open source no-code system for text annotation and building of text
Voice Recognition to Text Tool
Deep Research framework, combining language models with tools
Fast and customizable framework for automatic ML model creation
Python library for scraping and analyzing online news articles easily
Chinese XLNet pre-trained model