Toolkit for conversational AI
Open source semantic search and text analytics for large document sets
A Repo For Document AI
A free, open source, and extensible speech-to-text application
Readest is a modern, feature-rich ebook reader
Screenshots, word marking, OCR, AI, translation software
Easy-to-use and powerful NLP library with Awesome model zoo
The most accurate natural language detection library for Rust
A high-quality PDF to Markdown tool based on large language model
AI-powered tool for generating, optimizing, and translating subtitles
Easy-to-use and high-performance NLP and LLM framework
Go efficient multilingual NLP and text segmentation
Enhances Tesseract OCR output using LLMs (local or API)
Generate audiobooks from EPUBs, PDFs and text with captions
Easily compute clip embeddings and build a clip retrieval system
A fast, helpful, and open-source document parser
OCR software, free and offline
A very simple framework for state-of-the-art NLP
Python binding to the Apache Tika™ REST services
Automatic Speech Recognition with Word-level Timestamps
Apache OpenNLP
Open source libraries and APIs to build custom preprocessing pipelines
An opinionated CLI to transcribe Audio files w/ Whisper on-device
Agent harness to make your slop code well-engineered and beautiful
Advanced NLP with spaCy: A free online course