OCR model for complex documents with layout-aware structured outputs
Document (PDF, Word, PPTX ...) extraction and parse API
Generate audiobooks from EPUBs, PDFs and text with captions
Enhances Tesseract OCR output using LLMs (local or API)
Open source healthcare AI
A Repo For Document AI
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
Stable Diffusion web UI
Text mining using tidy tools
OCR software, free and offline
Faster Whisper transcription with CTranslate2
Comprehensive Gradio WebUI for audio processing
Visual Causal Flow
Stanford CoreNLP, a Java suite of core NLP tools
Persian NLP Toolkit
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Stable Diffusion web UI
Contexts Optical Compression
A full spaCy pipeline and models for scientific/biomedical documents
Readest is a modern, feature-rich ebook reader
Use Microsoft Edge's online text-to-speech service from Python
AI tool for automatic batch short video creation and editing
A TTS that fits in your CPU (and pocket)
Easy-to-use and high-performance NLP and LLM framework
Apache OpenNLP