OCR model for complex documents with layout-aware structured outputs
Document (PDF, Word, PPTX ...) extraction and parse API
Generate audiobooks from EPUBs, PDFs and text with captions
Enhances Tesseract OCR output using LLMs (local or API)
Open source healthcare AI
A Repo For Document AI
Free, local, open-source Cowork for Gemini CLI, Claude Code, Codex
Stable Diffusion web UI
OCR software, free and offline
Text mining using tidy tools
Comprehensive Gradio WebUI for audio processing
Visual Causal Flow
Stanford CoreNLP, a Java suite of core NLP tools
Persian NLP Toolkit
Open speech-to-speech models and pipelines by Hugging Face toolkit AI
Contexts Optical Compression
Readest is a modern, feature-rich ebook reader
A full spaCy pipeline and models for scientific/biomedical documents
Use Microsoft Edge's online text-to-speech service from Python
AI tool for automatic batch short video creation and editing
Screenshots, word marking, OCR, AI, translation software
A TTS that fits in your CPU (and pocket)
Easy-to-use and high-performance NLP and LLM framework
Apache OpenNLP
Deep Research framework, combining language models with tools