Open-source platform for extracting structured data from documents
Documind is a document processing tool that uses AI to extract structured data from PDFs. It converts PDFs, extracts the relevant information, and formats the results according to customizable schemas.
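To make the idea of schema-driven formatting concrete, here is a minimal Python sketch. It is not Documind's API or schema format: the `INVOICE_SCHEMA` mapping, the `format_result` helper, and the sample field names are all hypothetical, and the AI extraction step is replaced by a hand-written dictionary.

```python
from typing import Any

# Hypothetical schema: field name -> expected Python type.
# This is an illustration only, not Documind's actual schema format.
INVOICE_SCHEMA = {
    "invoice_number": str,
    "total": float,
    "paid": bool,
}


def format_result(raw: dict[str, Any], schema: dict[str, type]) -> dict[str, Any]:
    """Keep only the fields named in the schema and coerce each value to its type."""
    result: dict[str, Any] = {}
    for name, expected in schema.items():
        value = raw.get(name)
        if value is None:
            # Field missing from the extraction: keep the key, mark it empty.
            result[name] = None
        elif expected is bool and isinstance(value, str):
            # bool("no") would be True, so map common textual values explicitly.
            result[name] = value.strip().lower() in {"true", "yes", "1"}
        else:
            result[name] = expected(value)
    return result


# Stand-in for whatever an extraction step might return (assumed values).
raw_fields = {"invoice_number": 1042, "total": "199.90", "paid": "yes", "footer": "page 1"}
formatted = format_result(raw_fields, INVOICE_SCHEMA)
print(formatted)
```

Fields that the schema does not name (here `footer`) are dropped, so the output shape depends on the schema rather than on whatever the extraction step happened to return.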
Command-line toolset for extracting text from files
Extracts text from files (documents, images, archives) into a SQLite database, with OCR support.
Simple and extensible, implemented as a single shell script.
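The general pattern of storing extracted text in SQLite can be sketched as follows. This is an illustration in Python, not the toolset's own shell implementation: the `extracted` table, its columns, and the `store_text` helper are assumptions, and the extraction and OCR steps are replaced by literal strings.

```python
import sqlite3


def store_text(conn: sqlite3.Connection, path: str, text: str) -> None:
    """Insert or update the extracted text for one source file."""
    conn.execute(
        "INSERT OR REPLACE INTO extracted (path, content) VALUES (?, ?)",
        (path, text),
    )


# In-memory database for the example; a real run would use a file path.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE IF NOT EXISTS extracted (path TEXT PRIMARY KEY, content TEXT)"
)

# Literal strings stand in for the output of a text extractor or OCR engine.
store_text(conn, "notes/readme.txt", "hello world")
store_text(conn, "scan/page1.png", "text recovered by OCR")
conn.commit()

# Once the text is in SQLite, it can be searched with ordinary SQL.
rows = conn.execute(
    "SELECT path FROM extracted WHERE content LIKE ?", ("%OCR%",)
).fetchall()
print(rows)
```

Keying the table on the file path means re-processing a file overwrites its previous row instead of creating a duplicate.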