A community-supported, supercharged version of Paperless
Open Source Document Management System for Digital Archives
The all-in-one Desktop & Docker AI application with full RAG and AI
Assist in organizing your piles of documents
Python tool for converting files and office documents to Markdown
Readest is a modern, feature-rich ebook reader
Generate audiobooks from EPUBs, PDFs and text with captions
An AI personal assistant for your digital brain
A full spaCy pipeline and models for scientific/biomedical documents
The ChatGPT Retrieval Plugin lets you easily find personal documents
Interact with your documents using the power of GPT
An open-source RAG-based tool for chatting with your documents
Python scraper based on AI
OCRmyPDF adds an OCR text layer to scanned PDF files
A Repo For Document AI
Contexts Optical Compression
ContextGem: Effortless LLM extraction from documents
OCR software, free and offline
Machine learning software for extracting information
File Parser optimised for LLM Ingestion with no loss
Tongyi Deep Research, the Leading Open-source Deep Research Agent
ktrain is a Python library that makes deep learning AI more accessible
Haystack is an open-source NLP framework to interact with your data
Build AI-powered semantic search applications
Open-source libraries and APIs to build custom preprocessing pipelines