Python tool for converting files and office documents to Markdown
A community-supported supercharged version of paperless
Open Source Document Management System for Digital Archives
Package for converting and rendering markdown documents in TeX
Small python-gtk application, to merge or split PDFs
The awesome document factory
An open-source RAG-based tool for chatting with your documents
OCR software, free and offline
Multi-tool for semantic search
Fault-tolerant Python3 package for searching LaTeX documents
Interact with your documents using the power of GPT
An AI personal assistant for your digital brain
Python bindings for MuPDF's rendering library.
Library for OCR-related tasks powered by Deep Learning
An on-premises, OCR-free unstructured data extraction
Chat with your documents using local AI
Enhances Tesseract OCR output using LLMs (local or API)
LLM framework for document understanding and semantic retrieval
Generate audiobooks from EPUBs, PDFs and text with captions
AI-powered document analysis and tagging for Paperless-ngx
A high-quality PDF to Markdown tool based on large language model
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System
Visual Causal Flow
A Repo For Document AI
Open source RAG framework for building scalable modular AI apps