Open source semantic search and text analytics for large document sets
AI-powered document analysis and tagging for Paperless-ngx
LLM framework for document understanding and semantic retrieval
A high-quality tool for convert PDF to Markdown and JSON
A Repo For Document AI
Open Source Document Management System for Digital Archives
Use LLMs and LLM Vision (OCR) to handle paperless-ngx
An on-premises, OCR-free unstructured data extraction
Get your documents ready for gen AI
Document (PDF, Word, PPTX ...) extraction and parse API
Semantic search and document parsing tools for the command line
Assist in organizing your piles of documents
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Private chat with local GPT with document, images, video, etc.
Structured data extraction and instruction calling with ML, LLM
A fast, helpful, and open-source document parser
Cherry Studio is a desktop client that supports for multiple LLMs
Multilingual Document Layout Parsing in a Single Vision-Language Model
A Model Context Protocol (MCP) server implementation
A community-supported supercharged version of paperless
A high-quality PDF to Markdown tool based on large language model
The all-in-one Desktop & Docker AI application with full RAG and AI
Document content and metadata extraction microservice
A system for agentic LLM-powered data processing and ETL
AI tool converting video/audio into structured documents instantly