Open source semantic search and text analytics for large document sets
AI-powered document analysis and tagging for Paperless-ngx
LLM framework for document understanding and semantic retrieval
A Repo For Document AI
A high-quality tool for convert PDF to Markdown and JSON
Open Source Document Management System for Digital Archives
Get your documents ready for gen AI
Document (PDF, Word, PPTX ...) extraction and parse API
Use LLMs and LLM Vision (OCR) to handle paperless-ngx
An on-premises, OCR-free unstructured data extraction
Semantic search and document parsing tools for the command line
Assist in organizing your piles of documents
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Structured data extraction and instruction calling with ML, LLM
Document content and metadata extraction microservice
Private chat with local GPT with document, images, video, etc.
Cherry Studio is a desktop client that supports for multiple LLMs
A fast, helpful, and open-source document parser
A Model Context Protocol (MCP) server implementation
Multilingual Document Layout Parsing in a Single Vision-Language Model
A high-quality PDF to Markdown tool based on large language model
Unified framework for building enterprise RAG pipelines
A community-supported supercharged version of paperless
The all-in-one Desktop & Docker AI application with full RAG and AI
A system for agentic LLM-powered data processing and ETL