Get your documents ready for gen AI
AI-powered document analysis and tagging for Paperless-ngx
A high-quality tool for convert PDF to Markdown and JSON
Open Source Document Management System for Digital Archives
A Repo For Document AI
Document (PDF, Word, PPTX ...) extraction and parse API
An on-premises, OCR-free unstructured data extraction
LLM framework for document understanding and semantic retrieval
Private chat with local GPT with document, images, video, etc.
Multilingual Document Layout Parsing in a Single Vision-Language Model
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Document content and metadata extraction microservice
Parse files for optimal RAG
A Python Object-Document-Mapper for working with MongoDB
Multi-tool for semantic search
Low code web framework for real world applications
The official implementation of RAPTOR
RAG-Anything: All-in-One RAG Framework
File Parser optimised for LLM Ingestion with no loss
A community-supported supercharged version of paperless
Edit PDF files with Nano Banana
AI tool converting video/audio into structured documents instantly
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Structured data extraction and instruction calling with ML, LLM
Python bindings for MuPDF's rendering library.