AI-powered document analysis and tagging for Paperless-ngx
A high-quality tool for convert PDF to Markdown and JSON
LLM framework for document understanding and semantic retrieval
Open Source Document Management System for Digital Archives
Get your documents ready for gen AI
Document (PDF, Word, PPTX ...) extraction and parse API
A Repo For Document AI
An on-premises, OCR-free unstructured data extraction
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Structured data extraction and instruction calling with ML, LLM
Document content and metadata extraction microservice
A high-quality PDF to Markdown tool based on large language model
Parse files for optimal RAG
A Model Context Protocol (MCP) server implementation
Multilingual Document Layout Parsing in a Single Vision-Language Model
AI tool converting video/audio into structured documents instantly
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Chat with your documents using local AI
File Parser optimised for LLM Ingestion with no loss
A system for agentic LLM-powered data processing and ETL
Private chat with local GPT with document, images, video, etc.
Multi-tool for semantic search
Unified framework for building enterprise RAG pipelines
The official implementation of RAPTOR
Generate audiobooks from EPUBs, PDFs and text with captions