Chinese version of the Google open-source project style guide
AI-powered document analysis and tagging for Paperless-ngx
LLM framework for document understanding and semantic retrieval
A high-quality tool for converting PDF to Markdown and JSON
Open Source Document Management System for Digital Archives
A Repo For Document AI
An on-premises, OCR-free unstructured data extraction
Get your documents ready for gen AI
Document (PDF, Word, PPTX ...) extraction and parsing API
A Python Object-Document-Mapper for working with MongoDB
"LightRAG: Simple and Fast Retrieval-Augmented Generation"
Sync and Async ODM (Object Document Mapper) for MongoDB
RAG-Anything: All-in-One RAG Framework
Private chat with a local GPT over documents, images, video, etc.
Structured data extraction and instruction calling with ML, LLM
Multilingual Document Layout Parsing in a Single Vision-Language Model
Document-oriented database optimized for you
A community-supported supercharged version of paperless
A Model Context Protocol (MCP) server implementation
A high-quality PDF-to-Markdown tool based on a large language model
Low-code web framework for real-world applications
Document content and metadata extraction microservice
A system for agentic LLM-powered data processing and ETL
AI tool converting video/audio into structured documents instantly
Unified framework for building enterprise RAG pipelines