A fast, helpful, and open-source document parser
eXist Native XML Database and Application Platform
A Model Context Protocol (MCP) server implementation
PDF Reader Library for Native Julia.
Document oriented database optimized for you
PDF Parser for AI-ready data. Automate PDF accessibility
Node.js client for Google Cloud Firestore: a NoSQL document database
Multilingual Document Layout Parsing in a Single Vision-Language Model
A high-quality PDF to Markdown tool based on large language model
Unified framework for building enterprise RAG pipelines
A community-supported supercharged version of paperless
CLI tool for saving complete web pages as a single HTML file
EF Core-like CouchDB experience for .NET
The all-in-one Desktop & Docker AI application with full RAG and AI
Low code web framework for real world applications
Self-hostable warranty tracker to monitor expirations, store receipts
A web extension that helps you view JSON documents in the browser
A system for agentic LLM-powered data processing and ETL
NoSQL embedded document store for Java
PHP7 / Laravel Multi-format Streaming Parser
AI tool converting video/audio into structured documents instantly
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
iText for Java represents the next level of SDKs for developers
The official ArangoDB JavaScript driver