AI-powered document analysis and tagging for Paperless-ngx
Document (PDF, Word, PPTX, ...) extraction and parsing API
A Repo For Document AI
A Model Context Protocol (MCP) server implementation
A high-quality tool for converting PDF to Markdown and JSON
Get your documents ready for gen AI
Text mining using tidy tools
Open source semantic search and text analytics for large document sets
ExtractThinker is a Document Intelligence library for LLMs
A system for agentic LLM-powered data processing and ETL
RAG-Anything: All-in-One RAG Framework
Document content and metadata extraction microservice
An open source collaborative multi-agent OS
PHP low-level client for Elasticsearch
Application implementation with business use cases
LongBench v2 and LongBench (ACL '25 & '24)
Unified framework for building enterprise RAG pipelines
Autonomous agents for everyone
Clean network diagrams. One-time setup, zero upkeep
Chat with your documents using local AI
Full-stack Open-source Self-Evolving General AI Agent
Question and Answer based on Anything
Multi-tool for semantic search
Research-oriented chatbot framework
Your fully private, open-source, on-device AI assistant