AI-powered document analysis and tagging for Paperless-ngx
A Repo For Document AI
Document (PDF, Word, PPTX ...) extraction and parse API
A high-quality tool for convert PDF to Markdown and JSON
Get your documents ready for gen AI
Open source semantic search and text analytics for large document sets
Text mining using tidy tools
A Model Context Protocol (MCP) server implementation
ExtractThinker is a Document Intelligence library for LLMs
Private chat with local GPT with document, images, video, etc.
Document content and metadata extraction microservice
PHP low-level client for Elasticsearch
LongBench v2 and LongBench (ACL 25'&24')
A system for agentic LLM-powered data processing and ETL
Multi-tool for semantic search
RAG-Anything: All-in-One RAG Framework
Autonomous agents for everyone
Open-Source Financial Large Language Models
Clean network diagrams, One-time setup, zero upkeep
Full-stack Open-source Self-Evolving General AI Agent
Public repository for Agent Skills
Research-oriented chatbot framework
Optimized Workforce Learning for General Multi-Agent Assistance
Extract and convert data from any document, images, pdfs, word doc
Chat with your documents using local AI