AI-powered document analysis and tagging for Paperless-ngx
A Repo For Document AI
Document (PDF, Word, PPTX ...) extraction and parse API
A Model Context Protocol (MCP) server implementation
Get your documents ready for gen AI
A high-quality tool for convert PDF to Markdown and JSON
A system for agentic LLM-powered data processing and ETL
ExtractThinker is a Document Intelligence library for LLMs
Document content and metadata extraction microservice
RAG-Anything: All-in-One RAG Framework
LongBench v2 and LongBench (ACL 25'&24')
Multi-tool for semantic search
Question and Answer based on Anything
Unified framework for building enterprise RAG pipelines
Chat with your documents using local AI
Research-oriented chatbot framework
Semantic search and workflows for medical/scientific papers
Public repository for Agent Skills
Optimized Workforce Learning for General Multi-Agent Assistance
Private chat with local GPT with document, images, video, etc.
Topic Modelling for Humans
ContextGem: Effortless LLM extraction from documents
Open source healthcare AI
Running large language models on a single GPU
Open-Source Financial Large Language Models