LLM framework for document understanding and semantic retrieval
AI-powered document analysis and tagging for Paperless-ngx
Semantic search and document parsing tools for the command line
On-premises, OCR-free unstructured data extraction
Document (PDF, Word, PPTX, ...) extraction and parsing API
Open source semantic search and text analytics for large document sets
Multilingual Document Layout Parsing in a Single Vision-Language Model
A high-quality PDF-to-Markdown tool based on a large language model
Canvas-based WYSIWYG rich text editor with advanced layout tools
Chat with your documents using local AI
AI tool converting video/audio into structured documents instantly
A system for agentic LLM-powered data processing and ETL
Your fully private, open-source, on-device AI assistant
The official implementation of RAPTOR
Accurate × Fast × Comprehensive
Multi-tool for semantic search
Public repository for Agent Skills
Extract and convert data from any document: images, PDFs, Word docs
Multi-platform SDK for integrating GitHub Copilot Agent into apps
RAG Web UI is an intelligent dialogue system based on RAG
AI PDF chatbot agent built with LangChain & LangGraph
LongBench v2 and LongBench (ACL '25 & '24)
Application implementation with business use cases
Fast and efficient unstructured data extraction
Document Management System and Content Management System