Semantic search and document parsing tools for the command line
LLM framework for document understanding and semantic retrieval
AI-powered document analysis and tagging for Paperless-ngx
Document (PDF, Word, PPTX ...) extraction and parse API
An on-premises, OCR-free unstructured data extraction
Open source semantic search and text analytics for large document sets
Multilingual Document Layout Parsing in a Single Vision-Language Model
Canvas-based WYSIWYG rich text editor with advanced layout tools
Chat with your documents using local AI
A high-quality PDF to Markdown tool based on large language model
A system for agentic LLM-powered data processing and ETL
AI tool converting video/audio into structured documents instantly
Your fully private, open-source, on-device AI assistant
The official implementation of RAPTOR
Multi-tool for semantic search
Extract and convert data from any document, images, pdfs, word doc
Accurate × Fast × Comprehensive
Public repository for Agent Skills
RAG Web UI is an intelligent dialogue system based on RAG
AI PDF chatbot agent built with LangChain & LangGraph
Multi-platform SDK for integrating GitHub Copilot Agent into apps
LongBench v2 and LongBench (ACL 25'&24')
Application implementation with business use cases
OCR model for complex documents with layout-aware structured outputs
Fast and efficient unstructured data extraction