Open source semantic search and text analytics for large document sets
AI-powered document analysis and tagging for Paperless-ngx
LLM framework for document understanding and semantic retrieval
On-premises, OCR-free unstructured data extraction
Document (PDF, Word, PPTX ...) extraction and parsing API
Semantic search and document parsing tools for the command line
Multilingual Document Layout Parsing in a Single Vision-Language Model
A high-quality PDF-to-Markdown tool based on a large language model
AI tool converting video/audio into structured documents instantly
A system for agentic LLM-powered data processing and ETL
RAG Web UI is an intelligent dialogue system based on RAG
The official implementation of RAPTOR
Multi-tool for semantic search
ChatOllama is an open-source AI chatbot
Chat with your documents using local AI
Accurate × Fast × Comprehensive
Canvas-based WYSIWYG rich text editor with advanced layout tools
Your fully private, open-source, on-device AI assistant
AI PDF chatbot agent built with LangChain & LangGraph
Application implementation with business use cases
LongBench v2 and LongBench (ACL '25 & '24)
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Extract and convert data from any document, images, PDFs, Word doc
OCR model for complex documents with layout-aware structured outputs
Multi-platform SDK for integrating GitHub Copilot Agent into apps