LLM framework for document understanding and semantic retrieval
Pluggable SOTA multi-object tracking modules for segmentation
An on-premises, OCR-free unstructured data extraction
Open Source Document Management System for Digital Archives
AI-powered document analysis and tagging for Paperless-ngx
Leaderboard Comparing LLM Performance at Producing Hallucinations
Multilingual Document Layout Parsing in a Single Vision-Language Model
Documentation for the Krixik Python client
Document content and metadata extraction microservice
The official implementation of RAPTOR
Document (PDF, Word, PPTX ...) extraction and parse API
Multi-tool for semantic search
OpenLIT is an open-source LLM Observability tool
Private chat with a local GPT over documents, images, video, etc.
An open-source, modern-design AI training tracking and visualization
Natural language workflows for AI agents
A community-supported supercharged version of paperless
Structured data extraction and instruction calling with ML, LLM
DeepCode: Open Agentic Coding
A system for agentic LLM-powered data processing and ETL
[CVPR 2025 Best Paper Award] VGGT
Git-based data version control for machine learning workflows
Reading book source
Unified framework for building enterprise RAG pipelines
Interact with your documents using the power of GPT