AI tool converting video/audio into structured documents instantly
Chat with your documents using local AI
File Parser optimised for LLM Ingestion with no loss
Code repository for PDFStitcher, a utility to stitch together PDFs
The official implementation of RAPTOR
ID-based RAG FastAPI: Integration with Langchain and PostgreSQL
Edit PDF files with Nano Banana
Automate the management and configuration of infrastructures at scale
Multi-tool for semantic search
Parse files for optimal RAG
Accurate × Fast × Comprehensive
Generate audiobooks from EPUBs, PDFs and text with captions
Python bindings for MuPDF's rendering library.
DeepCode: Open Agentic Coding
Document Index for Vectorless, Reasoning-based RAG
Fully featured framework for fast, easy and documented API development
95% on SimpleQA (e.g. Qwen3.6-27B on a 3090)
Optimized Workforce Learning for General Multi-Agent Assistance
Research-oriented chatbot framework
LongBench v2 and LongBench (ACL 25'&24')
OCR model for complex documents with layout-aware structured outputs
GLM-4.6V/4.5V/4.1V-Thinking, towards versatile multimodal reasoning
Library for OCR-related tasks powered by Deep Learning
Enhances Tesseract OCR output using LLMs (local or API)
A Python SOAP client