File Parser optimised for LLM Ingestion with no loss
ID-based RAG FastAPI: Integration with Langchain and PostgreSQL
A system for agentic LLM-powered data processing and ETL
Multi-tool for semantic search
Parse files for optimal RAG
Your fully private, open-source, on-device AI assistant
A high-quality PDF-to-Markdown tool based on a large language model
Accurate × Fast × Comprehensive
Extract and convert data from any document: images, PDFs, Word docs
Generate audiobooks from EPUBs, PDFs and text with captions
RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine
Text mining using tidy tools
Public repository for Agent Skills
RAG Web UI is an intelligent dialogue system based on RAG
AI PDF chatbot agent built with LangChain & LangGraph
Interact with your documents using the power of GPT
LongBench v2 and LongBench (ACL '25 & '24)
Model Context Protocol (MCP) server to interact with Firebase services
Research-oriented chatbot framework
BISHENG is an open LLM DevOps platform for next-generation apps
OCR model for complex documents with layout-aware structured outputs
A persistent, network-resilient, full-text search library
ExtractThinker is a Document Intelligence library for LLMs
Fast and efficient unstructured data extraction
Multi-platform SDK for integrating GitHub Copilot Agent into apps