ExtractThinker is a Document Intelligence library for LLMs
PDF to Markdown with vision models
The awesome document factory
CLI tool to extract (meta)data from PDF and manipulate PDF files
BISHENG is an open LLM devops platform for next generation apps
Interact with your documents using the power of GPT
Leaderboard Comparing LLM Performance at Producing Hallucinations
A Python SOAP client
JupyterLab extension for live editing of LaTeX documents
A Heterogeneous Benchmark for Information Retrieval
ContextGem: Effortless LLM extraction from documents
Ready-to-use OCR with 80+ supported languages
borb is a library for reading, creating and manipulating PDF files
RAGLite is a Python toolkit for Retrieval-Augmented Generation
An open-source RAG-based tool for chatting with your documents
Powerful and highly extensible command-line based document
Revolutionizing Database Interactions with Private LLM Technology
The official Python client for the Hugging Face Hub
Semantic search and workflows for medical/scientific papers
Visual Causal Flow
Topic Modelling for Humans
Open Source Generative Process Automation
Simple PDF generation for Python
Solve end-to-end problems using the Llama model family
Paste Markdown and AI responses into Word and Excel instantly