OCR software, free and offline
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
OCR expert VLM powered by Hunyuan's native multimodal architecture
The official Python library for the OpenAI API
The official Python Library for the Groq API
Convert AI papers to GUI
A Repo For Document AI
Open Source Document Management System for Digital Archives
A framework to enable multimodal models to operate a computer
GUI for a Vocal Remover that uses Deep Neural Networks
A community-supported supercharged version of paperless
Open source personal AI Assistant for Linux, Windows and Mac
Run Local LLMs on Any Device. Open-source
A Python application to add watermarks (text or image) to PDF files
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Qwen3-omni is a natively end-to-end, omni-modal LLM
21 Lessons, Get Started Building with Generative AI
Agentic, Reasoning, and Coding (ARC) foundation models
A Powerful Desktop Full-Text Search Engine, Just Like Local Google.
DoWhy is a Python library for causal inference
Adds support for Yandex Smart Home (Alice voice assistant)
Training data (data labeling, annotation, workflow) for all data types