OCR software, free and offline
Contexts Optical Compression
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
Ready-to-use OCR with 80+ supported languages
The official Python library for the OpenAI API
Library for OCR-related tasks powered by Deep Learning
A GUI tool for extracting hard-coded subtitles (hardsubs) from videos
The official Python library for the Groq API
Convert AI papers to GUI
A Repo For Document AI
Open Source Document Management System for Digital Archives
A framework to enable multimodal models to operate a computer
GUI for a Vocal Remover that uses Deep Neural Networks
A community-supported supercharged version of paperless
Open source personal AI Assistant for Linux, Windows and Mac
Run Local LLMs on Any Device. Open-source
Image inpainting tool powered by SOTA AI Model
A GPT-4o Level MLLM for Vision, Speech and Multimodal Live Streaming
Qwen3-omni is a natively end-to-end, omni-modal LLM
21 Lessons, Get Started Building with Generative AI
Agentic, Reasoning, and Coding (ARC) foundation models
A Powerful Desktop Full-Text Search Engine, Just Like Local Google
DoWhy is a Python library for causal inference