A GUI tool for extracting hard-coded subtitle (hardsub) from videos
Open Source OCR Engine
OCR software, free and offline
Contexts Optical Compression
Accurate × Fast × Comprehensive
PDF to Markdown with vision models
OCRmyPDF adds an OCR text layer to scanned PDF files
OCR offline image text recognition command line windows program
Visual Causal Flow
Awesome multilingual OCR toolkits based on PaddlePaddle
Enhances Tesseract OCR output using LLMs (local or API)
Library for OCR-related tasks powered by Deep Learning
A cross-platform software for text translation and recognition
A pure Javascript Multilingual OCR
JavaScript OCR and text extraction for images and PDFs
AI tool that removes hardcoded subtitles and text from videos locally
Ready-to-use OCR with 80+ supported languages
Screenshots, word marking, OCR, AI, translation software
Readest is a modern, feature-rich ebook reader
Implementation of Video Diffusion Models
Free open-source non-linear video editor
Qwen3-VL, the multimodal large language model series by Alibaba Cloud
Convert AI papers to GUI
Implementation of Make-A-Video, new SOTA text to video generator
Use LLMs and LLM Vision (OCR) to handle paperless-ngx