Recognition and resolution of numbers, units, date/time, etc.
Oobabooga - The definitive Web UI for local AI, with powerful features
High-performance inference server for text embeddings models API layer
Document (PDF, Word, PPTX ...) extraction and parse API
Module for automatic summarization of text documents and HTML pages
Large Language Model Text Generation Inference
Text generator is a handy plugin for Obsidian
A playground to generate images from any text prompt using SD
Hypernetworks that adapt LLMs for specific benchmark tasks
Provides line-oriented text file editing capabilities
Speech-to-text, text-to-speech, and speaker recognition
Open Source OCR Engine
A GUI tool for extracting hard-coded subtitle (hardsub) from videos
AI tool that removes hardcoded subtitles and text from videos locally
A free, open source, and extensible speech-to-text application
Readest is a modern, feature-rich ebook reader
OCRmyPDF adds an OCR text layer to scanned PDF files
Awesome multilingual OCR toolkits based on PaddlePaddle
Speech Note Linux app. Note taking, reading and translating
Comprehensive Gradio WebUI for audio processing
A cross-platform software for text translation and recognition
Claude Code skill that removes signs of AI-generated writing from text
Focus on prompting and generating
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model