OCRmyPDF adds an OCR text layer to scanned PDF files
Web presentation editor replicating many PowerPoint features online
Integrate the opencode AI assistant with Neovim
Speech Note Linux app. Note taking, reading and translating
Awesome multilingual OCR toolkits based on PaddlePaddle
Focus on prompting and generating
Code for openai.fm, a demo for the OpenAI Speech API
Claude Code skill that removes signs of AI-generated writing from text
Speech to Text to Speech, sends text as OSC messages
Generate audiobooks from EPUBs, PDFs and text with captions
TTS with Kokoro and ONNX Runtime
The media player for language learning, with dual subtitles
Pure JavaScript multilingual OCR
Cross-platform software for text translation and recognition
Offline OCR command-line program for Windows that recognizes text in images
Official inference repo for FLUX.1 models
Wan2.2: Open and Advanced Large-Scale Video Generative Model
SOTA Open Source TTS
Tokenizer-Free TTS for Multilingual Speech Generation
A simple native web interface that uses ChatTTS to synthesize text into speech
Code for running inference and finetuning with the SAM 3 model
Robust Speech Recognition via Large-Scale Weak Supervision
Wan2.1: Open and Advanced Large-Scale Video Generative Model
A robust, efficient, low-latency speech-to-text library
Qwen3-TTS is an open-source series of TTS models