Focus on prompting and generating
Extract one-time password (OTP) secrets from QR codes
Image/video AI upscaler app (BSRGAN)
TTS with Kokoro and ONNX Runtime
Python tool for converting files and office documents to Markdown
Awesome multilingual OCR toolkits based on PaddlePaddle
Qwen3-TTS is an open-source series of TTS models
The most powerful and modular diffusion model GUI, API and backend
OCR software, free and offline
Open-Source Python3 tool for recognizing layouts, tables, and math
SOTA Open Source TTS
Open-source plain text editor designed for writing novels
Generate audiobooks from EPUBs, PDFs and text with captions
Wan2.2: Open and Advanced Large-Scale Video Generative Model
Wan2.1: Open and Advanced Large-Scale Video Generative Model
TikZ figures for concepts in physics/chemistry/ML
Library for OCR-related tasks powered by Deep Learning
Official inference repo for FLUX.1 models
A simple native web interface that uses ChatTTS to synthesize text
Contexts Optical Compression
Tokenizer-Free TTS for Multilingual Speech Generation
Mozc - a Japanese Input Method Editor designed for multi-platform
Handwritten Text Recognition (HTR) system implemented with TensorFlow
Robust Speech Recognition via Large-Scale Weak Supervision
High-Quality Voice Cloning TTS for 600+ Languages