Hypernetworks that adapt LLMs for specific benchmark tasks
Document (PDF, Word, PPTX ...) extraction and parse API
A playground to generate images from any text prompt using SD
Practical productivity tools for Claude Code, Codex-CLI
Readest is a modern, feature-rich ebook reader
Text and image to video generation: CogVideoX and CogVideo
OCR offline image text recognition command line windows program
Awesome multilingual OCR toolkits based on PaddlePaddle
A single Gradio + React WebUI with extensions for ACE-Step
Qwen3-TTS is an open-source series of TTS models
Chat with it via text and voice
Text mining using tidy tools
Generate audiobooks from EPUBs, PDFs and text with captions
A TTS that fits in your CPU (and pocket)
Code for openai.fm, a demo for the OpenAI Speech API
The media player for language learning, with dual subtitles
A robust, efficient, low-latency speech-to-text library
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Canvas-based WYSIWYG rich text editor with advanced layout tools
Framework for building real-time voice and multimodal AI agents
Stanford CoreNLP, a Java suite of core NLP tools
IronClaw is OpenClaw inspired but focused on privacy & security
Deep Research framework, combining language models with tools
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Enhances Tesseract OCR output using LLMs (local or API)