Document (PDF, Word, PPTX ...) extraction and parse API
A playground to generate images from any text prompt using SD
Hypernetworks that adapt LLMs for specific benchmark tasks
Practical productivity tools for Claude Code, Codex-CLI
Readest is a modern, feature-rich ebook reader
Text and image to video generation: CogVideoX and CogVideo
Awesome multilingual OCR toolkits based on PaddlePaddle
OCR offline image text recognition command line windows program
A single Gradio + React WebUI with extensions for ACE-Step
Qwen3-TTS is an open-source series of TTS models
Chat with it via text and voice
Text mining using tidy tools
Code for openai.fm, a demo for the OpenAI Speech API
Screenshots, word marking, OCR, AI, translation software
Generate audiobooks from EPUBs, PDFs and text with captions
A robust, efficient, low-latency speech-to-text library
Deep Research framework, combining language models with tools
The media player for language learning, with dual subtitles
A TTS that fits in your CPU (and pocket)
State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX
Canvas-based WYSIWYG rich text editor with advanced layout tools
Framework for building real-time voice and multimodal AI agents
Reading book source
Stanford CoreNLP, a Java suite of core NLP tools
World's first open-source, agentic video production system