A pure Javascript Multilingual OCR
A MCP Server for the RAG Web Browser Actor
An easy 1-click way to create beautiful artwork on your PC using AI
A minimal LLM chat app that runs entirely in your browser
JavaScript OCR and text extraction for images and PDFs
A playground to generate images from any text prompt using SD
Speech to Text to Speech, sends text as OSC messages
A simple, high-quality voice conversion tool focused on ease of use
Speech Note Linux app. Note taking, reading and translating
Code for openai.fm, a demo for the OpenAI Speech API
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Lightning-fast, on-device TTS, running natively via ONNX
Ollama JavaScript library
A simple native web interface that uses ChatTTS to synthesize text
OpenAI gpt-image-2 API
Canvas-based WYSIWYG rich text editor with advanced layout tools
A persistent, network resilient, full text search library
A nearly-live implementation of OpenAI's Whisper
Speech-AI-Forge is a project developed around TTS generation model
A Web UI for easy subtitle using whisper model
Self-host the powerful Chatterbox TTS model
A sound cloning tool with a web interface, using your voice
Use Microsoft Edge's online text-to-speech service from Python
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
A fast TTS architecture with conditional flow matching