A pure Javascript Multilingual OCR
A MCP Server for the RAG Web Browser Actor
A playground to generate images from any text prompt using SD
An easy 1-click way to create beautiful artwork on your PC using AI
A minimal LLM chat app that runs entirely in your browser
JavaScript OCR and text extraction for images and PDFs
Speech to Text to Speech, sends text as OSC messages
Ito, smart dictation in every application
A simple, high-quality voice conversion tool focused on ease of use
Code for openai.fm, a demo for the OpenAI Speech API
Run local LLMs like llama, deepseek, kokoro etc. inside your browser
Lightning-fast, on-device TTS, running natively via ONNX
A simple native web interface that uses ChatTTS to synthesize text
Ollama JavaScript library
Canvas-based WYSIWYG rich text editor with advanced layout tools
A persistent, network resilient, full text search library
A nearly-live implementation of OpenAI's Whisper
Speech-AI-Forge is a project developed around TTS generation model
Self-host the powerful Chatterbox TTS model
A Web UI for easy subtitle using whisper model
A sound cloning tool with a web interface, using your voice
Use Microsoft Edge's online text-to-speech service from Python
WebAssembly binding for llama.cpp - Enabling on-browser LLM inference
A fast TTS architecture with conditional flow matching
Browser extension and cross-platform desktop app based on ChatGPT API