Control Any Computer Using LLMs
Document (PDF, Word, PPTX ...) extraction and parse API
High-performance inference server for text embeddings models API layer
Focus on prompting and generating
A playground to generate images from any text prompt using SD
An easy 1-click way to create beautiful artwork on your PC using AI
Stable Diffusion web UI
TTS with kokoro and onnx runtime
GUI for a Vocal Remover that uses Deep Neural Networks
OCR software, free and offline
LLM Frontend for Power Users
Python binding to the Apache Tika™ REST services
A simple native web interface that uses ChatTTS to synthesize text
Code for openai.fm, a demo for the OpenAI Speech API
Speech Note Linux app. Note taking, reading and translating
A sound cloning tool with a web interface, using your voice
Coding agent for DeepSeek models that runs in your terminal
A simple, high-quality voice conversion tool focused on ease of use
Generate audiobooks from e-books
Self-hosted AI workspace from PewDiePie
A text-to-speech, speech-to-text and speech-to-speech library
OCR offline image text recognition command line windows program
Interface for OuteTTS models
A minimal LLM chat app that runs entirely in your browser
Python library and CLI tool to interface with Google Translate