A pure Javascript Multilingual OCR
Python library and CLI tool to interface with Google Translate
Self-host the powerful Chatterbox TTS model
Generate music based on natural language prompts using LLMs
Open source text-to-speech tool, supports extra-long text
A fast, helpful, and open-source document parser
Label Studio is a multi-type data labeling and annotation tool
Google Gen AI Python SDK provides an interface for developers
Awesome multilingual OCR toolkits based on PaddlePaddle
A collection of awesome-lists for AI, creativity and art. AI
The most powerful and modular diffusion model GUI, api and backend
LLM abstractions that aren't obstructions
Claude Code skill that removes signs of AI-generated writing from text
The free, Open Source alternative to OpenAI, Claude and others
Transcribe on your own
chat web app for teams, sass with user management and ratelimit
Run Codex Mobile Anywhere: Linux, Windows, or Termux on Android
Clippy, now with some AI
A Web UI for easy subtitle using whisper model
Central interface to connect your LLM's with external data
User-friendly AI Interface
A single Gradio + React WebUI with extensions for ACE-Step
Deploy your private Gemini application for free with one click
The agent that grows with you
A fast TTS architecture with conditional flow matching