A text-to-speech, speech-to-text and speech-to-speech library
WhatsApp MCP server enabling AI access to chats and messaging
The Triton Inference Server provides an optimized cloud
A nearly-live implementation of OpenAI's Whisper
Mopidy is an extensible music server written in Python
Instant voice cloning by MIT and MyShell. Audio foundation model
Automated Music Discovery and Collection Manager
Swing Music is a beautiful, self-hosted music player
Free, high-quality text-to-speech API endpoint to replace OpenAI
The music player of today
Music Assistant is a free, open-source media library manager
Interface for OuteTTS models
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Voice Recognition to Text Tool
Video editing with Python
Official MiniMax Model Context Protocol (MCP) server
A simple native web interface that uses ChatTTS to synthesize text
A lightweight text-to-speech model with zero-shot voice cloning
Framework for building real-time multimodal voice AI agent apps
TensorBoard for PyTorch (and Chainer, MXNet, NumPy, etc.)
Build cross-modal and multimodal applications on the cloud
Minimal Debian-based RDP thin client OS with admin/user lock modes
UFONet - Denial of Service Toolkit