Synchronized Translation for Videos
EPUB to audiobook converter, optimized for Audiobookshelf
A sound cloning tool with a web interface, using your voice
Comprehensive Gradio WebUI for audio processing
Code for openai.fm, a demo for the OpenAI Speech API
Generate audiobooks from e-books
Speech to Text to Speech, sends text as OSC messages
Offline inference engine for art, real-time voice conversations
TTS with kokoro and onnx runtime
Automatically translates the text of a video based on a subtitle file
Cross-platform AI language practice app
Offline Text To Speech synthesis for python
Towards Human-Sounding Speech
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Dicio assistant app for Android
C++ inference library for multiple SVC/TTS
Controllable and fast Text-to-Speech for over 7000 languages
Python library and CLI tool to interface with Google Translate
The python library for real-time communication
An Open Source text-to-speech system built by inverting Whisper
Framework for building neural networks
MARS5 speech model (TTS) from CAMB.AI
Real-time voice interactive digital human
Multi-Voice and Prompt-Controlled TTS Engine
Amica is an open source interface for interactive communication