TTS with kokoro and onnx runtime
Offline Text To Speech synthesis for python
Offline inference engine for art, real-time voice conversations
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Synchronized Translation for Videos
Comprehensive Gradio WebUI for audio processing
Python library and CLI tool to interface with Google Translate
The python library for real-time communication
Automatically translates the text of a video based on a subtitle file
High-Quality Voice Cloning TTS for 600+ Languages
A sound cloning tool with a web interface, using your voice
Tokenizer-Free TTS for Multilingual Speech Generation
Generate audiobooks from e-books
Towards Human-Sounding Speech
Real-time voice interactive digital human
An Open Source text-to-speech system built by inverting Whisper
Framework for building neural networks
Controllable and fast Text-to-Speech for over 7000 languages
MARS5 speech model (TTS) from CAMB.AI
A Conversational Speech Generation Model
Mice speech to text with MX Cinnamon OS ISO
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Multi-Voice and Prompt-Controlled TTS Engine
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Conditional Variational Autoencoder with Adversarial Learning