TTS with kokoro and onnx runtime
Offline Text To Speech synthesis for python
Offline inference engine for art, real-time voice conversations
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Synchronized Translation for Videos
Comprehensive Gradio WebUI for audio processing
Python library and CLI tool to interface with Google Translate
A sound cloning tool with a web interface, using your voice
The python library for real-time communication
Automatically translates the text of a video based on a subtitle file
High-Quality Voice Cloning TTS for 600+ Languages
Tokenizer-Free TTS for Multilingual Speech Generation
Generate audiobooks from e-books
Towards Human-Sounding Speech
Real-time voice interactive digital human
An Open Source text-to-speech system built by inverting Whisper
Framework for building neural networks
Controllable and fast Text-to-Speech for over 7000 languages
MARS5 speech model (TTS) from CAMB.AI
A Conversational Speech Generation Model
Multi-Voice and Prompt-Controlled TTS Engine
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Conditional Variational Autoencoder with Adversarial Learning
Implementation of a Transformer based neural network
Generative Adversarial Networks for Efficient and High Fidelity Speech