TTS with kokoro and onnx runtime
Synchronized Translation for Videos
Offline inference engine for art, real-time voice conversations
Offline Text To Speech synthesis for python
Automatically translates the text of a video based on a subtitle file
A sound cloning tool with a web interface, using your voice
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Python library and CLI tool to interface with Google Translate
Generate audiobooks from e-books
Comprehensive Gradio WebUI for audio processing
Towards Human-Sounding Speech
The python library for real-time communication
An Open Source text-to-speech system built by inverting Whisper
Controllable and fast Text-to-Speech for over 7000 languages
Real-time voice interactive digital human
Framework for building neural networks
MARS5 speech model (TTS) from CAMB.AI
Multi-Voice and Prompt-Controlled TTS Engine
A Conversational Speech Generation Model
Easy AI Softwares for Blind, Deaf, Handicapped, Disabled People
Clone a voice in 5 seconds to generate arbitrary speech in real-time
Conditional Variational Autoencoder with Adversarial Learning
Implementation of a Transformer based neural network
Generative Adversarial Networks for Efficient and High Fidelity Speech
A cross-platform wrapper for common text-to-speech engines in Python