A nearly-live implementation of OpenAI's Whisper
A simple native web interface that uses ChatTTS to synthesize text
Generate audiobooks from EPUBs, PDFs and text with captions
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
Automatically translates the text of a video based on a subtitle file
Controllable and fast Text-to-Speech for over 7000 languages
MARS5 speech model (TTS) from CAMB.AI
A text-to-speech, speech-to-text and speech-to-speech library
Towards Human-Sounding Speech
Free, high-quality text-to-speech API endpoint to replace OpenAI
Virtual AI anchor that combines state-of-the-art technology
VITS2 backbone with multilingual-bert
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General Speech Restoration
Implementation of a Transformer based neural network
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Vinux is an Ubuntu derived distribution for blind & visually impaired.