Instant voice cloning by MIT and MyShell. Audio foundation model
A simple native web interface that uses ChatTTS to synthesize text
Controllable & emotion-expressive zero-shot TTS
Style-Bert-VITS2: Bert-VITS2 with more controllable voice styles
A nearly-live implementation of OpenAI's Whisper
SOTA Open Source TTS
Towards Human-Sounding Speech
Controllable and fast Text-to-Speech for over 7000 languages
Spark-TTS Inference Code
Speech-AI-Forge is a project developed around TTS generation model
Mice speech to text with MX Cinnamon OS ISO
A webui for different audio related Neural Networks
The open-source virtual assistant for Ubuntu based Linux distributions