Speech-AI-Forge is a project developed around TTS generation model
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model
VITS2 backbone with multilingual-bert
A fast TTS architecture with conditional flow matching
Build Vision Agents quickly with any model or video provider
An Open Source text-to-speech system built by inverting Whisper
Management of Yandex Station and other smart home devices
SOTA discrete acoustic codec models with 40/75 tokens per second
Reading book source
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Unofficial Parallel WaveGAN
Best practice TTS based on BERT and VITS
A Conversational Speech Generation Model
Text to Speech Utility
Chuyển đổi văn bản thành giọng nói không giới hạn
Mice speech to text with MX Cinnamon OS ISO
Chinese voice dialogue robot/smart speaker project
A webui for different audio related Neural Networks
Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]
Singing Voice Synthesis via Shallow Diffusion Mechanism
WaveRNN Vocoder + TTS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General Speech Restoration
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2