SOTA discrete acoustic codec models with 40/75 tokens per second
Synchronized Translation for Videos
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Speech-AI-Forge is a project developed around TTS generation models
Automatically translates the text of a video based on a subtitle file
Multi-lingual large voice generation model, providing inference
A fast TTS architecture with conditional flow matching
Real-time voice interactive digital human
Bailing is a voice dialogue robot similar to GPT-4o
Converts text to speech in real time
End-to-end speech processing toolkit
Controllable and fast Text-to-Speech for over 7000 languages
VITS2 backbone with multilingual-bert
Toolkit for audio, music, and speech generation
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Unlimited text-to-speech conversion
Mice speech to text with MX Cinnamon OS ISO
Text to Speech Utility
Unofficial Parallel WaveGAN
Best practice TTS based on BERT and VITS
A web UI for various audio-related neural networks
WaveRNN Vocoder + TTS
Conditional Variational Autoencoder with Adversarial Learning
Generative Adversarial Networks for Efficient and High Fidelity Speech