Framework for building neural networks
Toolkit for audio, music, and speech generation
Controllable & emotion-expressive zero-shot TTS
A fast TTS architecture with conditional flow matching
State-of-the-art discrete acoustic codec models with 40/75 tokens per second
One-click deployment (including offline integration package)
Foundational model for human-like, expressive TTS
End-to-end speech processing toolkit
Multi-lingual large voice generation model, providing inference
A TTS model capable of generating ultra-realistic dialogue
Bailing is a voice dialogue robot similar to GPT-4o
Reading book source
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
MARS5 speech model (TTS) from CAMB.AI
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
A Conversational Speech Generation Model
Text to Speech Utility
Unlimited text-to-speech conversion
Mice speech-to-text with MX Cinnamon OS ISO
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Unofficial Parallel WaveGAN
Best practice TTS based on BERT and VITS
Chinese voice dialogue robot/smart speaker project