Use Microsoft Edge's online text-to-speech service from Python
Multi-Voice and Prompt-Controlled TTS Engine
Toolkit for audio, music, and speech generation
Towards Human-Level Text-to-Speech through Style Diffusion
End-to-end speech processing toolkit
Multi-lingual large voice generation model, providing inference
A TTS model capable of generating ultra-realistic dialogue
Industrial-level controllable zero-shot text-to-speech system
A Conversational Speech Generation Model
Conditional Variational Autoencoder with Adversarial Learning