Towards Human-Level Text-to-Speech through Style Diffusion
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
An Open Source text-to-speech system built by inverting Whisper
Virtual AI anchor that combines state-of-the-art technology
A fast TTS architecture with conditional flow matching
Singing Voice Synthesis via Shallow Diffusion Mechanism