Even 1 minute of voice data can be used to train a good TTS model
Instant voice cloning by MIT and MyShell. Audio foundation model
Generate audiobooks from e-books
SOTA Open Source TTS
Multi-lingual large voice generation model, providing inference
Python library and CLI tool to interface with Google Translate
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
Long-form streaming TTS system for multi-speaker dialogue generation
A lightweight text-to-speech model with zero-shot voice cloning
Interface for OuteTTS models
The official Python library for the Fish Audio API
Open source implementation of Microsoft's VALL-E X zero-shot TTS model