State-of-the-art TTS model under 25MB
A nearly-live implementation of OpenAI's Whisper
Generate audiobooks from e-books
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
GLM-4-Voice | End-to-End Chinese-English Conversational Model
Speech-AI-Forge is a project developed around TTS generation model
Towards Human-Sounding Speech
A text-to-speech, speech-to-text and speech-to-speech library
A lightweight text-to-speech model with zero-shot voice cloning
Controllable & emotion-expressive zero-shot TTS
MARS5 speech model (TTS) from CAMB.AI
Converts text to speech in realtime