Foundational model for human-like, expressive TTS
Speech-AI-Forge is a project developed around TTS generation model
Generate audiobooks from e-books, voice cloning & 1107+ languages
Industrial-level controllable zero-shot text-to-speech system
Virtual AI anchor that combines state-of-the-art technology
Open-source multi-speaker long-form text-to-speech model
Open-source framework for intelligent speech interaction
Automatically translates the text of a video based on a subtitle file
Bailing is a voice dialogue robot similar to GPT-4o
Oobabooga - The definitive Web UI for local AI, with powerful features
End-to-end speech processing toolkit
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
StreamSpeech is a seamless model for offline speech recognition
Generate audiobooks from EPUBs, PDFs and text with captions
A lightning fast audio upsampler
Long-form streaming TTS system for multi-speaker dialogue generation
MARS5 speech model (TTS) from CAMB.AI
A simple native web interface that uses ChatTTS to synthesize text
A high-quality rapid TTS voice cloning model
Python library and CLI tool to interface with Google Translate
The official Python SDK for the ElevenLabs API
Multi-lingual large voice generation model, providing inference
Speakr is a personal, self-hosted web application
Easy-to-use Speech Toolkit including Self-Supervised Learning model
Controllable and fast Text-to-Speech for over 7000 languages