Long-form streaming TTS system for multi-speaker dialogue generation
Interface for OuteTTS models
TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
Open-source multi-speaker long-form text-to-speech model
Self-hosted AI audio transcription
A Web UI for easy subtitle using whisper model
super expressive prompting model based on ltx2.3
MOSS‑TTS Family open‑source speech and sound generation model
Instantly generate AI-powered subtitles on your device
High-Quality Voice Cloning TTS for 600+ Languages
One-click deployment (including offline integration package)
Web presentation editor replicating many PowerPoint features online
End-to-end speech processing toolkit
MARS5 speech model (TTS) from CAMB.AI
Foundational model for human-like, expressive TTS
Synchronized Translation for Videos
Towards Human-Level Text-to-Speech through Style Diffusion
Conditional Variational Autoencoder with Adversarial Learning
A python package to analyze and compare voices with deep learning
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Artificial intelligence evolves musical instruments played with mouse
Multilingual voice cloning model with 6-second voice samples
Dia-1.6B generates lifelike English dialogue and vocal expressions