Long-form streaming TTS system for multi-speaker dialogue generation
Spark-TTS Inference Code
Framework for building neural networks
StreamSpeech is a seamless model for offline speech recognition
Controllable & emotion-expressive zero-shot TTS
SOTA discrete acoustic codec models with 40/75 tokens per second
Controllable and fast Text-to-Speech for over 7000 languages
One-click deployment (including offline integration package)
A TTS model capable of generating ultra-realistic dialogue
Bailing is a voice dialogue robot similar to GPT-4o
An Open Source text-to-speech system built by inverting Whisper
Reading book source
Virtual AI anchor that combines state-of-the-art technology
High-quality multi-lingual text-to-speech library by MyShell.ai
A Conversational Speech Generation Model
Chuyển đổi văn bản thành giọng nói không giới hạn
Mice speech to text with MX Cinnamon OS ISO
Offline desktop app to convert EPUB to MP3 using Kokoro-82M neural TTS
Towards Human-Level Text-to-Speech through Style Diffusion
Toolkit for audio, music, and speech generation
Text to Speech Utility
VITS2 backbone with multilingual-bert
Multi-Voice and Prompt-Controlled TTS Engine
Unofficial Parallel WaveGAN