Toolkit for audio, music, and speech generation
Bailing is a voice dialogue robot similar to GPT-4o
Scalable generative AI framework built for researchers and developers
Industrial-level controllable zero-shot text-to-speech system
Automatically translates the text of a video based on a subtitle file
Real-time voice interactive digital human
Android system TTS application with Microsoft demo interface
Deep learning for text to speech
DeepMind's Tacotron-2 Tensorflow implementation
Toolkit for efficient experimentation with Speech Recognition