Instant voice cloning by MIT and MyShell. Audio foundation model
Scalable generative AI framework built for researchers and developers
Bailing is a voice dialogue robot similar to GPT-4o
Build Vision Agents quickly with any model or video provider
Toolkit for audio, music, and speech generation
One-click deployment (including offline integration package)
Framework for building neural networks
SOTA discrete acoustic codec models with 40/75 tokens per second
Virtual AI anchor that combines state-of-the-art technology
Open source implementation of Microsoft's VALL-E X zero-shot TTS model
Chuyển đổi văn bản thành giọng nói không giới hạn
Unofficial Parallel WaveGAN
Best practice TTS based on BERT and VITS
Text to Speech Utility
Mice speech to text with MX Cinnamon OS ISO
A Conversational Speech Generation Model
A webui for different audio related Neural Networks
Chinese voice dialogue robot/smart speaker project
Txt-2-Mp3 6.3 Mark 2 [Improved.Simplified.Alternative]
Singing Voice Synthesis via Shallow Diffusion Mechanism
WaveRNN Vocoder + TTS
Clone a voice in 5 seconds to generate arbitrary speech in real-time
General Speech Restoration
Real-Time State-of-the-art Speech Synthesis for Tensorflow 2