An open source implementation of Deep Voice 3: Scaling Text-to-Speech with Convolutional Sequence Learning.
Features
- PyTorch implementation of convolutional networks-based text-to-speech synthesis models
- Documentation available
- Examples available
- Convolutional sequence-to-sequence model with attention for text-to-speech synthesis
- Multi-speaker and single speaker versions of DeepVoice3
- Audio samples and pre-trained models
- Preprocessor for LJSpeech (en), JSUT (jp) and VCTK datasets, as well as carpedm20/multi-speaker-tacotron-tensorflow compatible custom dataset (in JSON format)
- Language-dependent frontend text processor for English and Japanese
Categories
Machine LearningLicense
MIT LicenseFollow Deepvoice3_pytorch
Other Useful Business Software
Go From AI Idea to AI App Fast
Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
Rate This Project
Login To Rate This Project
User Reviews
Be the first to post a review of Deepvoice3_pytorch!