DC-TTS is a TensorFlow implementation of the DC-TTS architecture, a fully convolutional text-to-speech system designed to be efficiently trainable while producing natural speech. It follows the “Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention” paper, but the author adapts and extends the design to make it practical for real experiments. The model is split into two networks: Text2Mel, which maps text to mel-spectrograms, and SSRN (spectrogram super-resolution network), which converts low-resolution mel-spectrograms into high-resolution magnitude spectrograms suitable for waveform synthesis. Training scripts, data loaders, and hyperparameter configurations are provided to reproduce results on several datasets, including LJ Speech for English, a Korean single-speaker dataset, and audiobook data from Nick Offerman and Kate Winslet.

Features

  • TensorFlow implementation of the DC-TTS architecture with convolution-only networks for text-to-speech
  • Two-stage pipeline with Text2Mel and SSRN networks for mel-spectrogram generation and super-resolution
  • Ready-made training scripts, data loaders, and hyperparameters for multiple English and Korean speech datasets
  • Guided attention mechanism that encourages monotonic alignments and stabilizes training
  • Support for normalization, dropout, and learning-rate decay to improve robustness versus the original paper
  • Pretrained model for LJ Speech plus synthesis utilities to generate audio samples directly from text

Project Samples

Project Activity

See All Activity >

Categories

Text to Speech

License

Apache License V2.0

Follow DC-TTS

DC-TTS Web Site

Other Useful Business Software
Our Free Plans just got better! | Auth0 Icon
Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Try free now
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of DC-TTS!

Additional Project Details

Programming Language

Python

Related Categories

Python Text to Speech Software

Registered

16 hours ago