Multimodal AI Story Teller, built with Stable Diffusion, GPT, etc.
Chinese voice dialogue robot/smart speaker project
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech)
State-of-the-art Multilingual Question Answering research
A walk along memory lane
Singing Voice Synthesis via Shallow Diffusion Mechanism
WaveRNN Vocoder + TTS
A Deep-Learning-Based Chinese Speech Recognition System
A flexible and efficient library for deep learning
Clone a voice in 5 seconds to generate arbitrary speech in real-time
PAddle PARAllel text-to-speech toolKIT
Implementation of a Transformer-based neural network
Real-Time State-of-the-art Speech Synthesis for TensorFlow 2
Conditional Variational Autoencoder with Adversarial Learning
Aseryla code repositories
Generative Adversarial Networks for Efficient and High Fidelity Speech
A tool that uses AI to automatically recommend commit messages
An implementation of Tacotron 2 that supports multilingual experiments
Bangla text-to-speech synthesis in Python
A platform for Artificial Intelligence experimentation on Minecraft
A multi-modeling and simulation environment to study complex systems
Toolkit for efficient experimentation with Speech Recognition
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Tool to parse the command line and configuration files