Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Library of deep learning models and datasets
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
DeepMind's Tacotron-2 TensorFlow implementation
PyTorch implementation of convolutional neural networks
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Uses RtlSdr to listen to radio, recognize audio, and write a text log file
Cross Audio-Visual Recognition using 3D Architectures
A cross-platform wrapper for common text-to-speech engines in Python
An Incremental Spoken Dialogue Processing Toolkit
Dia-1.6B generates lifelike English dialogue and vocal expressions