Clone a voice in 5 seconds to generate arbitrary speech in real-time
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)
Open source embedded speech-to-text engine
Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
Easy-OCR solution and Tesseract trainer for GNU/Linux
Library of deep learning models and datasets
IPTV/NVR/CCTV/Video cloud https://fastocloud.com
Free reCAPTCHA v2 solver
A Python package to analyze and compare voices with deep learning
Read written or imported text offline, or download it online.
DeepMind's Tacotron-2 TensorFlow implementation
PyTorch implementation of convolutional neural networks
TensorFlow Implementation of DC-TTS: yet another text-to-speech model
Uses RtlSdr to listen to radio, recognize audio, and write a text log file
Cross Audio-Visual Recognition using 3D Architectures
Just Another Speech Recognition and Text to Speech software.
Beamforming and Speech Recognition Toolkit
A cross-platform wrapper for common text-to-speech engines in Python
An Incremental Spoken Dialogue Processing Toolkit
Recommends music based upon your current taste.