Robust Speech Recognition via Large-Scale Weak Supervision
Training data (data labeling, annotation, workflow) for all data types
kaldi-asr/kaldi is the official location of the Kaldi project
A PyTorch-based Speech Toolkit
A subtitle generator for Japanese Adult Videos.
Mice speech to text with MX Cinnamon OS ISO
Library of deep learning models and datasets
Cross Audio-Visual Recognition using 3D Architectures