A Deep-Learning-Based Chinese Speech Recognition System
ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.
RtlSdr listen to radio, recognize audio, and writes text file log
Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation.
https://sourceforge.net/projects/fm2txt-rtlsdr/files/
If you prefer GitHub source, not SF:
https://github.com/randaller/fm2txt
For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc):
https://github.com/randaller/souncard2txt
A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.