BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.
Features
- Portable to Unix-like Systems with the G++ compiler and SWIG
- Both C++ and Python interfaces
- Abundant classes and functions for microphone array processing and speech recognition
- Efficient handling for a block of incoming audio samples that makes BTK suitable for real-time prototypes
- Free software
Follow Distant Speech Recognition
You Might Also Like