BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.
Features
- Portable to Unix-like Systems with the G++ compiler and SWIG
- Both C++ and Python interfaces
- Abundant classes and functions for microphone array processing and speech recognition
- Efficient handling for a block of incoming audio samples that makes BTK suitable for real-time prototypes
- Free software
Follow Distant Speech Recognition
Other Useful Business Software
Keep company data safe with Chrome Enterprise
Make AI work your way with Chrome Enterprise. Block unapproved sites and set custom data controls that align with your company's policies.