BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.

Features

  • Portable to Unix-like Systems with the G++ compiler and SWIG
  • Both C++ and Python interfaces
  • Abundant classes and functions for microphone array processing and speech recognition
  • Efficient handling for a block of incoming audio samples that makes BTK suitable for real-time prototypes
  • Free software

Project Samples

Project Activity

See All Activity >

Follow Distant Speech Recognition

Distant Speech Recognition Web Site

Other Useful Business Software
Gen AI apps are built with MongoDB Atlas Icon
Gen AI apps are built with MongoDB Atlas

The database for AI-powered applications.

MongoDB Atlas is the developer-friendly database used to build, scale, and run gen AI and LLM-powered apps—without needing a separate vector database. Atlas offers built-in vector search, global availability across 115+ regions, and flexible document modeling. Start building AI apps faster, all in one place.
Start Free

Additional Project Details

Operating Systems

Cygwin, Linux

Programming Language

C++, Python

Related Categories

Python Algorithms, Python HMI Software, Python Sound Audio, Python Speech Recognition Software, C++ Algorithms, C++ HMI Software, C++ Sound Audio, C++ Speech Recognition Software

Registered

2015-03-30