speaker free download

The SpeechBrain Toolkit

A PyTorch-based Speech Toolkit

...SpeechBrain supports state-of-the-art methods for end-to-end speech recognition, including models based on CTC, CTC+attention, transducers, transformers, and neural language models relying on recurrent neural networks and transformers. Speaker recognition is already deployed in a wide variety of realistic applications. SpeechBrain provides different models for speaker recognition, including X-vector, ECAPA-TDNN, PLDA, and contrastive learning. Spectral masking, spectral mapping, and time-domain enhancement are different methods already available within SpeechBrain. Separation methods such as Conv-TasNet, DualPath RNN, and SepFormer are implemented as well. ...

Downloads: 1 This Week

Last Update: 2026-03-30

See Project

CMU Sphinx

Speech Recognition Toolkit

...----> Maintenance and improvement work has MOVED to https://cmusphinx.github.io/ Please go there for the most recent software and documentation. <---- CMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.

58 Reviews

Downloads: 307 This Week

Last Update: 2024-01-11

See Project

Lip Reading

Cross Audio-Visual Recognition using 3D Architectures

...Audio-visual recognition (AVR) has been considered as a solution for speech recognition tasks when the audio is corrupted, as well as a visual recognition method used for speaker verification in multi-speaker scenarios. The approach of AVR systems is to leverage the extracted information from one modality to improve the recognition ability of the other modality by complementing the missing information. The essential problem is to find the correspondence between the audio and visual streams, which is the goal of this work. ...

Downloads: 4 This Week

Last Update: 2022-08-11

See Project

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.

Downloads: 1 This Week

Last Update: 2019-08-21

See Project

Arabisc

Arabisc is speaker independent large vocabulary continuous speech recognizer for Arabic language released under GNU license.It is also a collection of open source tools that allows researchers and developers to build speech recognition systems for Arab

1 Review

Downloads: 1 This Week

Last Update: 2013-04-26

See Project

Search Results for "speaker"

Showing 5 open source projects for "speaker"

The SpeechBrain Toolkit

CMU Sphinx

Lip Reading

Distant Speech Recognition

Arabisc

Search Results for "speaker"

Showing 5 open source projects for "speaker"

The SpeechBrain Toolkit

CMU Sphinx

Lip Reading

Distant Speech Recognition

Arabisc

Related Searches

Related Categories