automatic free download

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper.cpp is a lightweight, C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model—designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp. The rest of the code is part of the ggml machine learning library. The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples. whisper.cpp supports integer quantization of the Whisper ggml models. ...

Downloads: 483 This Week

Last Update: 2026-01-15

See Project

OpenVINO

OpenVINO™ Toolkit repository

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference. Boost deep learning performance in computer vision, automatic speech recognition, natural language processing and other common tasks. Use models trained with popular frameworks like TensorFlow, PyTorch and more. Reduce resource demands and efficiently deploy on a range of Intel® platforms from edge to cloud. This open-source version includes several components: namely Model Optimizer, OpenVINO™ Runtime, Post-Training Optimization Tool, as well as CPU, GPU, MYRIAD, multi device and heterogeneous plugins to accelerate deep learning inferencing on Intel® CPUs and Intel® Processor Graphics. ...

Downloads: 22 This Week

Last Update: 2025-12-10

See Project

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project

Kaldi is an open source toolkit for speech recognition research. It provides a powerful framework for building state-of-the-art automatic speech recognition (ASR) systems, with support for deep neural networks, Gaussian mixture models, hidden Markov models, and other advanced techniques. The toolkit is widely used in both academia and industry due to its flexibility, extensibility, and strong community support. Kaldi is designed for researchers who need a highly customizable environment to experiment with new algorithms, as well as for practitioners who want robust, production-ready ASR pipelines. ...

Downloads: 6 This Week

Last Update: 7 hours ago

See Project

Flashlight library

A C++ standalone library for machine learning

Flashlight is a fast, flexible machine learning library written entirely in C++ by Facebook AI Research and the creators of Torch, TensorFlow, Eigen, and Deep Speech. Native support in C++ and simple extensibility make Flashlight a powerful research framework that's hackable to its core and enables fast iteration on new experimental setups and algorithms with little unopinionated and without sacrificing performance. In a single repository, Flashlight provides apps for research across...

Downloads: 0 This Week

Last Update: 2022-05-27

See Project

wav2letter++

Facebook AI research's automatic speech recognition toolkit

First, install Flashlight (using the 0.3 branch is required) with the ASR application. This repository includes recipes to reproduce the following research papers as well as pre-trained models. All results reproduction must use Flashlight <= 0.3.2 for exact reproducibility. At least one of LZMA, BZip2, or Z is required for LM compression with KenLM. It is highly recommended to build KenLM with position-independent code (-fPIC) enabled, to enable python compatibility. After installing, run...

Downloads: 0 This Week

Last Update: 2022-05-27

See Project

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

...Audio conversion and cutting sound files into smaller ones. 2. Searching for words or phrases in sound files (recognized by SkryBot). 3. Editing sound files and automatic cutting off long silence parts in audio file.

2 Reviews

Downloads: 2 This Week

Last Update: 2020-03-15

See Project

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.

Downloads: 0 This Week

Last Update: 2019-08-21

See Project

Voice XML Enabling Software

Voice XML Enabling Software (VXES) is an application that connects a VoiceXML Interpreter, a telephony platform, and MRCP servers that provide services for Automatic Speech Recognition and Text to Speech Synthesis. C++, Windows & Linux OS supported

Downloads: 0 This Week

Last Update: 2016-02-21

See Project

Search Results for "automatic"

Showing 8 open source projects for "automatic"

whisper.cpp

OpenVINO

Kaldi

Flashlight library

wav2letter++

Speech Recognition in English & Polish

Distant Speech Recognition

Voice XML Enabling Software

Search Results for "automatic"

Showing 8 open source projects for "automatic"

whisper.cpp

OpenVINO

Kaldi

Flashlight library

wav2letter++

Speech Recognition in English & Polish

Distant Speech Recognition

Voice XML Enabling Software

Related Searches

Related Categories