Best Open Source BSD Speech Recognition Software 2026

whisper.cpp

Port of OpenAI's Whisper model in C/C++

whisper.cpp is a lightweight C/C++ reimplementation of OpenAI’s Whisper automatic speech recognition (ASR) model, designed for efficient, standalone transcription without external dependencies. The entire high-level implementation of the model is contained in whisper.h and whisper.cpp; the rest of the code is part of the ggml machine learning library. The project's quick-start command downloads the base.en model, converted to a custom ggml format, and runs inference on all .wav samples in the samples folder. whisper.cpp also supports integer quantization of the Whisper ggml models. Quantized models require less memory and disk space and, depending on the hardware, can be processed more efficiently.
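To illustrate why integer quantization shrinks a model, here is a minimal Python sketch of symmetric 8-bit block quantization. It is not ggml's actual format or code: the function names `quantize_block`, `dequantize_block`, and `storage_bytes`, and the one-scale-per-block layout, are assumptions made purely for illustration.

```python
import struct


def quantize_block(weights):
    """Map floats to int8-range integers plus one float scale (hypothetical layout)."""
    max_abs = max((abs(w) for w in weights), default=0.0)
    # Avoid dividing by zero when every weight in the block is 0.
    scale = max_abs / 127.0 if max_abs > 0 else 1.0
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return scale, quantized


def dequantize_block(scale, quantized):
    """Recover approximate floats from the quantized integers."""
    return [q * scale for q in quantized]


def storage_bytes(weights):
    """Return (float32 size, int8 size plus one float32 scale) in bytes."""
    float_size = len(struct.pack(f"{len(weights)}f", *weights))
    int8_size = len(weights) + 4
    return float_size, int8_size


if __name__ == "__main__":
    block = [0.12, -0.5, 0.33, 0.9, -0.07, 0.0, 0.41, -0.88]
    scale, q = quantize_block(block)
    restored = dequantize_block(scale, q)
    print("bytes (float32, int8):", storage_bytes(block))
    print("max round-trip error:", max(abs(a - b) for a, b in zip(block, restored)))
```

The round-trip error is bounded by half the scale, which is the trade-off behind the smaller footprint: each weight costs one byte instead of four, at the price of a small approximation error.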

Downloads: 779 This Week

Last Update: 2026-06-19

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented as a sequence of tokens to be predicted by the decoder, allowing a single model to replace many stages of a traditional speech-processing pipeline. The multitask training format uses a set of special tokens that serve as task specifiers or classification targets.
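The idea of special tokens acting as task specifiers can be sketched in a few lines of Python. The token spellings below follow the format described for Whisper, but `build_task_prompt` is a hypothetical helper written for this illustration and is not part of the whisper package.

```python
def build_task_prompt(language, task, timestamps=False):
    """Build the list of special tokens that tells the decoder what to do.

    Hypothetical helper: it only assembles strings and does no tokenization.
    """
    if task not in ("transcribe", "translate"):
        raise ValueError(f"unsupported task: {task!r}")
    tokens = [
        "<|startoftranscript|>",  # marks the beginning of a prediction
        f"<|{language}|>",        # language tag, also the language-ID target
        f"<|{task}|>",            # task specifier
    ]
    if not timestamps:
        tokens.append("<|notimestamps|>")  # request text without timestamps
    return tokens


if __name__ == "__main__":
    print(build_task_prompt("en", "transcribe"))
    print(build_task_prompt("fr", "translate", timestamps=True))
```

Because the task is encoded in the token sequence itself, the same model weights can serve transcription, translation, and language identification without separate pipeline stages.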

Downloads: 75 This Week

Last Update: 2025-06-26

Google2SRT

Download, save and convert multiple subtitles from YouTube videos

Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type the video's URL, and Google2SRT will do the rest.
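For readers unfamiliar with SubRip, the following Python sketch shows what a conversion to .srt involves. It does not reproduce Google2SRT's own code (which is not shown here); the `(start, end, text)` tuple input and the function names `format_timestamp` and `to_srt` are assumptions for illustration.

```python
def format_timestamp(seconds):
    """Format a time in seconds as the SubRip timestamp HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(captions):
    """Render (start_seconds, end_seconds, text) tuples as SubRip text.

    Each cue is a 1-based index, a time range, the caption text,
    and a blank line separating it from the next cue.
    """
    blocks = []
    for index, (start, end, text) in enumerate(captions, start=1):
        time_range = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{time_range}\n{text}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    print(to_srt([(0.0, 1.5, "Hello"), (1.5, 3.0, "World")]))
```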

33 Reviews

Downloads: 34 This Week

Last Update: 2025-01-11

Kaldi

Speech recognition research toolkit

13 Reviews

Downloads: 7 This Week

Last Update: 2016-02-19

Voce

A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.

3 Reviews

Downloads: 2 This Week

Last Update: 2013-10-03

Open Pandora's Box

Pandora is an artificially intelligent, web-based bot

Pandora is an artificially intelligent, web-based bot written in Java. Pandora is a component-based AI architecture including database memory, XML, voice, voice recognition, chat, IRC, HTTP, Wiktionary, Freebase, consciousness, language, GUI, applet, web, JSP, Android

1 Review

Downloads: 3 This Week

Last Update: 2013-11-20

ViaVoice tools

Tools for use with IBM's ViaVoice speech recognition product

Downloads: 1 This Week

Last Update: 2013-03-07

AK toolkit

The AK toolkit is another kit for building and using Hidden Markov Models (HMMs). Originally developed for handwritten text recognition (HTR) using Bernoulli HMMs, it also implements diagonal Gaussians and can be used for any other purpose.
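As a rough illustration of what a Bernoulli HMM computes, the Python sketch below scores a sequence of binary feature vectors with the forward algorithm. It is not taken from the AK toolkit; the function names and the plain-list data layout are assumptions, and probabilities are multiplied directly rather than in log space, so it suits only short sequences.

```python
def bernoulli_likelihood(pixel_probs, observation):
    """Probability of a binary vector under independent Bernoulli components."""
    likelihood = 1.0
    for prob, bit in zip(pixel_probs, observation):
        likelihood *= prob if bit else (1.0 - prob)
    return likelihood


def forward_likelihood(initial, transition, emission_probs, observations):
    """Total probability of the observations, summed over all state paths.

    initial[s]         : probability of starting in state s
    transition[r][s]   : probability of moving from state r to state s
    emission_probs[s]  : per-component probability of a 1 bit in state s
    observations       : list of binary vectors (for example, image columns)
    """
    n_states = len(initial)
    alpha = [
        initial[s] * bernoulli_likelihood(emission_probs[s], observations[0])
        for s in range(n_states)
    ]
    for obs in observations[1:]:
        alpha = [
            sum(alpha[r] * transition[r][s] for r in range(n_states))
            * bernoulli_likelihood(emission_probs[s], obs)
            for s in range(n_states)
        ]
    return sum(alpha)


if __name__ == "__main__":
    initial = [1.0, 0.0]
    transition = [[0.5, 0.5], [0.0, 1.0]]
    emission_probs = [[0.9, 0.1], [0.1, 0.9]]
    print(forward_likelihood(initial, transition, emission_probs, [[1, 0], [0, 1]]))
```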

Downloads: 0 This Week

Last Update: 2013-04-22

Arabisc

Arabisc is a speaker-independent, large-vocabulary continuous speech recognizer for the Arabic language, released under the GNU license. It is also a collection of open source tools that allows researchers and developers to build speech recognition systems for Arab

1 Review

Downloads: 0 This Week

Last Update: 2013-04-26

CJ7

CJ7 is an open-source speech recognition engine.

Downloads: 0 This Week

Last Update: 2016-10-23

Cheery

A smartphone-PC interface for controlling your computer remotely.

Cheery is a smartphone-PC interface for controlling your computer remotely. It uses speech recognition to capture commands and sends them to a Java server that carries out the actions. Coming soon, Cheery will also be a Swiss Army knife for Android.

Downloads: 0 This Week

Last Update: 2016-10-17

G.A.S.I.

Webcam Gesture and Voice Recognition OS proof of concept

Inspired by interfaces from sci-fi movies like Iron Man, Gesture Analytical Sonic Interface (GASI) is a proof of concept of a computer interface based on webcam gestures (Kinect-like) and voice recognition. It restricts itself to components included in an average laptop (a simple webcam and microphone, no Kinect).

Downloads: 0 This Week

Last Update: 2016-11-18

HMM Speech Recognition in Java

Downloads: 0 This Week

Last Update: 2013-09-21

HMM Speech Recognition in Matlab

A speech recognition system using Matlab/Simulink/Stateflow.

This project provides a hidden Markov model speech recognition system built with Matlab/Simulink/Stateflow.

4 Reviews

Downloads: 0 This Week

Last Update: 2016-07-25

High-order HMM in Java

A duration high-order hidden Markov model (DHO-HMM) in Java.

This project provides an implementation of a duration high-order hidden Markov model (DHO-HMM) in Java. It is compatible with JDK 5 and 6. It was used in the author's research on speech recognition of Mandarin digits. There are some Chinese words in this project, and I am afraid I have not yet had time to translate them into English.

Downloads: 0 This Week

Last Update: 2013-09-16

High-order HMM in Matlab

Implementation of a duration high-order hidden Markov model in Matlab.

Implementation of a duration high-order hidden Markov model (DHO-HMM) in Matlab, with an application in speech recognition.

2 Reviews

Downloads: 0 This Week

Last Update: 2015-02-15

IITM multi modal interface

Augments other natural interfaces, namely handwriting and speech recognition, for entering Indian-language characters into the computer. Speech synthesis is also provided to read out local-language text.

Downloads: 0 This Week

Last Update: 2014-05-10

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system, aka voice assistant, very similar to Siri, Google Now and Cortana. ILA is fully customizable, and you can teach her/him/it new things yourself, such as executing system commands, opening web pages, programs and apps, or just holding some basic conversation :-) ILA runs on Java and is thus compatible with Windows, Mac and Linux. It is designed to integrate with your home environment so that you can, for example, build your own free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognizer CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3, users can also write their own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian

4 Reviews

Downloads: 0 This Week

Last Update: 2018-07-23

InproTK

An Incremental Spoken Dialogue Processing Toolkit

InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/

Downloads: 0 This Week

Last Update: 2015-06-16

M68331 Voice Recognition System

This project will show how to implement hidden Markov model approximations for voice recognition on embedded and low-power systems.

Downloads: 0 This Week

Last Update: 2013-02-21

MRCP4J

The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.

Downloads: 0 This Week

Last Update: 2013-04-25

OC Volume

OC Volume is a speech recognition engine written in Java for integration with other applications. It is currently a user-dependent isolated-word recognizer and can be expanded to include more recognition capabilities.

Downloads: 0 This Week

Last Update: 2013-02-21

Open Source Speech Recognition Project

Development of speech recognition software and libraries for Linux systems. It should allow everyone to integrate speech recognition into their software very easily.

Downloads: 0 This Week

Last Update: 2013-03-14

ROSA is an Open Source Agent (ROSA)

ROSA is an open source agent implementation. It will contain a speech engine, a speech recognition engine and many more...

Downloads: 0 This Week

Last Update: 2013-02-25

STRUDLE

A tool to help in the diagnosis of dyslexia, based on speech recognition performed with HTK.

1 Review

Downloads: 0 This Week

Last Update: 2013-04-09
