Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Speech Recognition Software
Search Results

Search Results for "can="

x

Sort By:

Relevance

Clear All Filters

OS

Windows 45
Linux 37
Mac 31
More...
BSD 12
ChromeOS 7
Mobile Operating Systems 4
Desktop Operating Systems 1

Category

Artificial Intelligence 51
Multimedia 14
Scientific/Engineering 5
Internet 4
Software Development 4
System 4
Business 3
Communications 3
Education 3
Desktop Environment 2
Formats and Protocols 1
Security 1
Text Editors 1

License

OSI-Approved Open Source 35
GNU Free Documentation License 1
Other License 1
Public Domain 1

Translations

English 10
German 3
Spanish 3
French 2
More...
Arabic 1
Brazilian Portuguese 1
Catalan 1
Chinese (Simplified) 1
Chinese (Traditional) 1
Italian 1
Russian 1
Ukrainian 1

Programming Language

Java 10
Python 10
C++ 6
C 5
More...
C# 4
JavaScript 3
Perl 2
PHP 2
Visual Basic .NET 2
BASIC 1
Delphi/Kylix 1
Go 1
Visual Basic 1

Status

Alpha 8
Beta 8
Pre-Alpha 4
Production/Stable 4
More...
Planning 1

Showing 51 open source projects for "can="

View related business solutions

Speech Recognition Clear Filters & Widen Search

Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
1

whisper.cpp

Port of OpenAI's Whisper model in C/C++

...The command downloads the base.en model converted to custom ggml format and runs the inference on all .wav samples in the folder samples. whisper.cpp supports integer quantization of the Whisper ggml models. Quantized models require less memory and disk space and depending on the hardware can be processed more efficiently.

Downloads: 384 This Week

Last Update: 2026-03-19
See Project
2

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented as a sequence of tokens to be predicted by the decoder, allowing a single model to replace many stages of a traditional speech-processing pipeline. ...

Downloads: 53 This Week

Last Update: 2025-06-26
See Project
3

Buster

Captcha solver extension for humans

...The continued development of Buster is made possible thanks to the support of awesome backers. If you'd like to join them, please consider contributing with Patreon, PayPal or Bitcoin. The success rate of the extension can be improved by simulating user interactions with the help of a client app. Follow the instructions from the extension's options to download and install the client app on Windows, Linux and macOS, or get the app from this repository.

Downloads: 33 This Week

Last Update: 2026-04-27
See Project
4

OpenVINO

OpenVINO™ Toolkit repository

OpenVINO™ is an open-source toolkit for optimizing and deploying AI inference. Boost deep learning performance in computer vision, automatic speech recognition, natural language processing and other common tasks. Use models trained with popular frameworks like TensorFlow, PyTorch and more. Reduce resource demands and efficiently deploy on a range of Intel® platforms from edge to cloud. This open-source version includes several components: namely Model Optimizer, OpenVINO™ Runtime,...

Downloads: 21 This Week

Last Update: 2026-03-25
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.

Start Free
5

Omnilingual ASR

Omnilingual ASR Open-Source Multilingual SpeechRecognition

Omnilingual-ASR is a research codebase exploring automatic speech recognition that generalizes across a very large number of languages using shared modeling and training recipes. It focuses on leveraging self-supervised audio pretraining and scalable fine-tuning so low-resource languages can benefit from high-resource data. The project provides data preparation pipelines, training scripts, decoding utilities, and evaluation tools so researchers can reproduce results and extend to new language sets. It emphasizes modularity: acoustic modeling, language modeling, tokenization, and decoding are separable pieces you can swap or ablate. ...

Downloads: 0 This Week

Last Update: 2025-12-12
See Project
6

NVIDIA NeMo

Toolkit for conversational AI

...NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. NeMo uses PyTorch Lightning for easy and performant multi-GPU/multi-node mixed-precision training. Supported models: Jasper, QuartzNet, CitriNet, Conformer-CTC, Conformer-Transducer, Squeezeformer-CTC, Squeezeformer-Transducer, ContextNet, LSTM-Transducer (RNNT), LSTM-CTC. ...

Downloads: 2 This Week

Last Update: 2026-04-22
See Project
7

annyang!

Speech recognition for your site

...Use optional words or phrases to define a part of the command as optional. annyang plays nicely with all browsers, progressively enhancing browsers that support SpeechRecognition, while leaving users with older browsers unaffected. Grab the latest version of annyang.min.js, drop it in your html, and start adding commands. You can easily add a GUI for the user to interact with Speech Recognition using Speech KITT. Speech KITT is fully customizable and comes with many different themes, and instructions on how to create your own designs.

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
8

The SpeechBrain Toolkit

A PyTorch-based Speech Toolkit

SpeechBrain is an open-source and all-in-one conversational AI toolkit. It is designed to be simple, extremely flexible, and user-friendly. Competitive or state-of-the-art performance is obtained in various domains. SpeechBrain supports state-of-the-art methods for end-to-end speech recognition, including models based on CTC, CTC+attention, transducers, transformers, and neural language models relying on recurrent neural networks and transformers. Speaker recognition is already deployed in a...

Downloads: 0 This Week

Last Update: 2026-03-30
See Project
9

Vosk Speech Recognition Toolkit

Offline speech recognition API for Android, iOS, Raspberry Pi

...Speech recognition bindings are implemented for various programming languages like Python, Java, Node.JS, C#, C++, Rust, Go and others. Vosk supplies speech recognition for chatbots, smart home appliances, and virtual assistants. It can also create subtitles for movies, and transcription for lectures and interviews. Vosk scales from small devices like Raspberry Pi or Android smartphones to big clusters.

Downloads: 70 This Week

Last Update: 2024-04-22
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

Google2SRT

Download, save and convert multiple subtitles from YouTube videos

Google2SRT allows you to download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type video's URL, Google2SRT will do the rest.

33 Reviews

Downloads: 51 This Week

Last Update: 2025-01-11
See Project
11

Voxal voice changer

Transform your voice in real-time voxal voice changer

...Audio can be captured from various sources, pre-listening is available, and the most popular audio formats are supported.

1 Review

Downloads: 5 This Week

Last Update: 2025-11-16
See Project
12

Mice MX OS speech to text Voice Control

Mice speech to text with MX Cinnamon OS ISO

...The distribution uses the Cinnamon desktop interface, which is configured to be operated using voice commands and outputs. The user interface and the control of your own devices and home automation systems can be customized and extended. The voice control program MiceStTM.py was developed to enable easy adaptation to other languages. However, only German settings are currently implemented. category: System commands comment: Screen grid trigger: Display screen (Ras.*|Grid)* terminal_command: /opt/micesttm/read-aloud/screen_grid.py & sleep 1 && xdotool search --name "screen grid" windowactivate intern_command: tts: Screen grid for the mouse click was selected.

Downloads: 0 This Week

Last Update: 2026-04-18
See Project
13

Flashlight library

A C++ standalone library for machine learning

...Native support in C++ and simple extensibility make Flashlight a powerful research framework that's hackable to its core and enables fast iteration on new experimental setups and algorithms with little unopinionated and without sacrificing performance. In a single repository, Flashlight provides apps for research across multiple domains. Flashlight can be broken down into several components as described above. Each component can be incrementally built by specifying the correct build options. Flashlight is most-easily built and installed with vcpkg. Both the CUDA and CPU backends are supported with vcpkg. For either backend, first, install Intel MKL. Flashlight app binaries are also built for the selected features and are installed into the vcpkg install tree's tools directory.

Downloads: 1 This Week

Last Update: 2022-05-27
See Project
14

VideoSrt

Windows-GUI

This is an open source Windows-GUI software tool that can recognize video speech and automatically generate subtitle SRT files. VideoSrtIt is written in Golanglanguage and developed based on lxn/walk Windows-GUI toolkit. Open source software tool that can recognize video speech and automatically generate subtitle SRT files. It is suitable for business scenarios that quickly and batch generate Chinese/English subtitles and text files for media (video/audio).

Downloads: 25 This Week

Last Update: 2023-01-13
See Project
15

wav2letter++

Facebook AI research's automatic speech recognition toolkit

...It is highly recommended to build KenLM with position-independent code (-fPIC) enabled, to enable python compatibility. After installing, run export KENLM_ROOT_DIR=... so that wav2letter++ can find it. This is needed because KenLM doesn't support a make install step.wav2letter++ expects audio and transcription data to be prepared in a specific format so that they can be read from the pipelines. Each dataset (test/valid/train) needs to be in a separate file with one sample per line. A sample is specified using 4 columns separated by space (or tabs).

Downloads: 0 This Week

Last Update: 2022-05-27
See Project
16

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and compare the results. Tensor2Tensor, or T2T for short, is a library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research. ...

Downloads: 0 This Week

Last Update: 2021-05-24
See Project
17

Polaris programing with voice in Eclipse

Polaris, programing with voice in Eclipse IDE

With Polaris you have the possibility of incorporating speech into programing. Through use of this plugin in Eclipse IDE you can see that not only is it possible to provide an environment for a programing with voice, but that programing with voice it is part of the natural evolution of programming tools. VOICE COMMANDS eclipse task eclipse search eclipse skip eclipse format eclipse new eclipse save eclipse rename eclipse cut eclipse copy eclipse paste eclipse all eclipse delete eclipse close eclipse get eclipse hash eclipse string Efforts are made on daily basic to increase the range of functionality that can be controlled with voice. ...

Downloads: 7 This Week

Last Update: 2019-05-12
See Project
18

Commander

Commander.exe is speech recognition engine for Polaris.

...What is Polaris ? Polaris is plugin for Eclipse IDE With Polaris you have the possibility of incorporating speech into programing. Through use of this plugin in Eclipse IDE you can see that not only is it possible to provide an environment for a programing with voice, but that programing with voice it is part of the natural evolution of programming tools. Current version supports simple but powerful commands such as openig search forms, changing workspace, copy and paste code. Efforts on daily basis are made to increase the range of functionality that can be controlled with voice. ...

Downloads: 0 This Week

Last Update: 2019-05-12
See Project
19

Tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed

...There are too many symbolic function wrappers already. Tensorpack includes only a few common layers. You can use any TF symbolic functions inside Tensorpack.

Downloads: 0 This Week

Last Update: 2022-08-01
See Project
20

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

...The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian

4 Reviews

Downloads: 1 This Week

Last Update: 2018-07-23
See Project
21

H.B.S.N

Speech Recognition System

...This software is a package of many sub applications.And those are as listed below , Audio Player Video Player Email Client Weather Application Mp3 Tag Editor Picture Viewer Home Automation Application Alarm / Timer Folder Locker Message Encrypt Application Income & Expenses Logging Application Apart from that we can do many thing from this software by using voice commands , such as , Open & close applications which are installed in the computer Open web sites Open folders which are in the HDD Control built-in audio & video player Control the home automation system Reading mails Reading selected text Speaking clock ( Time & Date) Speaking weather report There are system commands for the tasks which this application does.And we can replace the system default commands with custom commands.

1 Review

Downloads: 0 This Week

Last Update: 2018-06-30
See Project
22

JuliusModels

Open source speech models for Julius in English and other languages.

Open source speech models for Julius speech decoder. Its aim is to give access a wider community of speech recognition enthusiasts to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc...) All of the models are based on HTK modelling software and data sets available freely on the Internet.

Downloads: 1 This Week

Last Update: 2018-05-11
See Project
23

Lip Reading

Cross Audio-Visual Recognition using 3D Architectures

...The essential problem is to find the correspondence between the audio and visual streams, which is the goal of this work. We proposed the utilization of a coupled 3D Convolutional Neural Network (CNN) architecture that can map both modalities into a representation space to evaluate the correspondence of audio-visual streams using the learned multimodal features.

Downloads: 1 This Week

Last Update: 2022-08-11
See Project
24

C# Speech Recognition Tutorial

C# Speech Recognition Tutorial

This is an easy (as can be) tutorial to show how speech recognition is done with in C#. On the form the button is pressed, and within 5 seconds say your speech. In this example Q and B act as commands. The code filters the recognised words looking for the letter Q and B. File contains the source code-use this to make the simple form with the named elements in the image-in a new winforms program.

Downloads: 0 This Week

Last Update: 2017-08-11
See Project
25

JAVT - Just Another Voice Transformer

Just Another Speech Recognition and Text to Speech software.

...JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.

Downloads: 3 This Week

Last Update: 2020-08-19
See Project

Previous
You're on page 1
2
3
Next

Related Searches

voice changer

whisper-windows-x64.exe

whisper

whisper-bin-x64.zip

whisper.cpp

openvino

vosk-win64-0.3.39

vosk

whisper-cli.exe

delphi speech recognition

Related Categories

Artificial Intelligence

Multimedia

Scientific/Engineering

Internet

Software Development

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise