Search Results for "speech to text in java"

Showing 49 open source projects for "speech to text in java"

1

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 78 This Week

Last Update: 2025-06-26
See Project
2

SpeechRecognition

Speech recognition module for Python

Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using...

Downloads: 15 This Week

Last Update: 2025-11-19
See Project
3

NVIDIA NeMo

Toolkit for conversational AI

NVIDIA NeMo, part of the NVIDIA AI platform, is a toolkit for building new state-of-the-art conversational AI models. NeMo has separate collections for Automatic Speech Recognition (ASR), Natural Language Processing (NLP), and Text-to-Speech (TTS) models. Each collection consists of prebuilt modules that include everything needed to train on your data. Every module can easily be customized, extended, and composed to create new conversational AI model architectures. Conversational AI architectures are typically large and require a lot of data and compute for training. ...

Downloads: 6 This Week

Last Update: 2025-11-10
See Project
4

Mice MX OS speech to text Voice Control

Mice speech to text with MX Cinnamon OS ISO

Note about this image: it contains a system based on MX Linux, created to improve accessibility within the Linux environment. The distribution uses the Cinnamon desktop interface, which is configured to be operated using voice commands and outputs. The user interface and the control of your own devices and home automation systems can be customized and extended. The voice control program MiceStTM.py was developed to enable easy adaptation to other languages. However, only...

Downloads: 2 This Week

Last Update: 2025-05-14
See Project
5

Maia

MAIA (MyApp Intelligence Artificial) is designed to provide a foundation for building your own voice-controlled assistant with Python. It uses various libraries and modules for speech recognition, text-to-speech synthesis, and custom functionality.

Downloads: 2 This Week

Last Update: 2024-04-21
See Project
6

Mice TTM

mice stt tts

This tool is being developed specifically for accessibility on Linux. It converts/parses text produced by speech recognition into dictation and also executes macros. It works without an internet connection, because the speech recognition runs on the PC itself. The mouse can be moved to named words, which can then be selected or clicked by voice command. In addition, text passages (e.g. in LibreOffice Writer) can, by voice command, be correspondingly...

Downloads: 1 This Week

Last Update: 2024-08-07
See Project
7

Google2SRT

Download, save and convert multiple subtitles from YouTube videos

Google2SRT lets you download, save and convert multiple subtitles and translations from YouTube and Google Video to SubRip (.srt) format, which is recognized by most video players. You can download XML subtitles or simply type the video's URL, and Google2SRT will do the rest.

33 Reviews

Downloads: 97 This Week

Last Update: 2025-01-11
See Project
8

ASRT Speech Recognition

A Deep-Learning-Based Chinese Speech Recognition System

ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.

Downloads: 0 This Week

Last Update: 2025-07-03
See Project
9

VideoSrt

Windows-GUI

...Recognizes speech in video/audio to generate subtitle files (supports Chinese-English translation and bilingual subtitles) and extracts speech text from video/audio. Batch-translates, filters, and re-encodes SRT subtitle files. Uses the Alibaba Cloud speech recognition interface; the stated recognition rate for standard Mandarin/English is over 95%. Recognition does not require uploading the original video, which saves time.

Downloads: 20 This Week

Last Update: 2023-01-13
See Project
10

Tensor2Tensor

Library of deep learning models and datasets

Deep Learning (DL) has enabled the rapid advancement of many useful technologies, such as machine translation, speech recognition and object detection. In the research community, one can find code open-sourced by the authors to help in replicating their results and further advancing deep learning. However, most of these DL systems use unique setups that require significant engineering effort and may only work for a specific problem or architecture, making it hard to run new experiments and...

Downloads: 0 This Week

Last Update: 2021-05-24
See Project
11

Speech Recognition in English & Polish

Speech recognition software for English & Polish languages

Software for speech recognition in English & Polish languages. Basic versions of SkryBot: 1. SkryBot Home Speech (English Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesEnglish/InstalatorSkryBotHomeSpeechDemo-2.6.9.18117.exe/download 2. SkryBot DoMowy (Polish Language) - https://sourceforge.net/projects/skrybotdomowy/files/ReleasesPolish/InstalatorSkryBotDoMowyDemo-2.4.9.18117.exe/download More...

2 Reviews

Downloads: 6 This Week

Last Update: 2020-03-15
See Project
12

CMU Sphinx

Speech Recognition Toolkit

Thank you for visiting! ----> Maintenance and improvement work has MOVED to https://cmusphinx.github.io/ Please go there for the most recent software and documentation. <---- CMUSphinx is a speaker-independent large vocabulary continuous speech recognizer released under BSD style license. It is also a collection of open source tools and resources that allows researchers and developers to build speech recognition systems.

58 Reviews

Downloads: 570 This Week

Last Update: 2024-01-11
See Project
13

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

...It is designed to integrate with your home environment and, for example, to build your own free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognizer CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3, users can also write their own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian

4 Reviews

Downloads: 2 This Week

Last Update: 2018-07-23
See Project
14

annyang!

Speech recognition for your site

...You can easily add a GUI for the user to interact with Speech Recognition using Speech KITT. Speech KITT is fully customizable and comes with many different themes, and instructions on how to create your own designs.

Downloads: 0 This Week

Last Update: 2021-09-13
See Project
15

FM2TXT

Listens to radio via RTL-SDR, recognizes the audio, and writes a text log file

Log speech from your favorite FM station to a text file using an rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation: https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer the GitHub source over SF: https://github.com/randaller/fm2txt For those who want to recognize audio from a sound card instead of rtl-sdr (this allows transcribing NFM etc.): https://github.com/randaller/souncard2txt

Downloads: 2 This Week

Last Update: 2017-12-17
See Project
16

Speechalyzer

Process large speech data wrt transcription, labeling and annotation

Speechalyzer is a tool for the daily work of a 'speech worker'. It is optimized to process large speech data sets with respect to transcription, labeling and annotation. It is implemented as a client-server framework in Java and interfaces with software for speech recognition, synthesis, speech classification and quality evaluation. Its main applications are processing training data for speech recognition and classification models and running benchmark tests on speech-to-text, text-to-speech and speech classification software systems.

Downloads: 1 This Week

Last Update: 2016-04-27
See Project
17

Hemera - Intelligent System

Hemera is a Virtual Intelligent System aggregating some more advanced Artificial Intelligence Technologies (speech, speech recognition, form recognition, motion recognition ...); with applications in daily tasks, domotics and robotics ...

Downloads: 0 This Week

Last Update: 2015-01-21
See Project
18

jaivox

Speech recognition application builder and library

Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.

Downloads: 0 This Week

Last Update: 2015-03-26
See Project
19

S.T.A.N Voice Recognition Software

Software like J.A.R.V.I.S!!

Hello everyone! I am presenting S.T.A.N, voice recognition software that recognizes your voice and performs actions such as opening Facebook, renaming or copying a file, creating a folder, and many more! It has more than 50 commands that it performs just by listening to your voice, so download it and check it out. Your comments and feedback are valuable to us. And if you like it, please like our Facebook page and mail us!

Downloads: 0 This Week

Last Update: 2014-06-29
See Project
20

Cheery

A smartphone-PC interface for controlling your computer remotely.

Cheery is a smartphone-PC interface for controlling your computer remotely. It uses speech recognition to capture commands and sends them to a Java server that performs the actions. Coming soon: Cheery will also be a Swiss Army Knife for Android.

Downloads: 0 This Week

Last Update: 2016-10-17
See Project
21

InproTK

An Incremental Spoken Dialogue Processing Toolkit

InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/

Downloads: 0 This Week

Last Update: 2015-06-16
See Project
22

HMM Speech Recognition in Java

HMM Speech Recognition in Java

Downloads: 0 This Week

Last Update: 2013-09-21
See Project
23

Voce

A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.

3 Reviews

Downloads: 2 This Week

Last Update: 2013-10-03
See Project
24

High-order HMM in Java

A duration high-order hidden Markov model (DHO-HMM) in Java.

This project provides an implementation of a duration high-order hidden Markov model (DHO-HMM) in Java. It is compatible with JDK 5 & 6. It was used in the author's research on speech recognition of Mandarin digits. The project contains some Chinese text, which the author has not yet had time to translate into English.

Downloads: 0 This Week

Last Update: 2013-09-16
See Project
25

G.A.S.I.

Webcam Gesture and Voice Recognition OS proof of concept

Inspired by interfaces from sci-fi movies like Iron Man, Gesture Analytical Sonic Interface (GASI) is a proof of concept of a computer interface based on webcam gestures (Kinect-like) and voice recognition, restricted to components included in an average laptop (a simple webcam and microphone, no Kinect).

Downloads: 0 This Week

Last Update: 2016-11-18
See Project
