Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Artificial Intelligence
Speech Recognition Software
Search Results

Search Results for "qt-based"

x

Sort By:

Relevance

Clear All Filters

OS

Windows 34
Linux 29
Mac 20
More...
BSD 11
ChromeOS 7
Mobile Operating Systems 2
Desktop Operating Systems 1

Category

Artificial Intelligence 43
Multimedia 7
Education 4
Scientific/Engineering 4
Communications 3
Internet 3
Business 2
Software Development 2
System 2
Desktop Environment 1
Formats and Protocols 1
Mobile 1
Security 1

License

OSI-Approved Open Source 31
Other License 1
Public Domain 1

Translations

English 8
French 2
Spanish 2
German 1
More...
Italian 1

Programming Language

Python 10
Java 7
C# 5
C 4
More...
C++ 4
PHP 4
JavaScript 3
Visual Basic .NET 2
BASIC 1
Delphi/Kylix 1
Go 1
JSP 1
Prolog 1
Unix Shell 1

Status

Beta 9
Alpha 7
Production/Stable 6
Pre-Alpha 5
More...
Planning 3

Showing 43 open source projects for "qt-based"

View related business solutions

Speech Recognition Clear Filters & Widen Search

Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
1

The SpeechBrain Toolkit

A PyTorch-based Speech Toolkit

...Competitive or state-of-the-art performance is obtained in various domains. SpeechBrain supports state-of-the-art methods for end-to-end speech recognition, including models based on CTC, CTC+attention, transducers, transformers, and neural language models relying on recurrent neural networks and transformers. Speaker recognition is already deployed in a wide variety of realistic applications. SpeechBrain provides different models for speaker recognition, including X-vector, ECAPA-TDNN, PLDA, and contrastive learning. ...

Downloads: 2 This Week

Last Update: 5 days ago
See Project
2

FireRedASR

Open-source industrial-grade ASR models

...The project includes multiple model variants to meet different application needs, such as high-accuracy end-to-end interaction using an encoder-adapter-LLM framework and efficient real-time recognition using attention-based encoder-decoder architectures, giving developers flexibility in balancing performance and resource constraints. FireRedASR not only excels in traditional speech recognition tasks but also demonstrates strong capability in challenging scenarios like singing lyrics recognition, where accurate transcription is often difficult for conventional models.

Downloads: 0 This Week

Last Update: 2026-02-25
See Project
3

WhisperJAV

A subtitle generator for Japanese Adult Videos.

A subtitle generator for Japanese Adult Videos. Transformer-based ASR architectures like Whisper suffer significant performance degradation when applied to the spontaneous and noisy domain of JAV. This degradation is driven by specific acoustic and temporal characteristics that defy the statistical distributions of standard training data.

1 Review

Downloads: 57 This Week

Last Update: 2 days ago
See Project
4

AzioSpeech Recognition and Translation

AzioSpeech Recognition and Translation

Starting from version 1.2.1.0, the project has been renamed to AzioSpeech Recognition and Translation and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PFV5DG73198 A desktop application built with Avalonia UI that provides real-time speech recognition and translation using Azure Speech Services. Convert spoken words into text and translate them into multiple languages with professional-grade accuracy. Important Setup Requirements Before using...

Downloads: 0 This Week

Last Update: 2026-02-13
See Project
$300 in Free Credit Towards Top Cloud Services
Build VMs, containers, AI, databases, storage—all in one place.

Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.

Get Started
5

Mice MX OS speech to text Voice Control

Mice speech to text with MX Cinnamon OS ISO

Note about this image This image contains a system based on Linux MX, which was created to improve accessibility within the Linux environment. The distribution uses the Cinnamon desktop interface, which is configured to be operated using voice commands and outputs. The user interface and the control of your own devices and home automation systems can be customized and extended. The voice control program MiceStTM.py was developed to enable easy adaptation to other languages.

Downloads: 0 This Week

Last Update: 2025-05-14
See Project
6

ASRT Speech Recognition

A Deep-Learning-Based Chinese Speech Recognition System

ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.

Downloads: 0 This Week

Last Update: 2025-07-03
See Project
7

VideoSrt

Windows-GUI

This is an open source Windows-GUI software tool that can recognize video speech and automatically generate subtitle SRT files. VideoSrtIt is written in Golanglanguage and developed based on lxn/walk Windows-GUI toolkit. Open source software tool that can recognize video speech and automatically generate subtitle SRT files. It is suitable for business scenarios that quickly and batch generate Chinese/English subtitles and text files for media (video/audio). Recognize video/audio speech to generate subtitle files (support Chinese-English translation, bilingual subtitles) Extract speech text from video/audio. ...

Downloads: 28 This Week

Last Update: 2023-01-13
See Project
8

CALL-SLT

A project which uses existing speech recognition and speech translation resources to build conversation partners for beginning language students, based on the idea of a "translation game".

Downloads: 0 This Week

Last Update: 2019-06-17
See Project
9

Tensorpack

A Neural Net Training Interface on TensorFlow, with focus on speed

Tensorpack is a neural network training interface based on TensorFlow v1. Uses TensorFlow in the efficient way with no extra overhead. On common CNNs, it runs training 1.2~5x faster than the equivalent Keras code. Your training can probably gets faster if written with Tensorpack. Scalable data-parallel multi-GPU / distributed training strategy is off-the-shelf to use. Squeeze the best data loading performance of Python with tensorpack.dataflow.

Downloads: 0 This Week

Last Update: 2022-08-01
See Project
Custom VMs From 1 to 96 vCPUs With 99.95% Uptime
General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.

Try Free
10

NASH OS

Nash Operating System for Modern Ecommerce

The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.

Downloads: 7 This Week

Last Update: 2019-03-24
See Project
11

KALDI IVR ASTERISK SPEECH

Working template to create an Asterisk IVR system using kaldi

Working template to create an Asterisk IVR system using kaldi for speech recognition. IVR based speech recognition.

Downloads: 0 This Week

Last Update: 2018-12-15
See Project
12

JuliusModels

Open source speech models for Julius in English and other languages.

...Its aim is to give access a wider community of speech recognition enthusiasts to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc...) All of the models are based on HTK modelling software and data sets available freely on the Internet.

Downloads: 5 This Week

Last Update: 2018-05-11
See Project
13

Speechalyzer

Process large speech data wrt transcription, labeling and annotation

Speechalyzer: a tool for the daily work of a 'speech worker' It is optimized to process large speech data sets with respect to transcription, labeling and annotation. It is implemented as a client server based framework in Java and interfaces software for speech recognition, synthesis, speech classification and quality evaluation. The application is mainly the processing of training data for speech recognition and classification models and performing benchmarking tests on speech-to-text, text-to-speech and speech classification software systems.

Downloads: 0 This Week

Last Update: 2016-04-27
See Project
14

Awesome Recurrent Neural Networks

A curated list of resources dedicated to RNN

...Provides a wide range of works and resources such as a Recurrent Neural Network Tutorial, a Sequence-to-Sequence Model Tutorial, Tutorials by nlintz, Notebook examples by aymericdamien, Scikit Flow (skflow) - Simplified Scikit-learn like Interface for TensorFlow, Keras (Tensorflow / Theano)-based modular deep learning library similar to Torch, char-rnn-tensorflow by sherjilozair, char-rnn in tensorflow, and much more. Codes, theory, applications, and datasets about natural language processing, robotics, computer vision, and much more.

Downloads: 0 This Week

Last Update: 2021-09-22
See Project
15

Hemera - Intelligent System

Hemera is a Virtual Intelligent System aggregating some more advanced Artificial Intelligence Technologies (speech, speech recognition, form recognition, motion recognition ...); with applications in daily tasks, domotics and robotics ...

Downloads: 0 This Week

Last Update: 2015-01-21
See Project
16

Kaldi+PDNN

Fully fledged DNN Speech Recognition based on PDNN and Kaldi

Downloads: 0 This Week

Last Update: 2015-12-19
See Project
17

Responding Partner

Control your PC computer with voice commands

Responding Partner is a windows application that enables you free talking with your computer which equipped with spoken animation character. You will be surprised how smart responding partner robot is. It also enables voice commands and controls to your computer for small task like open media files, open and close program, shutdown and restart computer,open website, type in editor, text to speech,etc. You can extend the ability by installing new plugin which available at files tab. We will...

1 Review

Downloads: 0 This Week

Last Update: 2017-06-08
See Project
18

Speech

Dictation / Speech Recognition

Dictation / Speech Recognition software that runs on any platform supported by Google Chrome.

Downloads: 0 This Week

Last Update: 2013-11-17
See Project
19

G.A.S.I.

Webcam Gesture and Voice Recognition OS proof of concept

Inspired by interfaces from sci-fi movies like Iron Man, Gesture Analytical Sonic Interface (GASI) is a proof of concept of a Webcam gesture (Kinect like) and Voice recognition based computer interface, constraining itself to only components included in average laptops (A simple webcam and microphone, no Kinect)

Downloads: 0 This Week

Last Update: 2016-11-18
See Project
20

Bavieca (www.bavieca.org)

Bavieca is an open-source speech recognition tookit.

Bavieca (www.bavieca.org) is an open-source speech recognition toolkit intended for speech research and as a platform for rapid development of speech-enabled solutions by non speech experts. It comprises the most common acoustic modeling and adaptation techniques including discriminative training, and efficient dynamic and FSM-based decoders that can operate in batch and live recognition modes. Bavieca is entirely written in C++ and distributed under the Apache 2.0 license. Bavieca was developed at Boulder Language Technologies (BLT) during the last three years in response to the needs of the research projects conducted within the company. Research at BLT includes the development of conversational dialog systems and assessment tools that are deployed in formal educational settings and other real-life scenarios.

2 Reviews

Downloads: 0 This Week

Last Update: 2013-07-17
See Project
21

Open Pandora's Box

Pandora is an artificial intelligent web based bot

Pandora is an artificial intelligent web based bot written in Java. Pandora is a component based AI architecture including, database memory, XML, voice, voice rec, chat, IRC, HTTP, Wiktionary, Freebase, consciousness, language, GUI, applet, web, jsp, Android

1 Review

Downloads: 2 This Week

Last Update: 2013-11-20
See Project
22

husky

Haskell based automatic speech recognizer

...The goal of husky is to provide a speech recognition system that is suitable for education and for prototyping new algorithms in research. For this purpose, pipeline based design of speech recognition systems have been developed, which enabled highly abstracted implementation of the systems while keeping the access to the details of the process.

Downloads: 0 This Week

Last Update: 2016-11-11
See Project
23

Domotic Speech-recognition interface

Speech-recognition interface for a domotic system.

This product recognizes oral commands and translates them to domotic orders for a domotic system. This product does not implement a domotic system. This product is an interface to be plugged to a domotic system. The speech recognition is done by an arduino UNO board and an EasyVR shield. Available oral commands are generated from a house description file in XML format. The oral commands have to be trained for a specific users. For this purpose 2 interfaces are provided: a command line...

Downloads: 0 This Week

Last Update: 2015-12-29
See Project
24

AK toolkit

The AK toolkit is another kit for building and use Hidden Markov Models (HMMs). Originally developed for handwritten text recognition (HTR) using Bernoulli HMMs, it also implements diagonal Gaussians and can be used for any other purpose.

Downloads: 1 This Week

Last Update: 2013-04-22
See Project
25

iComand

This software records and replays user interaction with the computer. It can be interfaced through voice commands.

2 Reviews

Downloads: 0 This Week

Last Update: 2014-06-29
See Project

Previous
You're on page 1
2
Next

Related Searches

video to srt

conversational ai

whisperjav

whisper jav

mi 10

cinnamon desktop

phone flash softwares

ivr for asterisk

julius

transcription

Related Categories

Artificial Intelligence

Multimedia

Education

Scientific/Engineering

Communications

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise