Open Source Windows Speech Recognition Software - Page 2

Speech Recognition Software for Windows

View 36 business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    ILA - teachable voice assistant

    ILA - teachable voice assistant

    ILA is a fully customizable and teachable voice assistant for Java

    ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system aka voice assistant very similar to Siri, Google Now and Cortana. ILA is fully customizable and you can teach her/him/it new things by yourself like executing system commands, opening web pages, programs and apps or just some basic conversation :-) ILA runs on Java und thus is compatible to Windows, Mac and Linux. It is designed to integrate with your home enviroment and for example build up your own, free and open Amazon Echo replacement ;-) Right now the key components of ILA are the open source speech recognition CMU Sphinx-4, Google (Speech Recognition/Text-To-Speech) and MaryTTS (Text-To-Speech). The goal is to make ILA completely free of Google by improving all aspects of the open source systems. Since version 3.3 users can also write own add-ons to extend ILA. ILA's successor is the SEPIA Framework: https://sepia-framework.github.io/ Hope you enjoy ILA - Florian
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Polaris programing with voice in Eclipse

    Polaris programing with voice in Eclipse

    Polaris, programing with voice in Eclipse IDE

    With Polaris you have the possibility of incorporating speech into programing. Through use of this plugin in Eclipse IDE you can see that not only is it possible to provide an environment for a programing with voice, but that programing with voice it is part of the natural evolution of programming tools. VOICE COMMANDS eclipse task eclipse search eclipse skip eclipse format eclipse new eclipse save eclipse rename eclipse cut eclipse copy eclipse paste eclipse all eclipse delete eclipse close eclipse get eclipse hash eclipse string Efforts are made on daily basic to increase the range of functionality that can be controlled with voice. PREREQUISITE Windows OS and Eclipse IDE. Headphones with microphone, not mandatory, but it will improve speech recognition. Port Number that is setted in Polaris Preference page must not be used by any other application.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 3
    JuliusModels

    JuliusModels

    Open source speech models for Julius in English and other languages.

    Open source speech models for Julius speech decoder. Its aim is to give access a wider community of speech recognition enthusiasts to quality models, which they can use in their own projects on different OS platforms (Unix, Windows, etc...) All of the models are based on HTK modelling software and data sets available freely on the Internet.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 4
    Responding Partner

    Responding Partner

    Control your PC computer with voice commands

    Responding Partner is a windows application that enables you free talking with your computer which equipped with spoken animation character. You will be surprised how smart responding partner robot is. It also enables voice commands and controls to your computer for small task like open media files, open and close program, shutdown and restart computer,open website, type in editor, text to speech,etc. You can extend the ability by installing new plugin which available at files tab. We will continuous to update new plugin and animation character. Engine inside: - Speech Recognition - Text to Speech Requirements - Microphone
    Downloads: 2 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    This project, npp (net plus plus, net++), is developed on top of open source package QuickNet for Neural Network training in speech recognition.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Collaborative development and distribution of Windows Speech Recognition (WSR) application macros to 1) improve the accessibility of personal computing for impaired users, and 2) improve the efficiency of personal computing for all users.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    FM2TXT

    FM2TXT

    RtlSdr listen to radio, recognize audio, and writes text file log

    Just log your favorite FM station speech to a text file using rtl-sdr dongle and speech recognition. Cross-platform tool. Follow the README on the download page for Windows installation. https://sourceforge.net/projects/fm2txt-rtlsdr/files/ If you prefer GitHub source, not SF: https://github.com/randaller/fm2txt For those, who want to recognize from soundcard, not from rtl-sdr (this allows to transcribe NFM etc): https://github.com/randaller/souncard2txt
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    NASH OS

    NASH OS

    Nash Operating System for Modern Ecommerce

    The all-built-in-one, automatic, ready-to-go out-of-box, easy-to-use state-of-the-art, and really awesome NASH OS! Over 25,000+ flexible features and controls and all scalable!! The most powerful solution ever built to instantly deliver new heights of online ecommerce enterprise to you.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    Access competitive interest rates on your digital assets.

    Generate interest, borrow against your crypto, and trade a range of cryptocurrencies — all in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 10
    Arabisc is speaker independent large vocabulary continuous speech recognizer for Arabic language released under GNU license.It is also a collection of open source tools that allows researchers and developers to build speech recognition systems for Arab
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    A voice recognition application which detect Quran (Islamic Holy book) pronunciation and result in a detection success percentage for each word of the Quran statements, when complete it can be considered a base for Arabic language recognition.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    AzioSpeech Recognition and Translation

    AzioSpeech Recognition and Translation

    AzioSpeech Recognition and Translation

    Starting from version 1.2.1.0, the project has been renamed to AzioSpeech Recognition and Translation and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PFV5DG73198 A desktop application built with Avalonia UI that provides real-time speech recognition and translation using Azure Speech Services. Convert spoken words into text and translate them into multiple languages with professional-grade accuracy. Important Setup Requirements Before using this application, you MUST have: 1. Azure Account Setup Active Azure Subscription - Create a free account at portal.azure.com Azure Speech Service Resource - You must create your own Speech Service within your Azure subscription Valid API Key & Region - Obtain these credentials from your Azure Speech Service resource 2. Windows Privacy Settings CRITICAL: Microphone Access Required You must grant microphone access through Windows settings
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13

    Commander

    Commander.exe is speech recognition engine for Polaris. 

    Commander.exe is speech recognition engine for Polaris.  What is Polaris ? Polaris is plugin for Eclipse IDE With Polaris you have the possibility of incorporating speech into programing. Through  use of this plugin in Eclipse IDE you can see that not only is it possible to provide an  environment for a programing with voice, but that programing with voice it is part of the  natural evolution of programming tools. Current version supports simple but powerful  commands such as openig search forms, changing workspace, copy and paste code.  Efforts on daily basis are made to increase the range of functionality that can be  controlled with voice.  Commander is here to recognize speech and send it to Polaris who activates actions in Eclipse IDE
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    JAVT - Just Another Voice Transformer

    JAVT - Just Another Voice Transformer

    Just Another Speech Recognition and Text to Speech software.

    JAVT or Just Another Voice Transformer (formerly, it is called Just Another Video Transcriber) is a Speech Recognition software that also support text to Speech and simple media conversion. JAVT allows you to convert from video files to audio wav file using ffmpeg, and then transcribe the audio file to text using either Microsoft SAPI or CMU Sphinx. You can also open a text file and allow JAVT to read it out for you through text to speech conversion.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    Pythia is a natural language question answering system, which uses Speech Recognition and Text To Speech technologies to communicate with the user.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17

    Speech Recognition System

    Speech Recognition System - Matlab source code

    Speech recognition technology is used more and more for telephone applications like travel booking and information, financial account information, customer service call routing, and directory assistance. Using constrained grammar recognition, such applications can achieve remarkably high accuracy. Research and development in speech recognition technology has continued to grow as the cost for implementing such voice-activated systems has dropped and the usefulness and efficacy of these systems has improved. For example, recognition systems optimized for telephone applications can often supply information about the confidence of a particular recognition, and if the confidence is low, it can trigger the application to prompt callers to confirm or repeat their request. Index Terms: speech, recognition, verification, sound, isolated, words.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    "MedicalRecords"

    MedicalRecords is an integrated medical information system.

    Introduction “MedicalRecords” is an open source, client-server medical information system that is primarily intended to facilitate the storage, organization and retrieval of personal medical information that may be obtained from a variety of sources including physician offices and medical centers. Data that are downloadable in machine readable format can be transferred electronically to the database. Alternately, the data can be transferred from USB flash drives, CD ROMs or other removable storage media. Documents can be entered by scanning to PDF files or other formats. Finally, information may be entered through use of speech recognition or typing. “MedicalRecords” gives one or more patients access to an integrated medical record the data in which may come from a variety of sources. It also provides an easy means for presenting the integrated data to specialist or other new care provider, emergency room staff or admitting physicians.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    This is an application that takes the input of ABNF code and then converts it to GRXML. Both standards adhere to the W3 standard of grammars for speech recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    A speech recognition application. It uses Microsoft Speech SDK to recognize and speak words. It can Play Music, Read the News, Tell the Time, Open Apps and many other cool things only with voice commands.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    ASR for Medical Reporting

    Automatic speech recognition system for medical reporting in spanish.

    This is a functional prototype of automatic speech recognition system for medical reporting in Spanish using CMU Sphinx4 ASR toolkit. This ASR use pre-trained acoustic model and context dependent language model in nuclear medicine diagnostics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    ASRT Speech Recognition

    ASRT Speech Recognition

    A Deep-Learning-Based Chinese Speech Recognition System

    ASRT is an end-to-end deep-learning Chinese ASR system built with TensorFlow/Keras, using convolution + CTC and a Max-Entropy HMM language model. It provides a REST/gRPC server backend and client SDKs in multiple languages (Python, Java, Go, Windows). Notably lightweight, it performs well without needing GPU acceleration and runs across platforms, targeting developers and researchers building Chinese voice interfaces.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    This project'll be the core engine of many voice based platforms,which can be implemented into your projects,websites...etc to provide an Arabic speech service, where your servers can interact with the clients through Arabic Speech Recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Awesome Recurrent Neural Networks

    Awesome Recurrent Neural Networks

    A curated list of resources dedicated to RNN

    A curated list of resources dedicated to recurrent neural networks (closely related to deep learning). Provides a wide range of works and resources such as a Recurrent Neural Network Tutorial, a Sequence-to-Sequence Model Tutorial, Tutorials by nlintz, Notebook examples by aymericdamien, Scikit Flow (skflow) - Simplified Scikit-learn like Interface for TensorFlow, Keras (Tensorflow / Theano)-based modular deep learning library similar to Torch, char-rnn-tensorflow by sherjilozair, char-rnn in tensorflow, and much more. Codes, theory, applications, and datasets about natural language processing, robotics, computer vision, and much more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    Bavieca (www.bavieca.org)

    Bavieca is an open-source speech recognition tookit.

    Bavieca (www.bavieca.org) is an open-source speech recognition toolkit intended for speech research and as a platform for rapid development of speech-enabled solutions by non speech experts. It comprises the most common acoustic modeling and adaptation techniques including discriminative training, and efficient dynamic and FSM-based decoders that can operate in batch and live recognition modes. Bavieca is entirely written in C++ and distributed under the Apache 2.0 license. Bavieca was developed at Boulder Language Technologies (BLT) during the last three years in response to the needs of the research projects conducted within the company. Research at BLT includes the development of conversational dialog systems and assessment tools that are deployed in formal educational settings and other real-life scenarios.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB