Speech Software for Windows

  • 1
    Finds the pitch of a power spectrum (signal) according to the afferent/efferent neural crossover that occurs between the lateral olivocochlear efferents and the inner hair cell afferents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    This is a speech project made by NTHU MIRLab, Taiwan.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Mail2Voice

    E-Mail software for people with cognitive impairment and illiteracy.

    Please see our website for downloads. E-Mail client software dedicated to people with cognitive impairment and illiteracy. Very simplified graphical interface, voice recording of outgoing messages (attached as MP3), and speech reading of incoming messages. The code has moved to Git: https://git.framasoft.org/groups/Mail2Voice
    Downloads: 0 This Week
    Last Update:
    See Project
  • 5
    Matsig is an object-oriented signal class library (Toolbox in MATLAB lingo) for MATLAB 6.5 and later. It implements a signal class, simplifying operations and manipulations common in audio signal processing and speech processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    A minimal voice generator. Maps text to sounds, also using number-to-text transforms (library included) and spelling.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    Mixed Excitation hts_engine API

    Adds mixed excitation to the hts_engine API
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Simple Perl CGI script to manage user registrations on a murmur server (mumble server), via D-BUS
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    My Voice Commander

    Control a few of your PC tasks using your voice on a Windows 7 PC

    How can I use this program? 1. Launch media files, start or stop your webcam, capture images via webcam, browse webcam-captured images, open files or folders, and more, all using your voice. 2. GoodDay Caller feature (a user welcome approach): GoodDay Caller says Good Morning, Good Afternoon, Good Evening, or Good Night to the user.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 10
    My Freedom To Communicate (MyFTC) is Assistive Technology (AT) software that uses text-to-speech technology to enable nonverbal individuals to communicate easily in real life situations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    A Simple Nepali Voice Synthesis Project
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Software to make creating new iTunes-compatible RSS feeds for podcasts easy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    This tool is for Nuance developers who wish to analyze the results of Nuance's batchrec program. It reads in the results of a Nuance batch recognition run, calculates WER using sclite, and stores the results in a database for subsequent analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    OC Volume is a speech recognition engine written in Java for integration with other applications. It is currently a user-dependent isolated word recognizer and can be expanded to include more recognition capabilities.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    ONZE Miner

    ONZE Miner was a browser-based linguistics research tool

    NB: ONZE Miner has been renamed LaBB-CAT, and active support has been moved to another sourceforge project: http://labbcat.sourceforge.net ONZE Miner was a browser-based linguistics research tool that stores audio recordings and regular-expression searchable text transcripts of interviews. The search results, entire transcripts, and media, can be viewed or exported in a variety of format
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Implementation of a Media Resource Control Protocol (MRCP) client. Supports ASR and TTS functionality. Design pattern implementation. Documentation, sample application and library source code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Open AAC is an open source Augmentative and Alternative Communication (AAC) program for children and adults with communication difficulties.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    An open PHP-based framework for the Nabaztag™ (http://www.nabaztag.com/) electronic pet. Due to major changes on the Violet backend, OpenNab can no longer be connected to it, but it can still be used as a standalone server to set your bunnies free!
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    PDF Annot is a piece of software that enables you to add audio and text annotations to a PDF. It uses the JPedal SimpleViewer and the iText library. Annotations are supported by Adobe's official PDF Reader. Report any bugs here: krakosia[at]gmail.com
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    PHP-VOX is a text-to-speech (TTS) binding for PHP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    PHP based Viewer for Voice Servers like Mumble.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    PJSGUA is an Open Source SIP UAC based on PJSUA reference UAC from PJSIP project
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Platform for Annotated Corpora in XML. An integrated tool for corpus linguists built on Eclipse, Vex, Subversive, etc. for creating and editing transcriptions and annotations, querying, managing version-controlled data, and building a shippable corpus.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    A project that aims to convert a text page directly into MP3 or another audio format using the MBrola libraries.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    PersonaPlex

    PersonaPlex code

    PersonaPlex is an open-source real-time conversational speech AI model that goes beyond traditional text chat by providing full-duplex speech-to-speech interaction, meaning it can listen and talk at the same time instead of waiting for you to finish speaking before responding. This architectural approach eliminates awkward pauses and makes conversations feel much more human-like, with natural behaviors such as overlapping speech, interruptions, and fluent turn-taking, traits that traditional AI assistants typically lack. PersonaPlex also supports persona and voice control, allowing developers to define the role and speaking style of the agent using text prompts and voice conditioning, making it suitable for applications like customized voice assistants, interactive character agents, or domain-specific conversational tools. Internally, it processes continuous audio streams in a hybrid input format so that speech understanding and generation occur jointly.
    Downloads: 0 This Week
    Last Update:
    See Project