Speech Software for Mac

View 65 business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    J Subtitle Player is a program that plays SRT subtitle files on a translucent window. It's a great way to add a subtitle to a video that doesn't have any subtitle support. It can also be used with native and online streaming videos, such as, Netflix, Hulu, Amazon Prime videos, Google Play Movies, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    mp3 library, advanced ID3V1 and ID3V2 tagger, player. Organize a large mp3 library, over 40,000 songs. Speech synthesis and tag backup utilities. Scripts to maintain and organize song files.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    An IDE for visually impaired users. It supports compiling and immediate error line focus, automatic code clean-up and not to mention all screen-readers E.G. NVDA. Sorry Linux can't work. Also, does NOT require Java Access Bridge.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Free Open Source VoiceXML editor programmed in Java (Swing). The VoiceXML document is regularly parsed, a tree view is built and syntax errors are reported in a specific table.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    JeSpeak is a Java library that bridges eSpeak, which is a compact open source software speech synthesizer. JeSpeak uses JNI to make native call to libespeak.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    An initiative to create something similar to the windows program Roger Wilco, Teamspeak, BattleCom and Speak Freely, allowing users from different platforms talk with each other in real time with minimal CPU and bandwidth usage. Voice chat.....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Get your database online quickly using configuration not code. Use this secure, scalable and proven enterprise technology to publish any relational data, any custom process on the internet. MySQL, Java, XML, XSL. xHTML GUI, beta voiceXML and WML/WAP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    This project will show how to implement the Hidden Markov Model approximations of Voice Recognition into embedded and low power systems.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    A XFig based rapid prototype yeilding an audio speed alteration tool. This tool lets you arbitrarily alter the speed of audio files. It uses the WSOLA algorithm for audio speed alteration without pitch change.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 10
    Find the pitch of a power spectrum (signal) as per the afferent/efferent neural crossover. This occurs between the Lateral olivocochlear efferents and the inner hair cell afferents.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    The MRCPv2 protocol is designed to allow client devices to control media processing resources, such as speech recognition engines. MRCP4J provides a Java API that encapsulates the MRCPv2 protocol and can be used to implement MRCP clients and/or servers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Matsig is an object-oriented signal class library (Toolbox in MATLAB lingo) for MATLAB 6.5 and later. It implements a signal class, simplifying operations and manipulations common in audio signal processing and speech processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    A minimum voice generator. Maps text to sounds using also number to text (library included) transforms and spelling.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14

    Mixed Excitation hts_engine API

    Adds mixed excitation to the hts_engine API

    Adds mixed excitation to the hts_engine API
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Moshi

    Moshi

    A speech-text foundation model for real time dialogue

    Moshi is a speech-text foundation model and full-duplex spoken dialogue framework. It uses Mimi, a state-of-the-art streaming neural audio codec. Mimi processes 24 kHz audio, down to a 12.5 Hz representation with a bandwidth of 1.1 kbps, in a fully streaming manner (latency of 80ms, the frame size), yet performs better than existing, non-streaming, codecs like SpeechTokenizer (50 Hz, 4kbps), or SemantiCodec (50 Hz, 1.3kbps). Moshi models two streams of audio: one corresponds to Moshi, and the other one to the user. At inference, the stream from the user is taken from the audio input, and the one for Moshi is sampled from the model's output. Along these two audio streams, Moshi predicts text tokens corresponding to its own speech, its inner monologue, which greatly improves the quality of its generation. A small Depth Transformer models inter codebook dependencies for a given time step, while a large, 7B parameter Temporal Transformer models the temporal dependencies.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Simple Perl CGI script to manage user registrations on a murmur server (mumble server), via D-BUS
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    A Simple Nepali Voice Synthesis Project
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Software to make creating new iTunes-compatible RSS feeds for podcasts easy.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This tool is for Nuance developers who wish to analyze the results of Nuance's batchrec program. It reads in the results of a Nuance batch recognition run, calculates WER using sclite, and stores the results in a database for subsequent analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    ONZE Miner

    ONZE Miner was a browser-based linguistics research tool

    NB: ONZE Miner has been renamed LaBB-CAT, and active support has been moved to another sourceforge project: http://labbcat.sourceforge.net ONZE Miner was a browser-based linguistics research tool that stores audio recordings and regular-expression searchable text transcripts of interviews. The search results, entire transcripts, and media, can be viewed or exported in a variety of format
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Implementation of Media Resource Control Protocol Client (MRCP). Supports ASR and TTS functionality. Design pattern implementation. Documentation, sample application and library source code.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Open AAC is an open source Augmentative and Alternative Communication (AAC) program for children and adults with communication difficulties.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    An open PHP-based framework for the Nabaztag™ (http://www.nabaztag.com/) electronic pet. Due to major changes on the Violet backend, OpenNab can no longer be connected to it but it still can be used as a standalone server to set your bunnies free !
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    PHP based Viewer for Voice Servers like Mumble.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB