Showing 146 open source projects for "audio linux"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1

    Bermuda Text-to-Speech

    This project includes basic NLP and DSP techniques for Text-to-Speech

    See TTS demo at: http://rslp.racai.ro/index.php?page=tts This is an entirely written in JAVA project which includes a set of tools and methods designed to enable Multilingual Text-to-Speech (TTS) synthesis. We currently support English and Romanian but we will soon train more models and make them available for download. If you want to read more about our other NLP and TTS tools check out http://nlptools.racai.ro.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    pyespeak

    Python to eSpeak speech synthesis

    ctypes Python module for eSpeak http://espeak.sf.net speech synthesis
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    AarTon
    AarTon is an automated text-to-speech application. It allows user to enter text in a web-based front-end and render these texts via a multi-channel sound card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • 5
    A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 6
    J Subtitle Player is a program that plays SRT subtitle files on a translucent window. It's a great way to add a subtitle to a video that doesn't have any subtitle support. It can also be used with native and online streaming videos, such as, Netflix, Hulu, Amazon Prime videos, Google Play Movies, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7

    HMM Speech Recognition in Java

    HMM Speech Recognition in Java

    HMM Speech Recognition in Java
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    HMM Speech Recognition in Matlab

    A speech recognition system using Matlab/Simulink/Stateflow.

    This project provide hidden Markov model speech recognition system by using Matlab/Simulink/Stateflow.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    SpotMachine
    A program to record and sequentially play audio spots in e.g. supermarkets. This project is now on github: http://github.com/pryds/spotmachine
    Downloads: 0 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition. full installation and usage instructions given at http://sourceforge.net/p/rnnl/wiki/Home/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    An initiative to create something similar to the windows program Roger Wilco, Teamspeak, BattleCom and Speak Freely, allowing users from different platforms talk with each other in real time with minimal CPU and bandwidth usage. Voice chat.....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Rhema STH

    Free Open Source Software for the Speech & Hearing Impaired

    RHEMA - Speak to Hear Software Application RHEMA is a software designed to help people with speech disability. Thiruvalluvar, the Tamil Sage of the 1st Century CE had said: “Wealth of wealth is wealth acquired be ear attent; Wealth mid all wealth supremely excellent. “ Kural No : 411 This software is the first version, with limited words in Tamil for them to practice. We have tested it with the help of a school and atleast two children were able to pick up some...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13
    VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The aim of the project is to make programming through voice input as easy and productive as with mouse and keyboard. For install, Use subversion, as described in this page: http://sourceforge.net/apps/mediawiki/voicecode/index.php?title=VCode_1_Doc/InstallationManual.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14

    festival-croatian

    Croatian support for Festival.

    festival-croatian is Croatian support for Festival speech synthesis system. This support includes Croatian lexicon, which contains 83 entries, Croatian synthesis module, which contains Croatian phoneset, lts rules, tokenization, utterance, and accents, Croatian support for mbrola speech synthesizer, and 2 Czech voices provided by brailcom, until completing Croatian festival voice.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Speect
    Speect is a multilingual TTS system. It offers a full text-to-speech system with various API's, as well as an environment for research and development of TTS systems and voices. It is written in ANSI C and uses a plug-in mechanism for extensions. Speect also includes an extensive set of Python bindings for quick implementation of new ideas, these bindings are derived from SWIG interface files and can easily be extended for other languages supported by SWIG. Speect is free and open...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    Mixed Excitation hts_engine API

    Adds mixed excitation to the hts_engine API

    Adds mixed excitation to the hts_engine API
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17

    ONZE Miner

    ONZE Miner was a browser-based linguistics research tool

    NB: ONZE Miner has been renamed LaBB-CAT, and active support has been moved to another sourceforge project: http://labbcat.sourceforge.net ONZE Miner was a browser-based linguistics research tool that stores audio recordings and regular-expression searchable text transcripts of interviews. The search results, entire transcripts, and media, can be viewed or exported in a variety of format
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    spantus
    Spantus - signal(speech) analytical tool
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Zanzibar is a complete, standards based IVR. It includes an MRCPv2 Server with ASR and TTS engines as well as an voiceXML interpreter so that you can deploy and run voiceXML applications. It integrates with VOIP PBX’s (like Asterisk) using SIP and RTP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    This project is being developed to be a Java based speech recognition (SR) program. In addition to the SR program itself, it includes a program which allows a user to view the sound being received by the computer. The user can manipulate this data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    TranscriberAG is designed for assisting the manual annotation of speech signals. It provides a user-friendly GUI for segmenting long duration speech recordings, transcribing them, labeling speech turns, topic changes and acoustic conditions.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 22
    DonnerLaParole
    Clavier virtuel et synthétiseur vocal pour les personnes ne pouvant plus parler et ayant du mal à utiliser leurs mains. Virtual keyboard and speech synthetiser for people with reduced mobility and unability to speak. In French and english.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    An IDE for visually impaired users. It supports compiling and immediate error line focus, automatic code clean-up and not to mention all screen-readers E.G. NVDA. Sorry Linux can't work. Also, does NOT require Java Access Bridge.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Emacs VR Mode integrates the features of Dragon NaturallySpeaking with GNU Emacs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Software to make creating new iTunes-compatible RSS feeds for podcasts easy.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB