Showing 268 open source projects for "audio"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 1
    mp3 library, advanced ID3V1 and ID3V2 tagger, player. Organize a large mp3 library, over 40,000 songs. Speech synthesis and tag backup utilities. Scripts to maintain and organize song files.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2

    eNTranslator

    To aid translation of satsangs of Paramhamsa Nithyananda

    ...Time stamp information may be synched and a subtitle file or a simple textual output may be generated. Additionally it is planned to use google voice tools to also add voice over from these translated text. Finally the subtitle, translated audio (if any) would be muxed with the original video and uploaded.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3

    Accelerated Feature Extraction Tool

    A fast GPU accelerated feature extraction software for speech analysis

    A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present. The output files are stored in HTK format. The software is developed at Department of Cybernetics at University of West Bohemia in Pilsen.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    JAVIER

    JAvascript Voicexml InterpretER

    JAVIER is a JAvascript Voicexml InterpretER, designed (but not restricted) to run inside a web browser, its main engine has less than 1000 lines of code. It's maybe, the tiniest but (almost) FULL VoiceXML implementation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    A Python library to create sophisticated multilingual IVR applications. NOTICE. The repository is frozen, please find the latest version of the software at https://github.com/sippy/vapp
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    NoiseGator (Noise Gate)

    NoiseGator (Noise Gate)

    A simple noise gate app intended for use with VOIPs like Skype.

    Ever wanted to cut out background noise when talking with others on Skype? Now it's possible! NoiseGator is a light-weight noise gate application that routes audio through an audio input to an audio output. In real-time the audio level is analysed and if the average level is higher than the threshold the audio bypasses as normal. However, if the average level goes below the threshold, the gate closes and the audio is cut. When used with a virtual audio cable it can act as a noise gate for a either a sound input(microphone) or sound output(speakers). ...
    Leader badge
    Downloads: 544 This Week
    Last Update:
    See Project
  • 7

    High-order HMM in Matlab

    Implementation of duration high-order hidden Markov model in Matlab.

    Implementation of duration high-order hidden Markov model (DHO-HMM) in Matlab with application in speech recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Wrapper for vendors to simplify usage of the Java Speech API (JSR 113). Note that the spec is an untested early access and that there may be changes in the API.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    jaivox

    jaivox

    Speech recognition application builder and library

    Java library and tools to create open source speech recognition applications. Generates dialogs for conversational interfaces. Works with a popular open source speech recognition library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Go from Code to Production URL in Seconds Icon
    Go from Code to Production URL in Seconds

    Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

    Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.
    Try it free
  • 10
    SetFon focus is an interface web based for Praat resources (www.praat.org) wich focus speech sound annalysis; it is a gerent program for acoustic analysis PHP/Mysql based. Developed with the framework SIMP.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    Twiline

    Transcribe audio files to written text

    Simple program to assist transcriptions from recorded interviews. Resembles somewhat the early F4. <br/>Keywords: transcription transcribe interview
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Teamspeak 3 Love Plugin

    Teamspeak 3 Love Plugin

    A plugin for Teamspeak3. This plugin allows you to autofollow a user.

    A plugin for Teamspeak3. This plugin allows you to follow a user while he switches through channels. For the love menu just right click any name in the server view.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 13

    Steel TTS

    A cross-platform wrapper for common text-to-speech engines in Python

    Steel is a cross-platform package for using common text-to-speech (speech synthesis) engines in Python. Steel currently supports the following TTS software: - Microsoft Speech API 5 (SAPI5) - eSpeak - NS Speech Synthesis - FreeTTS Documentation: http://sourceforge.net/p/steeltts/wiki/ Bug Tracker: http://sourceforge.net/p/steeltts/tickets/ If you are interested in contributing to the Steel TTS codebase, or would like to make a feature-request, please contact the lead...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 14
    Cotovía

    Cotovía

    Text-to-Speech System for Galician and Spanish

    Cotovía is a unit-selection text-to-speech system for Galician and Spanish. Cotovía is distributed under the GPL3.0+ license, but each of the avaliable speaker voices has its own license. The speakers available at sourceforge are free for commercial and non-commercial uses. Another speaker, free for non-commercial uses, is avaliable through external links (see the Blog section). Cotovia has been developed by the University de Vigo and the center 'Ramón Piñeiro' for Research in Humanities,...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 15

    Bermuda Text-to-Speech

    This project includes basic NLP and DSP techniques for Text-to-Speech

    See TTS demo at: http://rslp.racai.ro/index.php?page=tts This is an entirely written in JAVA project which includes a set of tools and methods designed to enable Multilingual Text-to-Speech (TTS) synthesis. We currently support English and Romanian but we will soon train more models and make them available for download. If you want to read more about our other NLP and TTS tools check out http://nlptools.racai.ro.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 1 This Week
    Last Update:
    See Project
  • 17

    pyespeak

    Python to eSpeak speech synthesis

    ctypes Python module for eSpeak http://espeak.sf.net speech synthesis
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    AarTon
    AarTon is an automated text-to-speech application. It allows user to enter text in a web-based front-end and render these texts via a multi-channel sound card.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    de-ess

    de-ess

    De-essing software to reduce sibilance in speech using TSP

    This de-esser uses a novel approach called Temporal Sibilance Processing. The idea is to distinguish between fricatives and voiced sections of the speech signal by the number of zero crossings in time. Most of the speech file is left untouched (the samples are directly copied from source to destination). Only fricatives that are long enough and loud enough are filtered. The advantage of this approach over traditional approaches is that the clarity of the remaining speech is completely unaffected.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20

    Austrian German Voices for Festival

    Austrian voices for the Festival speech synthesis system

    Hidden Markov Model based voice models of Austrian German for the Festival speech synthesis system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    A speech synthesis and recognition library that is cross-platform, accessible from Java and C++, and has a very small API. Uses CMU Sphinx4 and FreeTTS internally.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 22
    J Subtitle Player is a program that plays SRT subtitle files on a translucent window. It's a great way to add a subtitle to a video that doesn't have any subtitle support. It can also be used with native and online streaming videos, such as, Netflix, Hulu, Amazon Prime videos, Google Play Movies, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    The project provides a ready-to-use interface for the julius CSR engine for a handicapped child which is not able to use the keyboard well. It integrates into X11 and Windows. Find out how you can help: http://simon-listens.org/index.php?support
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24

    HMM Speech Recognition in Java

    HMM Speech Recognition in Java

    HMM Speech Recognition in Java
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    HMM Speech Recognition in Matlab

    A speech recognition system using Matlab/Simulink/Stateflow.

    This project provide hidden Markov model speech recognition system by using Matlab/Simulink/Stateflow.
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB