Showing 62 open source projects for "audio"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Defox text to speech and downloader

    Defox text to speech and downloader

    Written or imported text offline read or online download.

    This software design to convert text to speech and download the converted speech. Description : • Installation setup with two languages (English, French) • Two areas called text reading and speech downloading • Many languages supported to download center Note 1: I'm a student yet and I'm not in the software designing industry. Therefore maybe I haven't software making skills. I'm worried about that. ! Note 2 : When you double click on the software maybe it will get some seconds...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    aeneas

    aeneas

    Automagically synchronize audio and text (aka forced alignment)

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 3

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Python Audio Tools are a collection of audio handling programs. These include programs for CD extraction, track conversion from one audio format to another, track renaming and retagging, track identification, CD burning from tracks, and more.
    Leader badge
    Downloads: 24 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 5

    Tygamusic

    A pygame music lib.

    This lib was produced while I was programming an other program/game. I was tired of pygame's bad system of handling playlists and the management of music in general. With this lib I want to create an layer that allows you to interact with the music, how you would expect it. Currently featuring: -Playlist -Normal pausing and resuming (played time isn’t lost when new song is loaded) -Automatic recognition of songs and adding them to a separate list
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    A Python library to create sophisticated multilingual IVR applications. NOTICE. The repository is frozen, please find the latest version of the software at https://github.com/sippy/vapp
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    DR14 T.meter

    DR14 T.meter

    Compute the DR14 of a given audio file according to the procedure desc

    Introduction to DR14 T.meter DR14 T.meter is a free and opens source command line tool for computing the Dynamic Range of your music according to the procedure used in the off-line meter released by the Pleasurize Music Foundation. This tool is very useful to measure how is loud your music and for understanding that a good quality album always has also a good dynamic, and it's also useful for understanding the effects of the so called loudness war. Dr14 t.meter is released under the...
    Leader badge
    Downloads: 9 This Week
    Last Update:
    See Project
  • 8
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    RNNLIB is a recurrent neural network library for sequence learning problems. Applicable to most types of spatiotemporal data, it has proven particularly effective for speech and handwriting recognition. full installation and usage instructions given at http://sourceforge.net/p/rnnl/wiki/Home/
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 10
    This is a tutorial about how to create a video player. This tutorial pretends to be a brief introduction to the Graphics User Interfaces (GUI) programming, using the Python programming lenguage and Qt framework, both multiplataform. In this tutorial we'll use PyQt to show you the base of Qt programming under Pyhon. To make this tutorial even more funny, because you can learn the theorical side of Qt programming just reading the documentation, I will focus it only in the practical...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11

    EnDiskEx

    Bulk extractor for Ensoniq-formatted disk images

    EnDiskEx is a command-line tool that bulk extracts instruments, sequences, songs, and banks from Ensoniq-formatted disk images (RAW, GKH, EDE, and EDA) for the EPS/ASR family of samplers. The extracted files are saved as EFE / SMF / TXT files. EnDiskEx is designed to extract Ensoniq banks for re-creation within a different DAW. It will track down the instrument and song files from bank references even if they were saved on another disk. There also exists a disk mapping feature to...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    VoiceCode is an Open Source initiative started by the National Research Council of Canada, to develop a programming by voice toolbox. The aim of the project is to make programming through voice input as easy and productive as with mouse and keyboard. For install, Use subversion, as described in this page: http://sourceforge.net/apps/mediawiki/voicecode/index.php?title=VCode_1_Doc/InstallationManual.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Strasheela is a constraint-based music composition system. The user defines music theories by sets of compositional rules and the system creates music which complies with these theories. User-interface is the programming language Oz.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Linderdaum Engine
    Linderdaum Engine is an open source purely object-oriented 3D gaming engine written in C++. It is designed to be an integrated solution for the development of interactive 3D applications, for game, industrial and scientific visualization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Elucidation is a Python module designed to be an extremely powerful backend for audio and video converters. The aim of the module is to do all the heavy lifting while applications using it are little more than interfaces to it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    These bindings allow you to use the fmod ex sound library from python with nice python api. You can (or, if i must say the truth, will be able to) use any feature you like. Now we're in phase where everyone who wants to help would be appreciated.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    JackPy
    Pure Python bindings for JACK Audio
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Set of scripts and class file(s) written in Python that builds upon the somewhat limited rhythmbox development API that communicates using DBUS, adding some very useful functionality that currently isn't available.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    TappedAudio is an audio library with an emphasis on the programming interface -- agnostic to the backend, written in highly-encapsulated C++ and using SWIG to present a unified API for C++, Python, Ruby, etc., regardless of the underlying audio platform.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    figusdevpack (fdp), is designed for reducing application development time and acquainting new users with existing resources for multimedia development. It is structured around a wrapper library, example applications, tutorials and documentation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    QWave: Qt-based waveform display and audio playback class library.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Synthesia is an open hardware/software platform intended for creating standalone audio devices such as synthesizers on embedded processors.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    Convert text to International Morse Code. Input is ASCII text. Output can be: - . -..- - on the console, raw 8bit PCM suitable for piping to /dev/audio, .wav files or even (mp3|ogg). Good for headlines on your MP3 player or code practice.
    Leader badge
    Downloads: 68 This Week
    Last Update:
    See Project
  • 24
    uListen is a TTS(Text To Speech) application. It can TALK you the web pages, chm files, pdf files and word files and plain text files.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    panzi's usefull little (python) scripts (puls) are a set of small and usefull scripts (dcop, i18n, a amarok-plugin etc.).
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB