59 projects for "audio processing" with 2 filters applied:

  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Cloud tools for web scraping and data extraction Icon
    Cloud tools for web scraping and data extraction

    Deploy pre-built tools that crawl websites, extract structured data, and feed your applications. Reliable web data without maintaining scrapers.

    Automate web data collection with cloud tools that handle anti-bot measures, browser rendering, and data transformation out of the box. Extract content from any website, push to vector databases for RAG workflows, or pipe directly into your apps via API. Schedule runs, set up webhooks, and connect to your existing stack. Free tier available, then scale as you need to.
    Explore 10,000+ tools
  • 1
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    ...It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Membrane Core

    Membrane Core

    The core of Membrane Framework, multimedia processing framework

    membrane_core is the foundation of the Membrane multimedia framework for Elixir, providing the abstractions and runtime needed to build real-time audio and video pipelines. It models media processing as a graph of lightweight, supervised OTP processes—elements connected by links—so work is isolated, fault-tolerant, and easy to scale or reconfigure at runtime. The core defines a clear lifecycle and callback API for elements, plus concepts like buffers, events, and capabilities/format negotiation to keep components interoperable and type-safe. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    bfxr

    bfxr

    Flash + AIR sound effects generator. Based on Sfxr.

    ...Its purpose is to enable users, especially game developers and sound designers, to quickly generate retro, 8-bit/“chiptune” style sound effects (“bleeps”, “booms”, “zaps”, etc.) without deep knowledge of audio signal processing. It offers an interactive GUI through which you can tweak many parameters (oscillators, envelopes, filters, etc.) to sculpt custom sound effects; you can preview in real time, export, and iterate. The project includes libraries, HTML templates, and both ActionScript and JavaScript code. It has been well-received (over a thousand stars), but as of 2025, it has been superseded by a newer version called bfxr2, which is a JavaScript reworking of the original.
    Downloads: 15 This Week
    Last Update:
    See Project
  • 4
    ffmpeg.wasm

    ffmpeg.wasm

    FFmpeg for browser, powered by WebAssembly

    ffmpeg.wasm is a pure WebAssembly (and JavaScript/TypeScript) port of FFmpeg that enables in-browser media recording, conversion, and streaming—letting developers perform video/audio processing entirely client-side without server uploads. Transpiled via Emscripten from FFmpeg and its codecs into WebAssembly. Supports both single-threaded and multi-threaded cores using web workers. Written in TypeScript for improved developer experience.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Reach Your Audience with Rise Vision, the #1 Cloud Digital Signage Software Solution Icon
    Reach Your Audience with Rise Vision, the #1 Cloud Digital Signage Software Solution

    K-12 Schools, Higher Education, Businesses, Restaurants

    Rise Vision is the #1 digital signage company, offering easy-to-use cloud digital signage software compatible with any player across multiple screens. Forget about static displays. Save time and boost sales with 500+ customizable content templates for your screens. If you ever need help, get free training and exceptionally fast support.
    Learn More
  • 5
    Live API Web Console

    Live API Web Console

    A react-based starter app for using the Live API over websockets

    ...It ships with demo branches that show grounded search, function calling, and visualization—one example has the model calling a function that renders Vega/Altair graphs directly in the UI. Under the hood there’s an event-emitting WebSocket client, an audio in/out processing layer, and a minimal scaffolded view so you can focus on your app logic rather than wiring.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    WildMidi is a midi processing library and a midi player using the gus patch set.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 7
    Drumstick Libraries

    Drumstick Libraries

    MIDI libraries for Qt/C++

    Drumstick is a tool to play music. This is a set of C++ MIDI libraries using Qt5 objects, idioms and style. It contains a C++ wrapper around the ALSA library sequencer interface; ALSA sequencer provides software support for MIDI technology on Linux. A complementary library provides classes for SMF (Standard MIDI files: .MID/.KAR), and Cakewalk (.WRK) file formats processing. A multiplatform realtime MIDI I/O library is also provided.
    Leader badge
    Downloads: 31 This Week
    Last Update:
    See Project
  • 8
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    Data Crow

    Data Crow

    The ultimate cataloguer

    Data Crow allows you to use the standard movie & video (divx, xvid, DVD, Blu-ray, etc), book (and eBooks), images, board games, comic books, games & software, music (mp3 and other music files) cataloguing modules. Besides these modules, which you can change to fit your requirements, you can create new modules (want to catalogue your stamps, equipment, or anything else?). The GUI is skinnable. Reporting (using JasperReports and their community edition JasperSoft Developer Studio ), loan...
    Leader badge
    Downloads: 339 This Week
    Last Update:
    See Project
  • eProcurement Software Icon
    eProcurement Software

    Enterprises and companies seeking a solution to manage all their procurement operations and processes

    eBuyerAssist by Eyvo is a cloud-based procurement solution designed for businesses of all sizes and industries. Fully modular and scalable, it streamlines the entire procurement lifecycle—from requisition to fulfillment. The platform includes powerful tools for strategic sourcing, supplier management, warehouse operations, and contract oversight. Additional modules cover purchase orders, approval workflows, inventory and asset management, customer orders, budget control, cost accounting, invoice matching, vendor credit checks, and risk analysis. eBuyerAssist centralizes all procurement functions into a single, easy-to-use system—improving visibility, control, and efficiency across your organization. Whether you're aiming to reduce costs, enhance compliance, or align procurement with broader business goals, eBuyerAssist helps you get there faster, smarter, and with measurable results.
    Learn More
  • 10
    SPTK is a suite of speech signal processing tools for UNIX environments, e.g., LPC analysis, PARCOR analysis, LSP analysis, PARCOR synthesis filter, LSP synthesis filter, vector quantization techniques, and other extended versions of them.
    Downloads: 22 This Week
    Last Update:
    See Project
  • 11

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Delphi ASIO & VST Packages
    With these packages for Delphi the user can easily create VST plugins or ASIO applications within minutes. The included algorithms for filters and dynamics help to built effects without much knowledge of digital signal processing.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 13
    VAD

    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with ground truth labeling. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Source code for "mda" audio processing plug-ins in VST format. Available for many years as closed-source freeware from mda-vst.com
    Leader badge
    Downloads: 290 This Week
    Last Update:
    See Project
  • 15
    Sysex Osc Generator

    Sysex Osc Generator

    A Sysex OSC hex string generator for the X32/X-Air/Wing digital mixers

    The Sysex OSC Generator provides a means of selecting a desired OSC command for the Behringer X32, X-Air or Wing digital mixer and generating the Sysex OSC hex string. This can be added to any midi device that supports sysex sending of commands. Available for the PC, Mac, linux (32 and 64bit) and Raspberry Pi platforms. Feedback of suggestions and bug reports that would improve the app would be appreciated.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16

    AhoTTS Multilingual, a Multilingual TTS

    Text-to-Speech TTS for Basque, Spanish, Catalan, Galician and English

    Text-to-Speech conversor for Basque, Spanish, Catalan, Galician and English. It includes linguistic processing and built voices for all the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    GTK+ IOStream

    GTK+ IOStream

    GTK+iostream, Data plots, ORB, Neural Networks, WSOLA

    ...You can also plot like so : Plot figure; figure.plot(x, y, x.size()); Plotting utilises GtkDataBox.sf.net ORB utilises www.zeroc.com Feed forward neural networks with different activation functions. Audio playback/recording and digital signal processing (DSP) - utilising Jack www.jackaudio.org
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18

    Xamp

    This is Xamp, a secure and platform independend media player.

    This is Xamp, the next generation of a skinnable and platform independent media player which based on the Doppio media framework: https://sourceforge.net/projects/doppio. The player supports various pluggable video and audio formats. Xamp is primarily intend to run as a native executable but can also run as a Java player version working with Java 1.1 VM or above. To ensure the best user experience, the Java version of Xamp doesn't use the Java Media Framework, the high level part of the Java Sound API, the high level part of the Java Swing API, or other high level API's: http://maikmerten.livejournal.com/2015/05/13 Development goals are: -Superior sound quality -Combines audio player with media player -Processing of insecure media files -Performance -Reliability -Open codec support -Click to ready installation Supports currently: Ogg Sceleton Ogg Comment ID3 Ogg Vorbis WAV, SND, AU, AIFF Ogg Theora MP1, MP2, MP3 Mpeg1, Mpeg2 M3U, PLZ WSZ
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Icarus Scene Engine

    Icarus Scene Engine

    3D .NET engine for Windows, Linux, Mac

    Icarus Scene Engine is a cross-platform, CLS-compliant 3D framework, integrating open source APIs into a cohesive all-open-source, all .NET solution. For Windows, MacOSX & Linux. Uses OpenTK, OpenGL, OpenAL APIs for graphics and audio. Runs on Mono/.NET. Links to FFMpeg for streaming video, and FreeType as a temporary conversion mod to bring fonts into a native XML format. Binary builds of Icarus will be infrequent, roughly every 18 months or so, the GIT code updates regularly on a...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    MARF is a general cross-platform framework with a collection of algorithms for audio (voice, speech, and sound) and natural language text analysis and recognition along with sample applications (identification, NLP, etc.) of its use, implemented in Java.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21

    Accelerated Feature Extraction Tool

    A fast GPU accelerated feature extraction software for speech analysis

    A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    NASPRO
    NASPRO, acronym for "NASPRO Architecture for Sound PROcessing" is a collection of free and open source sound processing software built around the LV2 plugin standard.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 23
    PNP is a program for tutorial purposes in the signal processing area. It allows to set, move and remove poles and zeros of an one dimensional digital filter. The program shows different effects of the filter in the frequency domain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    InproTK

    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    MplayerXP is branch of well known mplayer (http://mplayerhq.hu) which is based on new (thread based) core. Main goal of this project is to achieve smoothness of video playback due monotonous CPU loading.
    Downloads: 8 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • Next