Search Results for "audio processing" - Page 8

Showing 374 open source projects for "audio processing"

View related business solutions
  • Stop Storing Third-Party Tokens in Your Database Icon
    Stop Storing Third-Party Tokens in Your Database

    Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

    Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
    Try Auth0 for Free
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • 1
    hora

    hora

    Efficient approximate nearest neighbor search algorithm collections

    hora is an open-source high-performance vector similarity search library designed for large-scale machine learning and information retrieval systems. The project focuses on approximate nearest neighbor search, a fundamental technique used in modern AI applications such as recommendation systems, image search, and semantic search engines. Hora implements multiple efficient indexing algorithms that allow systems to rapidly search through high-dimensional vectors produced by machine learning...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Vivid 3D

    Vivid 3D

    Vivid is a modern C++ 3D engine using OpenGL4+

    Vivid is a modern C++ 3D engine using OpenGL4+. It is written using Visual C++ 2022, and relies on several open source projects to achieve it's goal of making it easy and run to make modern games with it.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    AAXtoMP3

    AAXtoMP3

    Convert Audible's .aax filetype to MP3, FLAC, M4A, or OPUS

    ...AAXtoMP3 supports batch processing, enabling users to convert multiple files in a single workflow. Its minimal setup and script-based usage make it suitable for automation and integration into personal media pipelines. Overall, it provides a practical solution for managing audiobook libraries in open formats.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    Telegram WebRTC (VoIP)

    Telegram WebRTC (VoIP)

    Voice chats, private incoming and outgoing calls in Telegram

    Telegram WebRTC (VoIP) is a Python and C++ library that enables real-time voice and video communication features for Telegram bots and clients. It provides an interface for joining, managing, and streaming audio or video in Telegram group calls and voice chats. The library is built on top of low-level communication protocols, ensuring efficient handling of real-time media streams. It supports integration with FFmpeg and other tools for processing audio and video before transmission. tgcalls allows developers to create bots that can play music, stream content, or interact with live voice channels programmatically. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5
    Beep

    Beep

    A little package that brings sound to any Go application

    A little package that brings sound to any Go application. Suitable for playback and audio processing. Beep is built on top of its Streamer interface, which is like io.Reader, but for audio. It was one of the best design decisions I've ever made and it enabled all the rest of the features to naturally come together with not much code. Decode and play WAV, MP3, OGG, and FLAC. Encode and save WAV. Very simple API. Limiting the support to stereo (two channel) audio made it possible to simplify the architecture and the API. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6
    Delphi ASIO & VST Packages
    With these packages for Delphi the user can easily create VST plugins or ASIO applications within minutes. The included algorithms for filters and dynamics help to built effects without much knowledge of digital signal processing.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    VAD

    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with ground truth labeling. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    Data augmentation

    Data augmentation

    List of useful data augmentation resources

    List of useful data augmentation resources. You will find here some links to more or less popular github repos, libraries, papers, and other information. Data augmentation can be simply described as any method that makes our dataset larger. To create more images for example, we could zoom in and save a result, we could change the brightness of the image or rotate it. To get a bigger sound dataset we could try to raise or lower the pitch of the audio sample or slow down/speed up....
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    ACToolkit's temporary homepage

    ACToolkit's temporary homepage

    Max-objects-&-patches for Algorithmic Composition and statistical DSP

    It was formerly jey-Toolkit. Now it's renamed and also being distributed by Cycling '74. Through the Files menu that appears above, the same package as released at the Package Manager of Max 7/8 and the beta-version update(s) are accessible. PLEASE NOTE that, on Max 6.1.10, some Jitter features and GEN~ objects that the patches included in the package use WON'T be working. The name of our package ACToolkit is a derivative of AC Toolbox, the legendary "Algorithmic music composition program...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build, govern, and optimize agents and models with Gemini Enterprise Agent Platform.
    Start Free
  • 10
    CHOW Phaser

    CHOW Phaser

    Phaser effect based loosely on the Schulte Compact Phasing 'A'

    ChowPhaser is an open-source audio plugin that emulates the classic Schulte Compact Phasing 'A' effect. It offers a unique phasing effect with nonlinear feedback and modulation capabilities, suitable for various audio processing applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Lumix Engine

    Lumix Engine

    3D C++ Game Engine - yet another open source game engine

    Lumix Engine is an open-source C++ 3D game engine that aims to strike a balance between performance, flexibility, and ease of use, making it a compelling alternative to heavier commercial engines. It features a modern, component-based design (entity-component-system) and offers a full editor — enabling developers to build games in a WYSIWYG style similar to engines like Unity, while staying in a lightweight, open environment. The engine supports advanced rendering (with PBR, HDR, deferred...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Denoiser

    Denoiser

    Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

    ...The implementation includes data augmentation techniques applied to the raw waveforms (e.g. noise mixing, reverberation) to improve model robustness and generalization to diverse noise types. The project supports both offline denoising (batch inference) and live audio processing (e.g. via loopback audio interfaces), making it practical for real-time use in calls or recording. The codebase includes training and evaluation scripts, configuration management via Hydra, and pretrained models on standard noise datasets.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    Live Transcribe Speech Engine

    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    ...Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. Partial hypotheses stream as words are recognized, then stabilize with minimal jitter as confidence increases, which is crucial for usability. The code emphasizes efficient use of CPU and neural accelerators to balance battery life with responsiveness. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    X32 Scene Parser

    X32 Scene Parser

    An X32 scene management tool

    This parsing tool can be used to extract sections of a Behringer X32 or Midas M32 scene file in order to create specialized snippets.
    Leader badge
    Downloads: 14 This Week
    Last Update:
    See Project
  • 15
    X-Air Scene Parser

    X-Air Scene Parser

    An X-Air scene management tool

    The X-Air/M-Air does not include snippets (like the X32), This parsing tool is a port of the X32 Scene Parser that can be used to create modified scene files which can function like snippets.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 16
    Savify

    Savify

    Download Spotify songs to mp3 with full metadata and cover art

    Savify is a command-line tool designed to download and archive music from Spotify by leveraging YouTube as the audio source while preserving Spotify metadata. It allows users to input playlists, albums, or individual tracks and automatically retrieves matching audio files with proper tagging. The tool integrates FFmpeg and yt-dlp to handle downloading, conversion, and formatting into common audio formats such as MP3. It enriches files with metadata including artist, album, cover art, and...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 17
    hplayer

    hplayer

    A multi-screen player using Qt + FFmpeg

    ...It focuses on providing a minimal yet functional implementation of video playback, including decoding, rendering, and synchronization. The project is structured as a learning resource, helping developers understand the fundamentals of multimedia pipelines. It supports common audio and video formats and includes playback controls for managing media streams. The architecture emphasizes performance and simplicity, using native libraries to achieve efficient playback. It also demonstrates integration between UI layers and low-level media processing components. Overall, it serves as a practical reference for building custom media players.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Perl Audio Converter

    Perl Audio Converter

    Linux Audio Converter / Tagger / CD Ripper

    ...It can also extract audio from the following video extensions: RM, RV, ASF, DivX, MPG, MKV, MPEG, AVI, MOV, OGM, OGV, QT, VCD, SVCD, M4V, NSV, NUV, PSP, SMK, VOB, FLV, WEBM and WMV. Parallel Processing, a CD ripping function with CDDB support, batch conversion, tag preservation for most supported formats, independent tag reading & writing, service menus for KDE Dolphin/Konqueror, Gnome Nautilus script, and action scripts for Nemo/Thunar are also provided.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 19
    RxFFmpeg

    RxFFmpeg

    FFmpeg 4.0 + X264 + mp3lame + fdk-aac + opencore-amr + openssl

    RxFFmpeg is a high-performance multimedia processing framework built on top of FFmpeg and optimized specifically for Android environments, enabling developers to perform complex audio and video editing operations programmatically. It integrates widely used encoding libraries such as x264, mp3lame, and fdk-aac, allowing it to support a broad range of media formats and transformations.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    Source code for "mda" audio processing plug-ins in VST format. Available for many years as closed-source freeware from mda-vst.com
    Leader badge
    Downloads: 197 This Week
    Last Update:
    See Project
  • 21
    quick-media

    quick-media

    media(audio/image/qrcode/markdown/html/svg/png) support

    quick-media is a lightweight multimedia processing toolkit designed to simplify common video and audio operations through streamlined command execution. It provides a wrapper around FFmpeg functionality, enabling users to perform tasks such as transcoding, clipping, and format conversion with simplified commands. The tool emphasizes ease of use while still allowing access to advanced encoding parameters when needed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    MultiPlayer-mini
    Application for realtime decoding & auralization of spatial audio formats. In development - some features not yet available.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Sysex Osc Generator

    Sysex Osc Generator

    A Sysex OSC hex string generator for the X32/X-Air/Wing digital mixers

    The Sysex OSC Generator provides a means of selecting a desired OSC command for the Behringer X32, X-Air or Wing digital mixer and generating the Sysex OSC hex string. This can be added to any midi device that supports sysex sending of commands. Available for the PC, Mac, linux (32 and 64bit) and Raspberry Pi platforms. Feedback of suggestions and bug reports that would improve the app would be appreciated.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 24
    Xabe.FFmpeg

    Xabe.FFmpeg

    .NET Standard wrapper for FFmpeg. It allows to process media

    Xabe.FFmpeg is a .NET library that provides a high-level wrapper for FFmpeg, allowing developers to perform multimedia operations using a clean and intuitive API. It simplifies complex command-line interactions by offering structured methods for tasks such as conversion, concatenation, and streaming. The library supports both synchronous and asynchronous execution, making it suitable for scalable applications. It includes utilities for retrieving media information through FFprobe, enabling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Tone.js

    Tone.js

    A Web Audio framework for making interactive music in the browser

    Tone.js is an open source Web Audio framework for creating interactive music in the browser. It has advanced scheduling capabilities, synths, effects and more. Tone.js is built to be useful and familiar to both musicians and audio programmers. It has common DAW (digital audio workstation) features for those looking to schedule events and tinker with pre-built synths and effects.
    Downloads: 4 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB