Search Results for "audio processing" - Page 4

200 projects for "audio processing" with 1 filter applied:

  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Enterprise-grade ITSM, for every business Icon
    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

    Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.
    Try it Free
  • 1
    wasmboy

    wasmboy

    Game Boy / Game Boy Color Emulator Library

    wasmboy is a Game Boy and Game Boy Color emulator built using WebAssembly and JavaScript, designed to run efficiently in both browsers and Node environments. It leverages modern web technologies such as HTML5 canvas and the Web Audio API to deliver graphics and sound directly within a web interface. The project emphasizes portability and integration, allowing it to be embedded into other applications as a reusable dependency. It supports a wide range of emulator features including save...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Piano transcription

    Piano transcription

    Task of transcribing piano recordings into MIDI files

    Piano transcription is an open-source high-resolution piano transcription system by ByteDance that converts raw audio recordings of piano performance into symbolic MIDI files — detecting note onsets, offsets, pitch, velocity, and even pedal usage. The system is implemented in Python (PyTorch) and is capable of accurate transcription of polyphonic piano recordings, even with complex passages and pedal techniques, making it suitable for classical piano music. By using this transcription tool,...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 3

    AhoTTS - TTS for Basque and Spanish

    Text-to-Speech for Basque and Spanish

    Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    AutoSub

    AutoSub

    A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

    AutoSub is a Python-based tool designed to automatically generate subtitles for video or audio content using speech recognition technology. It processes media files by extracting audio, transcribing spoken content, and generating subtitle files in standard formats. The tool supports multiple languages and can integrate with translation systems to produce subtitles in different languages. It is designed for automation, allowing batch processing of multiple media files. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    Music Source Separation

    Music Source Separation

    Separate audio recordings into individual sources

    Music Source Separation is a PyTorch-based open-source implementation for the task of separating a music (or audio) recording into its constituent sources — for example isolating vocals, instruments, bass, accompaniment, or background from a mixed track. It aims to give users the ability to take any existing song and decompose it into separate stems (vocals, accompaniment, etc.), or to train custom separation models on their own datasets (e.g. for speech enhancement, instrument isolation, or...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    SVoice (Speech Voice Separation)

    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    ...The repository includes all necessary scripts for training, dataset preparation, distributed training, evaluation, and audio separation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    hora

    hora

    Efficient approximate nearest neighbor search algorithm collections

    hora is an open-source high-performance vector similarity search library designed for large-scale machine learning and information retrieval systems. The project focuses on approximate nearest neighbor search, a fundamental technique used in modern AI applications such as recommendation systems, image search, and semantic search engines. Hora implements multiple efficient indexing algorithms that allow systems to rapidly search through high-dimensional vectors produced by machine learning...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    AAXtoMP3

    AAXtoMP3

    Convert Audible's .aax filetype to MP3, FLAC, M4A, or OPUS

    ...AAXtoMP3 supports batch processing, enabling users to convert multiple files in a single workflow. Its minimal setup and script-based usage make it suitable for automation and integration into personal media pipelines. Overall, it provides a practical solution for managing audiobook libraries in open formats.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 9
    Delphi ASIO & VST Packages
    With these packages for Delphi the user can easily create VST plugins or ASIO applications within minutes. The included algorithms for filters and dynamics help to built effects without much knowledge of digital signal processing.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    VAD

    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with ground truth labeling. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    CHOW Phaser

    CHOW Phaser

    Phaser effect based loosely on the Schulte Compact Phasing 'A'

    ChowPhaser is an open-source audio plugin that emulates the classic Schulte Compact Phasing 'A' effect. It offers a unique phasing effect with nonlinear feedback and modulation capabilities, suitable for various audio processing applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Denoiser

    Denoiser

    Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

    ...The implementation includes data augmentation techniques applied to the raw waveforms (e.g. noise mixing, reverberation) to improve model robustness and generalization to diverse noise types. The project supports both offline denoising (batch inference) and live audio processing (e.g. via loopback audio interfaces), making it practical for real-time use in calls or recording. The codebase includes training and evaluation scripts, configuration management via Hydra, and pretrained models on standard noise datasets.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 13
    Live Transcribe Speech Engine

    Live Transcribe Speech Engine

    Live Transcribe is an Android application

    ...Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. Partial hypotheses stream as words are recognized, then stabilize with minimal jitter as confidence increases, which is crucial for usability. The code emphasizes efficient use of CPU and neural accelerators to balance battery life with responsiveness. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    hplayer

    hplayer

    A multi-screen player using Qt + FFmpeg

    ...It focuses on providing a minimal yet functional implementation of video playback, including decoding, rendering, and synchronization. The project is structured as a learning resource, helping developers understand the fundamentals of multimedia pipelines. It supports common audio and video formats and includes playback controls for managing media streams. The architecture emphasizes performance and simplicity, using native libraries to achieve efficient playback. It also demonstrates integration between UI layers and low-level media processing components. Overall, it serves as a practical reference for building custom media players.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Perl Audio Converter

    Perl Audio Converter

    Linux Audio Converter / Tagger / CD Ripper

    ...It can also extract audio from the following video extensions: RM, RV, ASF, DivX, MPG, MKV, MPEG, AVI, MOV, OGM, OGV, QT, VCD, SVCD, M4V, NSV, NUV, PSP, SMK, VOB, FLV, WEBM and WMV. Parallel Processing, a CD ripping function with CDDB support, batch conversion, tag preservation for most supported formats, independent tag reading & writing, service menus for KDE Dolphin/Konqueror, Gnome Nautilus script, and action scripts for Nemo/Thunar are also provided.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 16
    Source code for "mda" audio processing plug-ins in VST format. Available for many years as closed-source freeware from mda-vst.com
    Leader badge
    Downloads: 180 This Week
    Last Update:
    See Project
  • 17
    quick-media

    quick-media

    media(audio/image/qrcode/markdown/html/svg/png) support

    quick-media is a lightweight multimedia processing toolkit designed to simplify common video and audio operations through streamlined command execution. It provides a wrapper around FFmpeg functionality, enabling users to perform tasks such as transcoding, clipping, and format conversion with simplified commands. The tool emphasizes ease of use while still allowing access to advanced encoding parameters when needed.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    Sysex Osc Generator

    Sysex Osc Generator

    A Sysex OSC hex string generator for the X32/X-Air/Wing digital mixers

    The Sysex OSC Generator provides a means of selecting a desired OSC command for the Behringer X32, X-Air or Wing digital mixer and generating the Sysex OSC hex string. This can be added to any midi device that supports sysex sending of commands. Available for the PC, Mac, linux (32 and 64bit) and Raspberry Pi platforms. Feedback of suggestions and bug reports that would improve the app would be appreciated.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 19
    Xabe.FFmpeg

    Xabe.FFmpeg

    .NET Standard wrapper for FFmpeg. It allows to process media

    Xabe.FFmpeg is a .NET library that provides a high-level wrapper for FFmpeg, allowing developers to perform multimedia operations using a clean and intuitive API. It simplifies complex command-line interactions by offering structured methods for tasks such as conversion, concatenation, and streaming. The library supports both synchronous and asynchronous execution, making it suitable for scalable applications. It includes utilities for retrieving media information through FFprobe, enabling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    LiVES

    LiVES

    LiVES is a Video Editing System. It is designed to be simple to use, y

    LiVES mixes realtime video performance and non-linear editing in one professional quality application. It is designed to be simple to use, yet powerful. It is small in size, yet it has many advanced features. Using LiVES, you can start editing and making video right away, without having to worry about formats, frame sizes, or framerates. It is a very flexible tool which is used by both professional VJ's and video editors - mix and switch clips from the keyboard, use dozens of realtime...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 21
    Jacktube is an audio/MIDI processing program using LADSPA plugins.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    ffmpeg.js

    ffmpeg.js

    Port of FFmpeg with Emscripten

    ffmpeg.js is a JavaScript port of the FFmpeg multimedia framework compiled with Emscripten, enabling video and audio processing directly within browsers or Node.js environments. It provides prebuilt modules optimized for web use, balancing performance and file size while supporting common encoding and decoding tasks. By running entirely in JavaScript through asm.js, it allows developers to manipulate media files without requiring native binaries or server-side processing. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    videoshow

    videoshow

    Simple node.js utility to create video slideshows from images

    videoshow is a Node.js utility designed to create video slideshows from a sequence of images using FFmpeg as its processing engine. It allows developers to programmatically generate videos by combining images with optional audio tracks, subtitles, and visual transitions. The tool supports customization of parameters such as frame rate, resolution, bitrate, and codecs, enabling flexible output configurations. It includes both a programmatic API and a command-line interface, making it adaptable for different workflows. videoshow processes media efficiently and is used in production environments to generate large volumes of videos automatically. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    MediaToolkit

    MediaToolkit

    A .NET library to convert and process all your video & audio files

    MediaToolkit is a .NET library designed to simplify multimedia processing tasks by providing an easy-to-use interface over FFmpeg functionality. It allows developers to perform operations such as video conversion, thumbnail generation, and metadata extraction without dealing with raw command-line syntax. The library supports common media workflows, making it suitable for backend services and desktop applications. It provides structured APIs for configuring encoding parameters and handling...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Ecasound is a software package designed for multitrack audio processing. It can be used for simple tasks like audio playback, recording and format conversions, as well as for multitrack effect processing, mixing, recording and signal recycling. Ecasound
    Downloads: 0 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB