Showing 26 open source projects for "audio separation"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 1
    Ultimate Vocal Remover (UVR5)

    Ultimate Vocal Remover (UVR5)

    GUI for a Vocal Remover that uses Deep Neural Networks

    This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).
    Downloads: 829 This Week
    Last Update:
    See Project
  • 2
    OpenVINO AI Plugins for Audacity

    OpenVINO AI Plugins for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity. These AI features run 100% locally on your PC, no internet connection is necessary. OpenVINO™ is used to run AI models on supported accelerators found on the user's system such as CPU, GPU, and NPU.
    Downloads: 127 This Week
    Last Update:
    See Project
  • 3
    Librosa

    Librosa

    Python library for audio and music analysis

    Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 4
    audioFlux

    audioFlux

    A library for audio and music analysis, feature extraction

    ...It can be provided to deep learning networks for training and is used to study various tasks in the audio field such as Classification, Separation, Music Information Retrieval(MIR) ASR, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    MusicFreePlugins

    MusicFreePlugins

    MusicFreePlayPlugin

    The MusicFreePluginsc project is a collection and framework for plugins that extend the functionality of the MusicFree ecosystem by providing access to various music sources and features. It defines a standardized interface for plugin development, allowing contributors to implement features such as search, playback, and metadata retrieval. The system is designed to be modular, enabling users to install, update, and manage plugins independently of the core application. It supports multiple...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 6
    Voice-Pro

    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
    Downloads: 25 This Week
    Last Update:
    See Project
  • 7
    Whisper-WebUI

    Whisper-WebUI

    A Web UI for easy subtitle using whisper model

    ...Whisper WebUI also includes advanced preprocessing and postprocessing features such as voice activity detection, background music separation, and speaker diarization, enabling more accurate and structured outputs.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 8
    TTS WebUI

    TTS WebUI

    A single Gradio + React WebUI with extensions for ACE-Step

    TTS-WebUI is a unified Gradio + React web interface that brings together a large ecosystem of text-to-speech, voice conversion, and audio generation models under a single UI. It supports a wide range of models such as Bark, MusicGen, Tortoise, RVC, StyleTTS2, ParlerTTS, CosyVoice, XTTSv2, Stable Audio, SeamlessM4T, and many others, exposing them as interchangeable backends for speech and music synthesis. The project provides an installer that sets up Conda, Python environments, and all...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 9
    FL Studio

    FL Studio

    The best music production software

    FL Studio by Image-Line is a complete digital audio workstation (DAW) designed to turn your musical ideas into professional-quality tracks with ease. Trusted by world-renowned artists like Martin Garrix, Mustard, and Boi-1da, it’s built for producers of every genre — from EDM and hip-hop to pop and cinematic music. With an intuitive interface and powerful workflow, you can start creating in minutes while exploring endless possibilities as your skills grow. FL Studio comes packed with over...
    Downloads: 6,415 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    vocal-separate

    vocal-separate

    An extremely simple tool for separating vocals and background music

    ...Users can drag and drop an audio or video file onto the interface to begin separation, choosing between two, four, or five stems, which allows isolating specific components like vocals, bass, drums, or piano depending on the chosen model. After processing, the tool outputs separate WAV files for each extracted stem, making it easy to export and use in audio editing or remix software.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11
    Ultimate Mono2Stereo

    Ultimate Mono2Stereo

    A better mono to stereo audio conversion tool

    What sets this frontend for ffmpeg apart from other stereo simulators is it creates an impression of stereo separation. Without trying too hard to be "true" stereo in a gimmicky fashion. More importantly, it produces none of the weird phasing, and/or time delay artifacts. And very little, if any of the tone discoloration, when those others aren't meticulously set up just right. All this is accomplished by using the ffmpeg crossover audio filter to split the sound into 8 frequency bands. ...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 12
    Audio AI Timeline

    Audio AI Timeline

    A timeline of the latest AI models for audio generation

    Audio AI Timeline is a curated project that organizes the development of audio-related artificial intelligence into a structured and accessible historical timeline. Rather than functioning as a model training framework, it serves as an informational resource that maps key papers, systems, models, datasets, and milestones across areas such as speech synthesis, music generation, audio understanding, source separation, and general audio machine learning. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    Asteroid

    Asteroid

    The PyTorch-based audio source separation toolkit for researchers

    The PyTorch-based audio source separation toolkit for researchers. Pytorch-based audio source separation toolkit that enables fast experimentation on common datasets. It comes with a source code thats supports a large range of datasets and architectures, and a set of recipes to reproduce some important papers. Building blocks are thought and designed to be seamlessly plugged together.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    Demucs

    Demucs

    Code for the paper Hybrid Spectrogram and Waveform Source Separation

    Demucs (Deep Extractor for Music Sources) is a deep-learning framework for music source separation—extracting individual instrument or vocal tracks from a mixed audio file. The system is based on a U-Net-like convolutional architecture combined with recurrent and transformer elements to capture both short-term and long-term temporal structure. It processes raw waveforms directly rather than spectrograms, allowing for higher-quality reconstruction and fewer artifacts in separated tracks. ...
    Downloads: 94 This Week
    Last Update:
    See Project
  • 15
    MISST

    MISST

    A local GUI music source separation tool built on Tkinter and public F

    MISST : Music/Instrumental Stem Separation Tool. This application uses state-of-the-art source separation models to extract the 4 core stems from audio files (Bass, Drums, Other Instrumentals and Vocals). But it is not limited to this. MISST acts as a developped music player aswell, fit to enjoy and medal with your audio files as you see fit. MISST even comes prepared to import songs and playlists directly from your music library.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16

    audioFlux

    A library for audio and music analysis, feature extraction.

    audioflux is a deep learning tool library for audio and music analysis, feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training, and is used to study various tasks in the audio field such as Classification, Separation, Music Information Retrieval(MIR) and ASR etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Music Source Separation

    Music Source Separation

    Separate audio recordings into individual sources

    Music Source Separation is a PyTorch-based open-source implementation for the task of separating a music (or audio) recording into its constituent sources — for example isolating vocals, instruments, bass, accompaniment, or background from a mixed track. It aims to give users the ability to take any existing song and decompose it into separate stems (vocals, accompaniment, etc.), or to train custom separation models on their own datasets (e.g. for speech enhancement, instrument isolation, or other audio-separation tasks). ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    SVoice (Speech Voice Separation)

    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    ...Separate models are trained for different speaker counts, and the largest-capacity model dynamically determines the actual number of speakers in a mixture. The repository includes all necessary scripts for training, dataset preparation, distributed training, evaluation, and audio separation.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 19
    Spleeter

    Spleeter

    Deezer source separation library including pretrained models

    ...Spleeter is also very fast as it can perform separation of audio files to 4 stems 100x faster than real-time when run on a GPU. We designed Spleeter so you can use it straight from command line as well as directly in your own development pipeline as a Python library. It can be installed with Conda, with pip or be used with Docker.
    Downloads: 61 This Week
    Last Update:
    See Project
  • 20
    NanoDSP Open Source DSP

    NanoDSP Open Source DSP

    Audio Enhancer for Windows and Embedded Platform

    NanoDSP is designed for embedded operation and aims for low CPU load. Main functions 1. Bass amplification using a quadratic curve Generates odd and even harmonics, and uses human auditory psychology missing fundamentals to create the illusion of bass enhancement. By using a quadratic curve for the bass component, distortion is created and harmonics are generated to amplify the bass. 2. Separation of bass and Mid, treble using moving average By using the moving average, the...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    ManyEars
    ManyEars implements real-time microphone array processing to perform sound source localisation, tracking and separation. It was designed for mobile robot audition in dynamic environments. NOTE: Development will continue on github : https://github.com/introlab/manyears
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Streamripper
    Streamripper records shoutcast-compatible streams. For shoutcast style streams it finds the "meta data" or track separation data, and uses that as a marker for where the track should be separated.
    Leader badge
    Downloads: 120 This Week
    Last Update:
    See Project
  • 23
    ISSE

    ISSE

    An Interactive Source Separation Editor

    In applications such as audio denoising, music transcription, music remixing, and audio-based forensics, it is desirable to decompose a single-channel recording into its respective sources. To perform such tasks, we present a new software tool to perform source separation by painting on time-frequency visualizations of sound. Initial results shows the software can achieve state-of-the-art separation results compared to prior work.
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 24
    StationRipper, based off of streamripper, records up to 300 simultaneous Shoutcast streams. Supports Track Separation, Scheduling, Memory Recording, play integration via Shoutcast.com, MP3 player access, easy rip restarts, and a music library manager.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 25
    TacitPixel is a scalable C++ framework for building application prototypes for interactive collaborative VR/AR applications. It provides a number of features that make prototyping fast and portable. TacitPixel is the successor of the TAP and Twisted Pair engines.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB