Search Results for "audio source separation" - Page 4

Showing 5991 open source projects for "audio source separation"

  • 1
    Sonic Pi

    Sonic Pi is your free code-based music creation and performance tool

    Sonic Pi is a new kind of musical instrument. Instead of strumming strings or whacking things with sticks - you write code, live. Sonic Pi is a complete open source programming environment originally designed to explore and teach programming concepts within schools through the process of creating new sounds. In addition to being an engaging education resource it has evolved into an extremely powerful and performance-ready live coding instrument suitable for professional artists and DJs....
    Downloads: 21 This Week
  • 2
    S&box

    s&box is a modern game engine, built on Valve's Source 2

    S&box is the open-source codebase for s&box, a next-generation sandbox game development platform from the creators of Garry’s Mod that blends modding freedom with modern tooling and performance. Built on a cutting-edge game engine, s&box allows creators to prototype, build, and share interactive game modes, tools, and environments using C#, JavaScript, and visual scripting, promoting accessible content creation for developers of varying skill levels.
    Downloads: 171 This Week
  • 3
    koel

    A personal music streaming server that simply works

    koel is a simple, open source, web-based personal audio streaming service created with some of the more modern technologies that appeal to web developers. It embraces technologies such as CSS grid and the drag-and-drop API to deliver a music streaming application that simply works. koel is written in Vue on the client side and Laravel on the server side. It requires a server that can run Laravel 5.8 and a modern browser that supports newer web technologies.
    Downloads: 4 This Week
  • 4
    SenseVoice

    Multilingual speech recognition and audio understanding model

    SenseVoice is a speech foundation model designed to perform multiple voice understanding tasks from audio input. It provides capabilities such as automatic speech recognition, spoken language identification, speech emotion recognition, and audio event detection within a single system. SenseVoice is trained on more than 400,000 hours of speech data and supports over 50 languages for multilingual recognition tasks. It is built to achieve high transcription accuracy while maintaining efficient...
    Downloads: 11 This Week
  • 5
    Basic Pitch

    A lightweight audio-to-MIDI converter with pitch bend detection

    Basic Pitch is a Python library for Automatic Music Transcription (AMT), using a lightweight neural network developed by Spotify's Audio Intelligence Lab. It's small, easy-to-use, pip install-able and npm install-able via its sibling repo. Basic Pitch may be simple, but it is far from "basic"! basic-pitch is efficient and easy to use, and its multi-pitch support, its ability to generalize across instruments, and its note accuracy compete with much larger and more resource-hungry AMT systems....
    Downloads: 42 This Week
  • 6
    gm

    R Package for Music Score and Audio Generation

    Create music easily, and show musical scores and audio files in R Markdown documents, R Jupyter Notebooks and RStudio.
    Downloads: 0 This Week
  • 7
    WhisperJAV

    Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD

    WhisperJAV is an open-source speech transcription pipeline designed specifically for generating subtitles for Japanese adult video content. The project addresses challenges that standard speech recognition models face when transcribing this type of audio, which often includes low signal-to-noise ratios and large numbers of non-verbal vocalizations. Traditional automatic speech recognition systems can misinterpret these sounds as words, leading to inaccurate transcripts. ...
    Downloads: 18 This Week
  • 8
    VibeVoice

    Open-source multi-speaker long-form text-to-speech model

    VibeVoice-1.5B is Microsoft’s frontier open-source text-to-speech (TTS) model designed for generating expressive, long-form, multi-speaker conversational audio such as podcasts. Unlike traditional TTS systems, it excels in scalability, speaker consistency, and natural turn-taking for up to 90 minutes of continuous speech with as many as four distinct speakers. A key innovation is its use of continuous acoustic and semantic speech tokenizers operating at an ultra-low frame rate of 7.5 Hz, enabling high audio fidelity with efficient processing of long sequences. ...
    Downloads: 19 This Week
  • 9
    SysDVR

    Stream Switch games to your PC via USB or network

    SysDVR is a sysmodule that captures the running game's output and sends it to a PC over a USB or network connection. It is cross-platform and can stream to Windows, Mac and Linux, via USB or Wi-Fi. Video quality is fixed at 720p @ 30fps with H.264 compression, which is a hardware limit. Audio quality is fixed at uncompressed 16-bit PCM @ 48kHz stereo. With an optimal setup latency is very low, and most games are playable.
    Downloads: 92 This Week
  • 10
    AudioMuse-AI

    AudioMuse-AI is an Open Source Dockerized environment

    AudioMuse-AI is an open-source system designed to automatically generate playlists and analyze music libraries using artificial intelligence and audio signal processing techniques. The platform runs locally in a Dockerized environment and performs detailed sonic analysis on audio files to understand characteristics such as tempo, mood, and acoustic similarity.
    Downloads: 5 This Week
  • 11
    EeveeSpotify

    A tweak to enhance Spotify experience

    EeveeSpotifyReborn is an unofficial modification for the Spotify mobile application that alters client-side behavior to unlock premium-like features without requiring a paid subscription. It operates by injecting changes into the Spotify app, making it interpret the user account as having premium access and enabling functionalities that are normally restricted. The project was developed through reverse engineering techniques, including analyzing application behavior and intercepting requests...
    Downloads: 49 This Week
  • 12
    EasyRPG Player

    RPG Maker 2000/2003 and EasyRPG games interpreter

    EasyRPG Player is a game interpreter for playing RPG Maker 2000, 2003, and EasyRPG games. It uses the LCF parser library (liblcf) to read RPG Maker game data. It aims to be a free (as in freedom) cross-platform RPG Maker 2000/2003 interpreter. The main goal is to play all games created with them as the original game interpreter (RPG_RT) does. This project is under development. FreeType2 for external font...
    Downloads: 3 This Week
  • 13
    Giada

    Your Hardcore Loop Machine.

    Giada is an open-source, minimalistic and hardcore music production tool. Designed for DJs, live performers, and electronic musicians. Build your performance in real time by layering audio tracks or MIDI events, driven by the main sequencer. Load samples from your crates and play them with a computer keyboard or a MIDI controller. Write songs from scratch or edit existing live recordings with the powerful Action Editor, for fine-tuned control.
    Downloads: 8 This Week
  • 14
    Audiomentations

    A Python library for audio data augmentation

    A Python library for audio data augmentation. Inspired by albumentations. Useful for deep learning. Runs on CPU. Supports mono audio and multichannel audio. Can be integrated in training pipelines in e.g. TensorFlow/Keras or PyTorch. Has helped people get world-class results in Kaggle competitions. Is used by companies making next-generation audio products. Mix in another sound, e.g. a background noise. Useful if your original sound is clean and you want to simulate an environment where...
    Downloads: 1 This Week
  • 15
    SuperCollider

    Audio server, programming language, and IDE for sound synthesis

    SuperCollider is a platform for audio synthesis and algorithmic composition, used by musicians, artists, and researchers working with sound. It is free and open source software available for Windows, macOS, and Linux. scsynth, a real-time audio server, forms the core of the platform. It features 400+ unit generators (“UGens”) for analysis, synthesis, and processing.
    Downloads: 5 This Week
  • 16
    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    ...As an open-source repository, AudioNotes provides developers or power users the opportunity to customize how audio is captured, stored, annotated, and replayed — e.g. adding playback speed control, export to standard formats, or synchronization between notes and audio timeline. It may support simple UI for starting/stopping recordings, writing or editing notes, and navigating through recorded sessions.
    Downloads: 0 This Week
  • 17
    SFML

    Simple and Fast Multimedia Library

    SFML provides a simple interface to the various components of your PC, to ease the development of games and multimedia applications. It is composed of five modules: system, window, graphics, audio and network. Discover their features in more detail in the tutorials and the API documentation. With SFML, your application can compile and run out of the box on the most common operating systems: Windows, Linux, macOS and soon Android & iOS. Pre-compiled SDKs for your favorite OS are available on...
    Downloads: 93 This Week
  • 18
    Quod Libet

    Music player and music library manager for Linux, Windows, and macOS

    Quod Libet is a cross-platform audio/music management program. It provides many ways to view your local library, and supports streaming audio and feeds (podcasts, etc). It has extremely flexible metadata editing and searching capabilities. With over 90 plugins included, you can extend and integrate with almost anything, or write your own! Ex Falso is a bare-bones tag editor with the same editing interface as Quod Libet. Quod Libet is a GTK+-based audio player written in Python, using the...
    Downloads: 3 This Week
  • 19
    Feishin

    A modern self-hosted music player

    Feishin is an open-source, self-hosted music player. It connects to a music server such as Navidrome or Jellyfin and provides a modern interface for browsing, searching, and playing a personal music library.
    Downloads: 27 This Week
  • 20
    Blur my Shell

    Extension that adds a blur look to different parts of the GNOME Shell

    Adds a blur look to different parts of the GNOME Shell, including the top panel, dash and overview. If you use Ubuntu, you may want to change the radius of the corner effect for Dash-to-Dock in preferences, else you may see unpleasantly mixed radii in your dock.
    Downloads: 12 This Week
  • 21
    yabridge

    A modern and transparent way to use Windows VST2, VST3 and CLAP plugin

    Yet Another way to use Windows audio plugins on Linux. Yabridge seamlessly supports using both 32-bit and 64-bit Windows VST2, VST3, and CLAP plugins in 64-bit Linux plugin hosts as if they were native plugins, with optional support for plugin groups to enable inter-plugin communication for VST2 plugins and quick startup times. Its modern concurrent architecture and focus on transparency allow yabridge to be both fast and highly compatible, while also staying easy to debug and maintain.
    Downloads: 43 This Week
  • 22
    LX Music Mobile

    Music software built with React Native

    ...The README clearly states that the UI and default behaviour aren’t especially geared toward new users — you’re encouraged to explore settings and adjust according to your preferences. It is open-source under the Apache-2.0 license, and has accumulated a solid community (14,600+ stars, 1,700+ forks). Because it supports custom sources (including non-official music platforms) the team includes disclaimers around copyright, clarifying that the project does not take responsibility for the legality or correctness of the audio data you play.
    Downloads: 14 This Week
  • 23
    OpenAI.fm

    Code for openai.fm, a demo for the OpenAI Speech API

    OpenAI.fm is an official interactive demo application built to showcase the OpenAI Speech API and its advanced text-to-speech capabilities, providing developers and creators with a hands-on web interface to convert text into high-quality, customizable audio using state-of-the-art TTS models. Developed using Next.js and the OpenAI Speech API, this demo illustrates how the latest neural voice models can produce natural, expressive speech with adjustable styles and voices, highlighting features...
    Downloads: 11 This Week
  • 24
    Oboe

    Oboe is a C++ library that makes it easy to build high-performance

    oboe is a C++ library for building high-performance audio apps on Android, providing a unified, low-latency API over AAudio and OpenSL ES. It abstracts device and API-version differences so developers can focus on audio processing instead of platform quirks. The library emphasizes minimal latency and glitch-free playback/recording via tuned buffer strategies and callback-driven I/O. It supports features like floating-point audio, channel configuration, sample-rate negotiation, and stream...
    Downloads: 2 This Week
  • 25
    PolarDB for PostgreSQL

    A cloud-native database based on PostgreSQL developed by Alibaba Cloud

    PolarDB for PostgreSQL is Alibaba Cloud's cloud-native, distributed version of PostgreSQL designed for high availability, scalability, and performance. It enhances standard PostgreSQL with features like shared storage, compute-storage separation, and parallel processing. PolarDB supports cloud-native workloads, offering enterprise-grade capabilities while maintaining PostgreSQL compatibility.
    Downloads: 1 This Week