Python Sound/Audio Software

View 115 business solutions

Browse free open source Python Sound/Audio Software and projects below. Use the toggles on the left to filter open source Python Sound/Audio Software by OS, license, language, programming language, and project status.

  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    LAME (Lame Aint an MP3 Encoder)

    LAME (Lame Aint an MP3 Encoder)

    A high quality MP3 encoder

    LAME is an educational tool to be used for learning about MP3 encoding. The goal of the LAME project is to improve the psycho acoustics, quality and speed of MP3 encoding. Note: we provide source code only!
    Leader badge
    Downloads: 20,918 This Week
    Last Update:
    See Project
  • 2
    spotDL

    spotDL

    Download your Spotify playlists and songs along with album art

    spotDL is a command-line tool that allows users to download songs and playlists from Spotify by sourcing the audio from YouTube. Built in Python, it automatically matches Spotify tracks with corresponding videos on YouTube and downloads them with embedded metadata. The tool retrieves important information such as album art, song titles, artist names, and lyrics to organize downloaded files. spotDL is designed to be fast, accurate, and easy to use through a simple command-line interface. It supports downloading individual songs, albums, or entire playlists by providing Spotify URLs or search queries. By combining Spotify metadata with YouTube audio sources, spotDL provides a convenient way to build a local music library.
    Downloads: 103 This Week
    Last Update:
    See Project
  • 3
    savonet
    Audio toolkit for streaming, aiming at complex Internet radio systems. Our stream generator is highly extensible and flexible. Script based, it can be a fully automatic daemon streamer, but is also used for live shows with an user-friendly GUI.
    Leader badge
    Downloads: 419 This Week
    Last Update:
    See Project
  • 4
    Spleeter

    Spleeter

    Deezer source separation library including pretrained models

    Spleeter is the Deezer source separation library with pretrained models written in Python and using Tensorflow. It makes it easy to train music source separation models (assuming you have a dataset of isolated sources), and provides already trained state of the art models for performing various flavours of separation. 2 stems and 4 stems models have state of the art performances on the musdb dataset. Spleeter is also very fast as it can perform separation of audio files to 4 stems 100x faster than real-time when run on a GPU. We designed Spleeter so you can use it straight from command line as well as directly in your own development pipeline as a Python library. It can be installed with Conda, with pip or be used with Docker.
    Downloads: 52 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    EasyABC

    EasyABC

    EasyABC is an open source ABC editor

    EasyABC allows the user to create, edit, view, play, convert music written in the ABC music notation language. The program was originally written in Python 2.7 and WxPython by Nils Liberg and runs on Windows, OSX, and Linux. Jan Wybren de Jong has converted to run on Python 3.8 or higher. Frédéric Aupépin has been supporting EasyABC on OSX. EasyABC depends upon other external programs like abc2midi, abcm2ps, fluidsynth. If you install the Windows or Mac executables most of these programs are automatically included.
    Leader badge
    Downloads: 238 This Week
    Last Update:
    See Project
  • 6
    Spatial Media

    Spatial Media

    Specifications and tools for 360º video and spatial audio

    spatial-media provides tools for working with spherical video and spatial audio metadata so players and platforms can correctly render immersive media. The utilities inject, inspect, and extract metadata in common container formats (MP4/WebM) to signal 360° projection type, stereoscopy mode, and spatial audio layout. Creators use it to prepare 360/VR180 assets for upload so services know whether a video is monoscopic, top-bottom stereo, or side-by-side, and whether ambisonic audio is present. By separating metadata handling from encoding, the tools slot into existing editing/export workflows without requiring specialized NLE support. They help ensure assets play back with the right projection, field of view, and head-tracked audio in compliant players. For archive and validation use cases, the scripts can also read existing flags to diagnose why a video isn’t being recognized as immersive.
    Downloads: 37 This Week
    Last Update:
    See Project
  • 7
    AudioCraft

    AudioCraft

    Audiocraft is a library for audio processing and generation

    AudioCraft is a PyTorch library for text-to-audio and text-to-music generation, packaging research models and tooling for training and inference. It includes MusicGen for music generation conditioned on text (and optionally melody) and AudioGen for text-conditioned sound effects and environmental audio. Both models operate over discrete audio tokens produced by a neural codec (EnCodec), which acts like a tokenizer for waveforms and enables efficient sequence modeling. The repo provides inference scripts, checkpoints, and simple Python APIs so you can generate clips from prompts or incorporate the models into applications. It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 8
    VCClient

    VCClient

    Software that uses AI to perform real-time voice conversion

    VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a graphical user interface and API access, making it suitable for casual users as well as developers who want to integrate voice transformation into their own applications. The project also supports GPU acceleration, enabling faster inference and smoother real-time performance on compatible hardware. Additionally, it includes tools for training and managing voice models, giving users the ability to create personalized voice profiles.
    Downloads: 19 This Week
    Last Update:
    See Project
  • 9
    HuMidi

    HuMidi

    Play MIDI like a human in ROBLOX with automatic sustain pedals.

    HuMidi is a universal piano auto player for ROBLOX. It plays even the most generic MIDI file with great depth, thanks to the humanization and automatic sustain pedal generation algorithms. The pedals are generated with a thorough analysis of the MIDI data, adding more depth to your performance that no other MIDI players could! Personally have been tested in: - Visual Pianos - Starving Pianists - Digital Piano - Piano's Got Talent This tool is universal, as long as the piano in-game allows to be played on a keyboard.
    Downloads: 255 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    SpeechRecognition

    SpeechRecognition

    Speech recognition module for Python

    Library for performing speech recognition, with support for several engines and APIs, online and offline. Recognize speech input from the microphone, transcribe an audio file, save audio data to an audio file. Show extended recognition results, calibrate the recognizer energy threshold for ambient noise levels (see recognizer_instance.energy_threshold for details). Listening to a microphone in the background, various other useful recognizer features. The easiest way to install this is using pip install SpeechRecognition. The first software requirement is Python 2.6, 2.7, or Python 3.3+. This is required to use the library. PyAudio is required if and only if you want to use microphone input (Microphone). PyAudio version 0.2.11+ is required, as earlier versions have known memory management bugs when recording from microphones in certain situations. To hack on this library, first make sure you have all the requirements listed in the "Requirements" section.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 11
    GNU Solfege
    GNU Solfege is *free* ear training software written in Python 3.4 using the Gtk+ 3 toolkit. The program is designed to be easily extended with lesson files (data files), so the user can create new exercises.
    Leader badge
    Downloads: 64 This Week
    Last Update:
    See Project
  • 12
    SoulSync

    SoulSync

    Automated Music Discovery and Collection Manager

    SoulSync is an intelligent music discovery and automation platform designed to bridge streaming services with self-hosted media libraries, enabling users to automatically grow and maintain curated music collections. The system continuously monitors selected artists and detects new releases, then generates personalized playlists such as Release Radar and Discovery Weekly using its built-in recommendation logic. It can automatically download missing tracks from multiple sources including Soulseek, YouTube, and Beatport, then verify file accuracy through AcoustID fingerprinting to ensure the correct audio was obtained. The platform also enriches tracks with lyrics, album art, and MusicBrainz metadata while organizing files into clean, customizable folder structures. Once processed, the library can be synchronized with media servers like Plex, Jellyfin, or Navidrome so collections stay up to date without manual intervention.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 13
    Quod Libet

    Quod Libet

    Music player and music library manager for Linux, Windows, and macOS

    Quod Libet is a cross-platform audio/music management program. It provides many ways to view your local library, and supports streaming audio and feeds (podcasts, etc). It has extremely flexible metadata editing and searching capabilities. With over 90 plugins included, you can extend and integrate with almost anything, or write your own! Ex Falso is a bare-bones tag editor with the same editing interface as Quod Libet. Quod Libet is a GTK+-based audio player written in Python, using the Mutagen tagging library. It’s designed around the idea that you know how to organize your music better than we do. It lets you make playlists based on regular expressions (don’t worry, regular searches work too). It lets you display and edit any tags you want in the file, for all the file formats it supports. Unlike some, Quod Libet will scale to libraries with tens of thousands of songs. It also supports most of the features you’d expect from a modern media player.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 14
    Spotify Music Downloader

    Spotify Music Downloader

    Spotify Music Downloader

    Download music from Spotify and other music sources.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 15
    Convert text to International Morse Code. Input is ASCII text. Output can be: - . -..- - on the console, raw 8bit PCM suitable for piping to /dev/audio, .wav files or even (mp3|ogg). Good for headlines on your MP3 player or code practice.
    Leader badge
    Downloads: 50 This Week
    Last Update:
    See Project
  • 16
    Curlew Multimedia Converter

    Curlew Multimedia Converter

    Easy to use Multimedia Converter for Linux

    Leader badge
    Downloads: 52 This Week
    Last Update:
    See Project
  • 17
    PyTube Downloader

    PyTube Downloader

    Let's quickly download YouTube videos & playlists with one click.

    PyTube Downloader lets you quickly download YouTube videos and playlists with one click. Choose from 144p to 8K quality and download multiple videos simultaneously. PyTube Downloader 让你轻松一键下载 YouTube 视频和播放列表。支持选择 144p 到 8K 的质量,并支持同时下载多个视频。
    Downloads: 111 This Week
    Last Update:
    See Project
  • 18

    StreamTuner2 ♪♬#

    Internet radio directory browser

    Streamtuner2 is an internet radio station and video browser. It simply lists stations in categories from different directories. Launches your preferred media apps for playback. It's built in Python now, but retains UI similarity with the original StreamTuner 0.99
    Leader badge
    Downloads: 57 This Week
    Last Update:
    See Project
  • 19
    Radio Tray is a streaming player for listening to online radios. Any bug or feature request should be reported in https://bitbucket.org/carlmig/radio-tray/issues
    Downloads: 26 This Week
    Last Update:
    See Project
  • 20
    This program, called "rebuild_db", allows the iPod shuffle to be used without iTunes. It rebuilds the iPod shuffle's databases based on the actual content of the iPod's file system, without restrictions regarding the directory structure.
    Downloads: 35 This Week
    Last Update:
    See Project
  • 21
    FF Multi Converter
    FF Multi Converter is a simple graphical application which enables you to convert audio, video, image and document files between all popular formats, using and combining other programs. It uses ffmpeg for audio/video files, unoconv for document files and ImageMagick for image file conversions.
    Downloads: 40 This Week
    Last Update:
    See Project
  • 22
    NovaSR

    NovaSR

    A lightning fast audio upsampler

    NovaSR is an extremely lightweight and high-performance audio upsampling model that transforms low-quality 16 kHz audio into clearer, high-fidelity 48 kHz audio with remarkable speed and efficiency. At only about 50 KB in size, the model is orders of magnitude smaller than typical audio super-resolution networks, yet it achieves high quality and realtime performance thanks to its compact architecture and efficient convolutional design. NovaSR is especially valuable for post-processing tasks in speech enhancement, TTS pipelines, and dataset restoration where low sampling rates degrade perceived audio clarity; the minimal model size also makes it suitable for edge and embedded use cases where memory is at a premium. Its performance can reach thousands of times realtime on modern GPUs, allowing massive audio batches to be processed with negligible compute overhead.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 23
    Tauon

    Tauon

    The music player of today

    Tauon is a modern, streamlined music player app that's packed with features! An emphasis on playlists and drag-and-drop importing puts you in control of your music library. Faded volume control, 24-bit FLAC support, and gapless playback provide the ultimate listening experience. Excellent CUE sheet support, an original smart playlist system, and network playback from koel or Airsonic servers. Last.fm, Listenbrainz, and Maloja scribbling. MPRIS2 support for desktop integration. Tauon is a playlist-oriented music player that makes the assumption that folders are albums. (You may find the words "album" and "folder" used interchangeably within the interface and documentation) While some features and interface elements may not be immediately intuitive, I hope that once you are familiar, you will find that Tauon is pleasant, fast, and fun to use.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    PyKaraoke is a cross-platform karaoke player. It currently supports CDG (MP3+G, OGG+G, WAV+G), MIDI (.KAR, .MID) and MPEG formats.
    Downloads: 24 This Week
    Last Update:
    See Project
  • 25
    Internet DJ Console

    Internet DJ Console

    A feature packed DJ console and internet radio client for Linux users

    Conceived as an internet radio Shoutcast/Icecast client and DJ console IDJC has two main media players, a background track player, effects buttons, crossfader, webm, aac, ogg, and mp3 streaming, stream automation timers, aux input, voice and VoIP integration. Media file formats include: mp3, ogg, flac, wma, wav, m4a, m3u, xspf, pls, and cue sheet support, IRC track and station announcements, uses jack audio connection kit to provide a flexible audio chain. This list of features is by no means complete. More info at the project homepage.
    Downloads: 14 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB