Showing 6395 open source projects for "audio"

View related business solutions
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    MLX-Audio

    MLX-Audio

    A text-to-speech, speech-to-text and speech-to-speech library

    ...The project provides a straightforward CLI (mlx_audio.tts.generate) as well as a Python API for programmatic generation of audio, including parameters for voice choice, speed, language hints, output format, and sample rate. It includes examples such as audiobook generation to demonstrate long-form synthesis and joined audio segments. On top of that, MLX-Audio offers a modern web interface powered by FastAPI, with real-time waveform and 3D visualizations, file upload, and audio management.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    Kimi-Audio

    Kimi-Audio

    Audio foundation model excelling in audio understanding

    Kimi-Audio is an ambitious open-source audio foundation model designed to unify a wide array of audio processing tasks — from speech recognition and audio understanding to generative conversation and sound event classification — within a single cohesive architecture. Instead of fragmenting work across specialized models, Kimi-Audio handles automatic speech recognition (ASR), audio question answering, automatic audio captioning, speech emotion recognition, and audio-to-text chat in one system, enabling developers to build rich, multimodal audio applications without stitching together disparate components. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    Qwen2-Audio

    Qwen2-Audio

    Repo of Qwen2-Audio chat & pretrained large audio language model

    Qwen2-Audio is a large audio-language model by Alibaba Cloud, part of the Qwen series. It is trained to accept various audio signal inputs (including speech, sounds, etc.) and perform both voice chat and audio analysis, producing textual responses. It supports two major modes: Voice Chat (interactive voice only input) and Audio Analysis (audio + text instructions), with both base and instruction-tuned models.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Qwen-Audio

    Qwen-Audio

    Chat & pretrained large audio language model proposed by Alibaba Cloud

    Qwen-Audio is a large audio-language model developed by Alibaba Cloud, built to accept various types of audio input (speech, natural sounds, music, singing) along with text input, and output text. There is also an instruction-tuned version called Qwen-Audio-Chat which supports conversational interaction (multi-round), audio + text input, creative tasks and reasoning over audio.
    Downloads: 3 This Week
    Last Update:
    See Project
  • Fully Managed MySQL, PostgreSQL, and SQL Server Icon
    Fully Managed MySQL, PostgreSQL, and SQL Server

    Automatic backups, patching, replication, and failover. Focus on your app, not your database.

    Cloud SQL handles your database ops end to end, so you can focus on your app.
    Try Free
  • 5
    Fun Audio Chat

    Fun Audio Chat

    Large Audio Language Model built for natural interactions

    Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 6
    OBS Studio

    OBS Studio

    Open source software for live streaming and recording

    OBS Studio, also known as Open Broadcaster Software, is a free and open source software program for live streaming and video recording. Features of the software include device/source capture, recording, encoding and broadcasting. Stream on Windows, Mac or Linux. This software is commonly used by video game streamers on the popular streaming platform Twitch.
    Downloads: 254 This Week
    Last Update:
    See Project
  • 7
    OpenVINO AI Plugins for Audacity

    OpenVINO AI Plugins for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity

    A set of AI-enabled effects, generators, and analyzers for Audacity. These AI features run 100% locally on your PC, no internet connection is necessary. OpenVINO™ is used to run AI models on supported accelerators found on the user's system such as CPU, GPU, and NPU.
    Downloads: 123 This Week
    Last Update:
    See Project
  • 8
    Ultimate Vocal Remover (UVR5)

    Ultimate Vocal Remover (UVR5)

    GUI for a Vocal Remover that uses Deep Neural Networks

    This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).
    Downloads: 846 This Week
    Last Update:
    See Project
  • 9
    LosslessCut

    LosslessCut

    The swiss army knife of lossless video/audio editing

    LosslessCut aims to be the ultimate cross platform FFmpeg GUI for extremely fast and lossless operations on video, audio, subtitle and other related media files. The main feature is lossless trimming and cutting of video and audio files, which is great for saving space by rough-cutting your large video files taken from a video camera, GoPro, drone, etc. It lets you quickly extract the good parts from your videos and discard many gigabytes of data without doing a slow re-encode and thereby losing quality. ...
    Downloads: 134 This Week
    Last Update:
    See Project
  • Build Securely on Azure with Proven Frameworks Icon
    Build Securely on Azure with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 10
    Cider App

    Cider App

    A new cross-platform Apple Music experience based on Electron and Vue

    ...Whether it be Discord, LastFM, or even equalizers we've got you covered. Discord & Last.fm Integration. Quickly share and show others what you're listening to; right out of the box. Audio Enhancements. Audio Spatialization, Adrenaline Processor™, and Equalizers are all available and actively engineered by our Audio Engineer, Maikiwi. Remote Controllable. We know how irritating it is to Alt+Tab constantly while working or gaming, and we have a remote app just for that. Cider is available on a range of platforms, view our downloads page to find out if your device is supported!
    Downloads: 107 This Week
    Last Update:
    See Project
  • 11
    NeuralNote

    NeuralNote

    Audio Plugin for Audio to MIDI transcription using deep learning

    NeuralNote is an open-source audio software tool designed to convert recorded audio into MIDI data using modern machine learning techniques. The software functions as an audio plugin that can be used inside digital audio workstations as well as a standalone application for music production and analysis. Its main purpose is to perform audio-to-MIDI transcription, allowing musicians to record a performance and automatically transform it into editable MIDI notes. ...
    Downloads: 62 This Week
    Last Update:
    See Project
  • 12
    LTX-2.3

    LTX-2.3

    Official Python inference and LoRA trainer package

    LTX-2.3 is an open-source multimodal artificial intelligence foundation model developed by Lightricks for generating synchronized video and audio from prompts or other inputs. Unlike most earlier video generation systems that only produced silent clips, LTX-2 combines video and audio generation in a unified architecture capable of producing coherent audiovisual scenes. The model uses a diffusion-transformer-based architecture designed to generate high-fidelity visual frames while simultaneously producing corresponding audio elements such as speech, music, ambient sound, or effects. ...
    Downloads: 195 This Week
    Last Update:
    See Project
  • 13
    Strawberry Music Player

    Strawberry Music Player

    Strawberry Music Player

    Strawberry is a cross-platform music player and music collection organizer. It is aimed at music collectors and audiophiles. With Strawberry you can play and manage your digital music collection, or stream your favorite radios. Strawberry is a music player and music collection organizer. It is aimed at music collectors and audiophiles. With Strawberry you can play and manage your digital music collection, or stream your favorite radios. Strawberry is free software released under GPL. The...
    Downloads: 109 This Week
    Last Update:
    See Project
  • 14
    spotDL

    spotDL

    Download your Spotify playlists and songs along with album art

    ...It supports downloading individual songs, albums, or entire playlists by providing Spotify URLs or search queries. By combining Spotify metadata with YouTube audio sources, spotDL provides a convenient way to build a local music library.
    Downloads: 114 This Week
    Last Update:
    See Project
  • 15
    Snapcast

    Snapcast

    Synchronous multiroom audio player

    Snapcast is a multiroom client-server audio player, where all clients are time synchronized with the server to play perfectly synced audio. It's not a standalone player, but an extension that turns your existing audio player into a Sonos-like multiroom solution. Audio is captured by the server and routed to the connected clients. Several players can feed audio to the server in parallel and clients can be grouped to play the same audio stream. ...
    Downloads: 33 This Week
    Last Update:
    See Project
  • 16
    Spotube

    Spotube

    Open source Spotify client that doesn't require Premium

    An open source, cross-platform Spotify client compatible across multiple platforms utilizing Spotify's data API and YouTube, Piped video or JioSaavn as an audio source, eliminating the need for Spotify Premium. It is still recommended to support creators by engaging with their YouTube channels/Spotify tracks (or preferably by buying their merch/concert tickets/physical media).
    Downloads: 86 This Week
    Last Update:
    See Project
  • 17
    Audio Share

    Audio Share

    Audio Share can share Windows/Linux computer's audio to Android phone

    Audio Share can share Windows/Linux computer's audio to Android phone over network, so your phone becomes the speaker of computer. (You needn't buy a new speaker😄.) https://github.com/mkckr0/audio-share
    Leader badge
    Downloads: 240 This Week
    Last Update:
    See Project
  • 18
    Namida

    Namida

    A Beautiful and Feature-rich Music & Video Player

    ...It scans device or desktop storage to build an organized library from folders and tags, presenting artists, albums, and tracks with artwork and rich metadata. The player emphasizes smooth navigation and playlist workflows, enabling favorites, custom lists, and quick queueing for flexible listening sessions. Audio features commonly include gapless playback and adjustable options aimed at uninterrupted, consistent sound. Its UI aims to be lightweight and visually coherent, keeping controls accessible without cluttering the screen. Namida’s architecture favors reliability and predictable performance across large libraries, minimizing lag as collections grow. ...
    Downloads: 266 This Week
    Last Update:
    See Project
  • 19
    CAVA

    CAVA

    Cross-platform Audio Visualizer

    Turn your phone or tablet into a fantastic audio viewer. Easy to use and customize. Choose from several preset settings of incredible colors or create your own. CAVA is a bar spectrum audio viewer based on my own open source project with the same name. Take the audio from the device's microphone and visualize the amplitude of the different frequencies as bars on the screen. Each bar represents a certain bandwidth of low to high frequencies.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 20
    yt-dlp-gui

    yt-dlp-gui

    A cross-platform GUI wrapper for yt-dlp written in PySide6

    ...The project supports preset definitions and global arguments through a config file, so users can customize their most common download workflows—like audio extraction, quality ranking, or embedding thumbnails—without retyping arguments each time. Downloads can be initiated from a portable app bundle or run manually with Python, making it flexible across platforms including Windows and Linux.
    Downloads: 280 This Week
    Last Update:
    See Project
  • 21
    BlockTheSpot

    BlockTheSpot

    Video, audio & banner adblock/skip for Spotify

    BlockTheSpot is an open-source modification tool designed to block advertisements and bypass certain limitations in the Spotify desktop application for Windows. It works by patching the Spotify client and injecting custom logic that prevents audio, video, and banner ads from being displayed or played during usage. The tool focuses primarily on performance and ad suppression rather than cosmetic changes, ensuring that playback remains uninterrupted. It includes configurable options that allow users to enable or disable features dynamically, often through developer settings or configuration files. ...
    Downloads: 144 This Week
    Last Update:
    See Project
  • 22
    Librosa

    Librosa

    Python library for audio and music analysis

    Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23
    YesPlayMusic

    YesPlayMusic

    Play Music for Windows / macOS / Linux

    High-quality third-party NetEase cloud player, supports Windows / macOS / Linux :electron. Overseas users can play directly (need to log in to NetEase Cloud account). Support UnblockNeteaseMusic, automatically replace grayed out song links with various audio sources (web version not supported). Various audio sources" refers to audio sources that are enabled by default. YouTube audio source needs to install yt-dlp by itself. Daily automatic check-in (check-in on mobile and computer at the same time) Light/Dark Mode auto switch. Support Touch Bar. Support PWA, you can click ➕ on the right side of the address bar in Chrome/Edge to install it to your computer. ...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 24
    HandBrake

    HandBrake

    A open source video to convert video from any format to modern codecs

    HandBrake is an open-source, GPL-licensed, multiplatform, multithreaded video transcoder, available for MacOS X, Linux and Windows.
    Downloads: 177 This Week
    Last Update:
    See Project
  • 25
    NovaSR

    NovaSR

    A lightning fast audio upsampler

    NovaSR is an extremely lightweight and high-performance audio upsampling model that transforms low-quality 16 kHz audio into clearer, high-fidelity 48 kHz audio with remarkable speed and efficiency. At only about 50 KB in size, the model is orders of magnitude smaller than typical audio super-resolution networks, yet it achieves high quality and realtime performance thanks to its compact architecture and efficient convolutional design.
    Downloads: 6 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB