Showing 6596 open source projects for "audio linux"

View related business solutions
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • Application Monitoring That Won't Slow Your App Down Icon
    Application Monitoring That Won't Slow Your App Down

    AppSignal's Rust-based agent is lightweight and stable. Already running in thousands of production apps.

    Full APM with errors, performance, logs, and uptime monitoring. 99.999% uptime SLA on the platform itself.
    Start Free
  • 1
    Real-Time Voice Cloning

    Real-Time Voice Cloning

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    Real-Time Voice Cloning is an influential deep-learning repository that demonstrates how to clone a voice from just a few seconds of audio and then generate arbitrary speech in that voice in near real time. It implements the SV2TTS pipeline (“Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis”) in three stages: a speaker encoder, a synthesizer, and a vocoder. In the first stage, short audio clips are converted into a fixed-dimensional speaker embedding that...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 2
    Amazon Chime SDK for JavaScript

    Amazon Chime SDK for JavaScript

    A JavaScript client library for integrating multi-party communications

    The Amazon Chime SDK is a set of real-time communications components that developers can use to quickly add messaging, audio, video, and screen sharing capabilities to their web or mobile applications. Developers can build on AWS's global communications infrastructure to deliver engaging experiences in their applications. For example, they can add video to a health application so patients can consult remotely with doctors on health issues, or create customized audio prompts for integration...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 3
    Markdownify MCP Server

    Markdownify MCP Server

    Convert files and web content into clean, usable Markdown easily

    Markdownify MCP is a Model Context Protocol server that converts many types of files and web content into clean Markdown. It supports formats such as PDFs, images, audio with transcription, DOCX, XLSX, and PPTX, along with web sources like YouTube transcripts, Bing results, and general webpages. Markdownify MCP is designed to simplify content extraction and make data easier to read, share, and reuse in structured workflows. Developers can install dependencies, build, and run the server...
    Downloads: 12 This Week
    Last Update:
    See Project
  • 4
    BizHawk

    BizHawk

    BizHawk is a multi-system emulator written in C#

    A multi-system emulator written in C#. As well as quality-of-life features for casual players, it also has recording/playback and debugging tools, making it the first choice for TASers (Tool-Assisted Speedrunners). Screenshotting and recording audio + video to file. Firmware management, input, framerate, and more in a HUD over the game. Rebindable hotkeys for controlling the frontend (keyboard+mouse+gamepad). A comprehensive input mapper for the emulated gamepads and other peripherals....
    Downloads: 51 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 5
    SDL

    SDL

    Simple DirectMedia Layer

    SDL (Simple DirectMedia Layer) is a cross-platform multimedia development library designed to provide low-level access to hardware components such as graphics, audio, input devices, and system resources, making it a foundational tool for building games, emulators, and interactive applications. It abstracts platform-specific functionality into a consistent API, allowing developers to write code once and deploy it across multiple operating systems including Windows, macOS, Linux, iOS, and Android. ...
    Downloads: 40 This Week
    Last Update:
    See Project
  • 6
    yami

    yami

    An open-source music player with simple UI

    Yami is a lightweight, open-source music player built in Python. It focuses on simplicity and ease of use, providing an intuitive user interface (UI) for users to manage and play their music. Whether you're playing local files or downloading from online sources using spotdl, Yami offers a seamless experience. This project is designed for users who want a minimalistic, cross-platform music player with the ability to integrate external sources like Spotify/YouTube Music.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    bfxr

    bfxr

    Flash + AIR sound effects generator. Based on Sfxr.

    The bfxr project by increpare is a sound-effects generator tool originally built using Flash + AIR, based on the earlier Sfxr project. Its purpose is to enable users, especially game developers and sound designers, to quickly generate retro, 8-bit/“chiptune” style sound effects (“bleeps”, “booms”, “zaps”, etc.) without deep knowledge of audio signal processing. It offers an interactive GUI through which you can tweak many parameters (oscillators, envelopes, filters, etc.) to sculpt custom...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 8
    DistroAV

    DistroAV

    DistroAV (formerly OBS-NDI): NDI integration for OBS Studio

    ...The plugin works with the NDI runtime and supports modern versions of OBS Studio across Windows, macOS, and Linux, enabling streamers and production setups to leverage low-latency, high-quality media over local networks for multi-device collaboration or extended setups. DistroAV’s development community focuses on maintaining compatibility with the latest OBS releases and NDI v6 tooling while addressing bugs and extending feature requests.
    Downloads: 58 This Week
    Last Update:
    See Project
  • 9
    Voice-Pro

    Voice-Pro

    Comprehensive Gradio WebUI for audio processing

    Voice-Pro is the best gradio WebUI for transcription, translation and text-to-speech. It can be easily installed with one click. Create a virtual environment using Miniconda, running completely separate from the Windows system (fully portable). Supports real-time transcription and translation, as well as batch mode.
    Downloads: 32 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    Fish Speech

    Fish Speech

    SOTA Open Source TTS

    Fish Speech is a state-of-the-art open-source text-to-speech project that has evolved into the OpenAudio series of advanced TTS models. The repository hosts the code and tooling for training, fine-tuning, and serving high-quality TTS, while the current flagship models (OpenAudio-S1 and S1-mini) are distributed via Fish Audio’s playground and Hugging Face. The models are evaluated with Seed TTS metrics and achieve exceptionally low word and character error rates, indicating strong...
    Downloads: 26 This Week
    Last Update:
    See Project
  • 11
    WavTokenizer

    WavTokenizer

    SOTA discrete acoustic codec models with 40/75 tokens per second

    WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures. The model uses a single-quantizer design together with temporal compression to...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12
    Linux Studio Plugins Project

    Linux Studio Plugins Project

    Linux Studio Plugins Project

    LSP (Linux Studio Plugins) is a collection of open-source plugins currently compatible with LADSPA, LV2 and LinuxVST formats. Standalone plugins for JACK are provided since version 1.0.8. Experimental support of ARMv7 added since version 1.1.4 Experimental support of AArch64 added since version 1.1.9 Decomposition of modules and new UI introduced in 1.2.0 Added CLAP support in 1.2.5 Added VST3 support in 1.2.15 The basic idea is to fill the lack of good and useful plugins under the...
    Leader badge
    Downloads: 162 This Week
    Last Update:
    See Project
  • 13
    CLIAMP

    CLIAMP

    Terminal music player inspired by winamp

    CLIAMP is a command-line tool designed to enhance developer productivity by providing a streamlined interface for managing and interacting with projects directly from the terminal. It focuses on simplifying repetitive development tasks by offering a structured and scriptable environment where commands can be composed and reused efficiently. The tool emphasizes minimalism and speed, allowing developers to execute workflows without leaving the command-line environment. It is particularly...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 14
    Note67

    Note67

    A private, local meeting notes assistant

    note67 is a private, local meeting notes assistant application that combines audio capture, transcription, and AI-powered summarization to help users document conversations and meetings on their own devices without relying on cloud services. Built with a cross-platform architecture using Rust (via Tauri) for backend logic and a TypeScript/React frontend, it prioritizes privacy by performing audio transcription locally with Whisper models and generating summaries with locally-hosted AI,...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 15
    Unrud Video Downloader

    Unrud Video Downloader

    Download videos from websites like YouTube and many others

    Video Downloader is a desktop application designed to simplify the process of downloading videos from various online platforms through a user-friendly graphical interface. Built on top of yt-dlp, it abstracts the complexity of command-line tools and provides an accessible way for users to retrieve video and audio content. The application supports a wide range of features, including downloading entire playlists, handling private or password-protected content, and automatically selecting optimal formats based on user preferences. It also allows users to convert videos into audio files such as MP3, making it useful for media extraction workflows. ...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 16
    OpenVoice

    OpenVoice

    Instant voice cloning by MIT and MyShell. Audio foundation model

    OpenVoice is a versatile instant voice cloning system that can replicate a speaker’s tone color from just a short audio clip and then generate speech in multiple languages. It is designed not only to match the timbre of the reference voice, but also to give granular control over style parameters such as emotion, accent, rhythm, pauses, and intonation. The model supports cross-lingual and even zero-shot cross-lingual voice cloning, so a speaker recorded in one language can be made to speak...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 17
    Furnace

    Furnace

    A multi-system chiptune tracker compatible with DefleMask modules

    Furnace is a powerful multi-system chiptune tracker that enables users to compose music using the sound chips of classic computers, consoles, and arcade hardware. It supports an extensive range of audio chips, including FM synthesis, wavetable synthesis, and sample-based systems, making it one of the most versatile trackers available. The software is compatible with multiple operating systems and can be used both as a standalone application and as a development tool for retro-style audio...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 18
    OpenAudible

    OpenAudible

    Audiobook Manager for Audible Users

    OpenAudible is a cross-platform audiobook manager designed for Audible users. Manage/Download all your audiobooks with this easy-to-use desktop application. Say goodbye to the hassle of managing your audiobooks across multiple devices. With OpenAudible, you can easily download, view, and manage all your Audible books in one place. Our lightning-fast conversion to MP3 and M4B audio formats makes it easy to enjoy your favorite books on any device. Plus, our automation features make it a breeze...
    Downloads: 18 This Week
    Last Update:
    See Project
  • 19
    WanGP

    WanGP

    AI video generator optimized for low VRAM and older GPUs use

    Wan2GP is an open source AI video generation toolkit designed to make modern generative models accessible on consumer-grade hardware with limited GPU memory. It acts as a unified interface for running multiple video, image, and audio generation models, including Wan-based models as well as other systems like Hunyuan Video, Flux, and Qwen. A key focus of the project is reducing VRAM requirements, enabling some workflows to run on as little as 6 GB while still supporting older Nvidia and...
    Downloads: 40 This Week
    Last Update:
    See Project
  • 20
    AudioMuse-AI

    AudioMuse-AI

    AudioMuse-AI is an Open Source Dockerized environment

    AudioMuse-AI is an open-source system designed to automatically generate playlists and analyze music libraries using artificial intelligence and audio signal processing techniques. The platform runs locally in a Dockerized environment and performs detailed sonic analysis on audio files to understand characteristics such as tempo, mood, and acoustic similarity. By analyzing the underlying audio content rather than relying on external metadata services, the system can organize large personal...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 21
    HunyuanVideo-Avatar

    HunyuanVideo-Avatar

    Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

    HunyuanVideo-Avatar is a multimodal diffusion transformer (MM-DiT) model by Tencent Hunyuan for animating static avatar images into dynamic, emotion-controllable, and multi-character dialogue videos, conditioned on audio. It addresses challenges of motion realism, identity consistency, and emotional alignment. Innovations include a character image injection module, an Audio Emotion Module for transferring emotion cues, and a Face-Aware Audio Adapter to isolate audio effects on faces,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Lumos Engine

    Lumos Engine

    Cross-Platform C++ 2D/3D game engine

    Cross-platform 2D and 3D Game Engine written in C++ that supports both OpenGL and Vulkan. Support for Windows, Linux, macOS. Support for OpenGL/Vulkan. D audio using OpenAL. Rendering 3D models with PBR shading. Debug gui using ImGui 3D collision detection - cuboid/sphere/pyramid. 2D collision detection - Box2D. Basic lua scripting support.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 23
    TorchAudio

    TorchAudio

    Data manipulation and transformation for audio signal processing

    The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    Text Generation Web UI

    Text Generation Web UI

    Oobabooga - The definitive Web UI for local AI, with powerful features

    A gradio web UI for running Large Language Models like LLaMA, llama.cpp, GPT-J, Pythia, OPT, and GALACTICA. Dropdown menu for switching between models. Notebook mode that resembles OpenAI's playground. Chat mode for conversation and role playing. Instruct mode compatible with Alpaca and Open Assistant formats. Nice HTML output for GPT-4chan. Markdown output for GALACTICA, including LaTeX rendering. Custom chat characters. Advanced chat features (send images, get audio responses with TTS)....
    Downloads: 72 This Week
    Last Update:
    See Project
  • 25
    JSNES

    JSNES

    A JavaScript NES emulator

    JSNES is a JavaScript-based emulator that replicates the functionality of the Nintendo Entertainment System (NES), enabling classic games to run directly in web browsers or Node.js environments. It implements the core components of NES hardware, including the CPU, graphics processing unit, and audio system, to deliver an accurate emulation experience. The project is designed as a library, allowing developers to embed emulation capabilities into web applications or custom interfaces. It...
    Downloads: 6 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB