Showing 6597 open source projects for "audio linux"

View related business solutions
  • AI-powered service management for IT and enterprise teams Icon
    AI-powered service management for IT and enterprise teams

    Enterprise-grade ITSM, for every business

    Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.
    Try it Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    Museeks

    Museeks

    A simple, clean and cross-platform music player

    A simple, clean and cross-platform music player. Museeks is on its way to a big rewrite with some major UI changes, please help shape the future of the music player in the discussions section! You will not find tons of features, as its goals is not to compete with more complete and more famous music players. Museeks is currently in development. This implies some things can break after an update (database schemes changes, config...).
    Downloads: 19 This Week
    Last Update:
    See Project
  • 2
    S&box

    S&box

    s&box is a modern game engine, built on Valve's Source 2

    S&box is the open-source codebase for s&box, a next-generation sandbox game development platform from the creators of Garry’s Mod that blends modding freedom with modern tooling and performance. Built on a cutting-edge game engine, s&box allows creators to prototype, build, and share interactive game modes, tools, and environments using C#, JavaScript, and visual scripting, promoting accessible content creation for developers of varying skill levels. The platform emphasizes multiplayer and...
    Downloads: 114 This Week
    Last Update:
    See Project
  • 3
    WhisperLive

    WhisperLive

    A nearly-live implementation of OpenAI's Whisper

    WhisperLive is a “nearly live” implementation of OpenAI’s Whisper model focused on real-time transcription. It runs as a server–client system in which the server hosts a Whisper backend and clients stream audio to be transcribed with very low delay. The project supports multiple inference backends, including Faster-Whisper, NVIDIA TensorRT, and OpenVINO, allowing you to target GPUs and different CPU architectures efficiently. It can handle microphone input, pre-recorded audio files, and...
    Downloads: 14 This Week
    Last Update:
    See Project
  • 4
    Lidify

    Lidify

    Lidify is built for music lovers who want the convenience of streaming

    Lidify is a self-hosted, on-demand audio streaming platform that aims to deliver a Spotify-like experience while keeping your music library fully under your control. You point it at your personal collection, and it scans, catalogs, and enriches your library with metadata so browsing feels polished instead of “folder-based.” Beyond basic playback, it leans into discovery with personalized “made for you” mixes and one-click radio modes that generate stations from your own listening history and...
    Downloads: 6 This Week
    Last Update:
    See Project
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 5
    Phoniebox

    Phoniebox

    A Raspberry Pi jukebox, playing local music, podcasts, web radio

    Phoniebox is a contactless jukebox for the Raspberry Pi, that plays audio files, playlists, podcasts, web streams, and Spotify triggered by RFID cards. All plug and play via USB, no soldering iron needed. It also features GPIO button control support.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 6
    Riffusion App

    Riffusion App

    Stable diffusion for real-time music generation (web app)

    Riffusion App Hobby is an open-source interactive web application that enables real-time music generation using stable diffusion models adapted for audio synthesis. Unlike traditional music generation tools, it treats audio as spectrogram images and applies diffusion techniques to generate continuous sound transitions, allowing users to create evolving musical loops and compositions. The application is built with modern web technologies including Next.js, React, and three.js, providing a...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 7
    Anki

    Anki

    Anki is a smart spaced repetition flashcard program

    Anki is a free, open-source spaced repetition flashcard application designed for efficient long‑term memorization. It supports a wide variety of media types (text, images, audio, LaTeX), advanced scheduling algorithms (SM‑2, FSRS), and extensibility via add‑ons. It’s widely used for education, language learning, medical training, and more.
    Downloads: 36 This Week
    Last Update:
    See Project
  • 8
    Scriberr

    Scriberr

    Self-hosted AI audio transcription

    Scriberr is a self-hosted AI-powered transcription platform designed to convert audio and video into highly accurate text while prioritizing privacy and local processing. Unlike cloud-based transcription services, Scriberr runs entirely on the user’s machine, ensuring that sensitive recordings are never sent to third-party servers and remain fully under user control. It leverages modern speech recognition models such as Whisper and other advanced architectures to deliver precise transcripts...
    Downloads: 17 This Week
    Last Update:
    See Project
  • 9
    LMSFM Linux

    LMSFM Linux

    Musician-oriented Linux distro

    Let's Make Some F*&^in' Music is a USB-based live Linux distro based on Slackware with the intent of providing a comprehensive music recording and production studio using only FOSS.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 10
    media-chrome

    media-chrome

    Custom elements (web components) for making audio and video player

    media-chrome is an open source library that provides fully customizable media player controls using native web components, allowing developers to design consistent and flexible audio and video player interfaces across different platforms and frameworks. Instead of relying on default browser controls or proprietary player APIs, Media Chrome introduces a set of reusable custom elements that can be composed using standard HTML, styled with CSS, and integrated into any JavaScript framework...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    LatentSync

    LatentSync

    Taming Stable Diffusion for Lip Sync

    LatentSync is an open-source framework from ByteDance that produces high-quality lip-synchronization for video by using an audio-conditioned latent diffusion model, bypassing traditional intermediate motion representations. In effect, given a source video (with masked or reference frames) and an audio track, LatentSync directly generates frames whose lip motions and expressions align with the audio, producing convincing talking-head or animated lip-sync output. The system leverages a U-Net...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    SoulSync

    SoulSync

    Automated Music Discovery and Collection Manager

    SoulSync is an intelligent music discovery and automation platform designed to bridge streaming services with self-hosted media libraries, enabling users to automatically grow and maintain curated music collections. The system continuously monitors selected artists and detects new releases, then generates personalized playlists such as Release Radar and Discovery Weekly using its built-in recommendation logic. It can automatically download missing tracks from multiple sources including...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 13
    You-Get

    You-Get

    Dumb downloader that scrapes the web

    You-Get is a small command-line utility for downloading media (video, audio and images) from the Web when there are no other means to do so. It can download video and audio files from such popular web sites as YouTube, Twitter, Niconico, Vimeo, Flickr, Instagram and a whole lot more. You-Get is a great option for when you want to enjoy your favorite videos, audio or images from the internet without having to open any web browsers or get interrupted by ads. It’s also a good choice for...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    KrillinAI

    KrillinAI

    Video translation and dubbing tool powered by LLMs

    KrillinAI is an end-to-end content localization, translation, and dubbing tool aimed at helping creators transform videos into multiple languages with minimal manual effort. It integrates several stages of the pipeline: video acquisition (either from local files or remote via download tools), speech recognition (ASR), subtitle segmentation and alignment, machine translation (with context-aware translation to preserve semantics), and voice cloning + text-to-speech (TTS) to produce dubbed...
    Downloads: 22 This Week
    Last Update:
    See Project
  • 15
    Orange Juice Audio Enhancer for Linux
    Audio Enhancer for Linux by MX Linux's Freja. (Development time spents 6 years). It's GPLv3. 22 Nov 2025 - Latest "Super Headphones.json" is a perfect evolve style of "Orange Juice Mild". Super style for Headphones Music Listenning for extrox (MX-25 based or later) family, wow! ____________ Most recommended Default is "Orange Juice Mild" .
    Downloads: 8 This Week
    Last Update:
    See Project
  • 16
    Buster

    Buster

    Captcha solver extension for humans

    ...The success rate of the extension can be improved by simulating user interactions with the help of a client app. Follow the instructions from the extension's options to download and install the client app on Windows, Linux and macOS, or get the app from this repository.
    Downloads: 65 This Week
    Last Update:
    See Project
  • 17
    ReClip

    ReClip

    Download videos from almost any website

    ReClip is a lightweight, self-hosted media downloader that provides a simple web-based interface for downloading videos and audio from a wide range of online platforms. Built around the yt-dlp engine, it supports over a thousand websites, including major platforms like YouTube, TikTok, and Instagram, allowing users to retrieve media content in various formats. The application emphasizes simplicity and minimalism, featuring a clean interface built with plain HTML, CSS, and JavaScript without...
    Downloads: 77 This Week
    Last Update:
    See Project
  • 18
    TRIBE v2

    TRIBE v2

    A multimodal model for brain response prediction

    TRIBE v2 is a multimodal foundation model developed by Meta AI for predicting human brain activity from naturalistic stimuli such as video, audio, and text. It is designed for in-silico neuroscience, enabling researchers to model how the brain responds to complex real-world inputs. The system integrates state-of-the-art encoders—including LLaMA for text, V-JEPA for video, and Wav2Vec-BERT for audio—into a unified Transformer architecture. This combined representation is mapped onto the...
    Downloads: 10 This Week
    Last Update:
    See Project
  • 19
    Hugging Face - Speech To Speech

    Hugging Face - Speech To Speech

    Open speech-to-speech models and pipelines by Hugging Face toolkit AI

    This project from Hugging Face focuses on enabling direct speech-to-speech processing using modern machine learning models. It provides tools and reference implementations that allow audio input to be transformed into audio output without requiring an intermediate text representation. Hugging Face - Speech To Speech builds on recent advances in speech modeling, combining components such as speech recognition, translation, and synthesis into unified pipelines. It is designed to help...
    Downloads: 3 This Week
    Last Update:
    See Project
  • 20
    SoniTranslate

    SoniTranslate

    Synchronized Translation for Videos

    SoniTranslate is a video translation and dubbing system that produces synchronized target-language audio tracks for existing video content. It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets...
    Downloads: 25 This Week
    Last Update:
    See Project
  • 21
    FastRTC

    FastRTC

    The python library for real-time communication

    FastRTC is a Python library designed to simplify real-time communication (RTC), especially for audio and video streaming applications. It abstracts away much of the complexity that typically comes with implementing WebRTC by providing a simple interface — e.g. a Stream class — that can be mounted within a web backend (for example a FastAPI application). This makes it particularly well suited for building real-time voice (or video) interfaces for applications such as AI assistants, live chat,...
    Downloads: 9 This Week
    Last Update:
    See Project
  • 22
    Speakr

    Speakr

    Speakr is a personal, self-hosted web application

    Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    LiveAvatar

    LiveAvatar

    Streaming Real-time Audio-Driven Avatar Generation

    LiveAvatar is an open-source research and implementation project that provides a unified framework for real-time, streaming, interactive avatar video generation driven by audio and other control signals. It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions. The project co-designs algorithms and system...
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Umami

    Umami

    A simple, fast, website analytics alternative to Google Analytics

    Umami is a simple, easy to use, self-hosted web analytics solution. The goal is to provide you with a friendlier, privacy-focused alternative to Google Analytics and a free, open-sourced alternative to paid solutions. Umami collects only the metrics you care about and everything fits on a single page. You can view a live demo here. Umami measures just the important metrics that you care about: pageviews, devices used, and where your visitors are coming from. Everything is displayed on a...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25
    Real-Time Voice Cloning

    Real-Time Voice Cloning

    Clone a voice in 5 seconds to generate arbitrary speech in real-time

    Real-Time Voice Cloning is an influential deep-learning repository that demonstrates how to clone a voice from just a few seconds of audio and then generate arbitrary speech in that voice in near real time. It implements the SV2TTS pipeline (“Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis”) in three stages: a speaker encoder, a synthesizer, and a vocoder. In the first stage, short audio clips are converted into a fixed-dimensional speaker embedding that...
    Downloads: 11 This Week
    Last Update:
    See Project
MongoDB Logo MongoDB