Search Results for "audio linux" - Page 2

3869 projects for "audio linux" with 1 filter applied: BSD

1

HunyuanVideo-Avatar

Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

HunyuanVideo-Avatar is a multimodal diffusion transformer (MM-DiT) model by Tencent Hunyuan for animating static avatar images into dynamic, emotion-controllable, and multi-character dialogue videos, conditioned on audio. It addresses challenges of motion realism, identity consistency, and emotional alignment. Innovations include a character image injection module, an Audio Emotion Module for transferring emotion cues, and a Face-Aware Audio Adapter to isolate audio effects on faces,...

Downloads: 1 This Week

Last Update: 2025-12-16
2

WanGP

AI video generator optimized for low-VRAM and older GPUs

Wan2GP is an open source AI video generation toolkit designed to make modern generative models accessible on consumer-grade hardware with limited GPU memory. It acts as a unified interface for running multiple video, image, and audio generation models, including Wan-based models as well as other systems like Hunyuan Video, Flux, and Qwen. A key focus of the project is reducing VRAM requirements, enabling some workflows to run on as little as 6 GB while still supporting older Nvidia and...

Downloads: 54 This Week

Last Update: 2 days ago
3

AudioMuse-AI

AudioMuse-AI is an Open Source Dockerized environment

AudioMuse-AI is an open-source system designed to automatically generate playlists and analyze music libraries using artificial intelligence and audio signal processing techniques. The platform runs locally in a Dockerized environment and performs detailed sonic analysis on audio files to understand characteristics such as tempo, mood, and acoustic similarity. By analyzing the underlying audio content rather than relying on external metadata services, the system can organize large personal...

Downloads: 2 This Week

Last Update: 1 day ago
4

Butterchurn

Butterchurn is a WebGL implementation of the MilkDrop visualizer

Butterchurn is a WebGL-based music visualization engine that recreates the classic MilkDrop visualizer experience entirely in the browser using modern web technologies. It is designed to render complex, real-time audio-reactive graphics that respond dynamically to music input, producing highly immersive and fluid visual effects. The engine uses GPU acceleration through WebGL to achieve high performance, allowing it to handle intricate shader-based visualizations without overwhelming system...

Downloads: 2 This Week

Last Update: 6 days ago
5

MusicFreePlugins

MusicFreePlayPlugin

The MusicFreePlugins project is a collection and framework for plugins that extend the functionality of the MusicFree ecosystem by providing access to various music sources and features. It defines a standardized interface for plugin development, allowing contributors to implement features such as search, playback, and metadata retrieval. The system is designed to be modular, enabling users to install, update, and manage plugins independently of the core application. It supports multiple...

Downloads: 9 This Week

Last Update: 2026-04-17
6

MusicFree

Plugin-based, customizable, ad-free, free music player

The MusicFree project is an open-source, plugin-based music player designed for mobile platforms such as Android and HarmonyOS, emphasizing flexibility, customization, and privacy. Unlike traditional music apps, it does not include built-in audio sources but instead relies entirely on plugins to fetch and manage music content. This modular architecture allows users to integrate multiple sources and extend functionality without modifying the core application. The project supports features...

Downloads: 6 This Week

Last Update: 2026-04-17
7

media-chrome

Custom elements (web components) for making audio and video players

media-chrome is an open source library that provides fully customizable media player controls using native web components, allowing developers to design consistent and flexible audio and video player interfaces across different platforms and frameworks. Instead of relying on default browser controls or proprietary player APIs, Media Chrome introduces a set of reusable custom elements that can be composed using standard HTML, styled with CSS, and integrated into any JavaScript framework...

Downloads: 6 This Week

Last Update: 2026-04-13
8

Riffusion App

Stable diffusion for real-time music generation (web app)

Riffusion App Hobby is an open-source interactive web application that enables real-time music generation using stable diffusion models adapted for audio synthesis. Unlike traditional music generation tools, it treats audio as spectrogram images and applies diffusion techniques to generate continuous sound transitions, allowing users to create evolving musical loops and compositions. The application is built with modern web technologies including Next.js, React, and three.js, providing a...

Downloads: 1 This Week

Last Update: 2026-03-18
9

SoniTranslate

Synchronized Translation for Videos

SoniTranslate is a video translation and dubbing system that produces synchronized target-language audio tracks for existing video content. It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets...

Downloads: 19 This Week

Last Update: 2025-11-28
10

bfxr

Flash + AIR sound effects generator. Based on Sfxr.

The bfxr project by increpare is a sound-effects generator tool originally built using Flash + AIR, based on the earlier Sfxr project. Its purpose is to enable users, especially game developers and sound designers, to quickly generate retro, 8-bit/“chiptune” style sound effects (“bleeps”, “booms”, “zaps”, etc.) without deep knowledge of audio signal processing. It offers an interactive GUI through which you can tweak many parameters (oscillators, envelopes, filters, etc.) to sculpt custom...

Downloads: 14 This Week

Last Update: 2 days ago
11

cmus - C* Music Player

Small, fast & powerful console music player for Unix-like systems

cmus, also known as the C* Music Player, is a small yet fast and powerful console music player for Unix-like operating systems. It comes with a number of useful features, such as gapless playback, ReplayGain support, MP3 and Ogg streaming, an easy-to-use directory browser, powerful playlist filters with live filtering, and more. cmus also supports several input and output plugins. Input plugins include: Ogg Vorbis, MP3, FLAC, Opus, Musepack, WavPack, WAV, AAC, MP4, audio CD and more. Output plugins...

Downloads: 2 This Week

Last Update: 2024-10-26
12

ReClip

Download videos from almost any website

ReClip is a lightweight, self-hosted media downloader that provides a simple web-based interface for downloading videos and audio from a wide range of online platforms. Built around the yt-dlp engine, it supports over a thousand websites, including major platforms like YouTube, TikTok, and Instagram, allowing users to retrieve media content in various formats. The application emphasizes simplicity and minimalism, featuring a clean interface built with plain HTML, CSS, and JavaScript without...

Downloads: 53 This Week

Last Update: 2026-04-09
13

WavTokenizer

SOTA discrete acoustic codec models with 40/75 tokens per second

WavTokenizer is a state-of-the-art discrete acoustic codec designed specifically for audio language modeling, capable of compressing 24 kHz audio into just 40 or 75 tokens per second while preserving high perceptual quality. It is built to represent speech, music, and general audio with extremely low bitrate, making it ideal as a front-end for large audio language models like GPT-4o and similar architectures. The model uses a single-quantizer design together with temporal compression to...

Downloads: 0 This Week

Last Update: 2025-11-28
14

AudioLM - Pytorch

Implementation of the AudioLM audio generation model in PyTorch

Implementation of AudioLM, a Language Modeling Approach to Audio Generation out of Google Research, in PyTorch. It also extends the work with conditioning via classifier-free guidance with T5. This allows one to do text-to-audio or TTS, which the paper does not offer. Yes, this means VALL-E can be trained from this repository. It is essentially the same. This repository now also contains an MIT-licensed version of SoundStream. It is also compatible with EnCodec, however, be aware that it...

Downloads: 2 This Week

Last Update: 2025-01-12
15

Bili23 Downloader

Cross platform GUI tool for downloading videos from Bilibili sites

Bili23-Downloader is an open source desktop application designed for downloading video content from the Bilibili platform. It provides a graphical interface that allows users to download various types of media including user-uploaded videos, series episodes, movies, and other hosted content. It focuses on ease of use with a zero-configuration setup, making it accessible to both beginners and experienced users. It supports high performance downloads through multi-threading and includes resume...

Downloads: 11 This Week

Last Update: 2026-04-07
16

Unrud Video Downloader

Download videos from websites like YouTube and many others

Video Downloader is a desktop application designed to simplify the process of downloading videos from various online platforms through a user-friendly graphical interface. Built on top of yt-dlp, it abstracts the complexity of command-line tools and provides an accessible way for users to retrieve video and audio content. The application supports a wide range of features, including downloading entire playlists, handling private or password-protected content, and automatically selecting optimal formats based on user preferences. It also allows users to convert videos into audio files such as MP3, making it useful for media extraction workflows. ...

Downloads: 11 This Week

Last Update: 2026-04-09
17

Remotion

Make videos programmatically with React

Remotion is a cutting-edge library that lets developers create real videos programmatically using React components, transforming familiar UI paradigms into a flexible, code-driven video production workflow. Instead of traditional timeline editors, Remotion leverages HTML, CSS, and JavaScript to define video frames, animations, and transitions, which means developers can use states, props, loops, and component hierarchies to automate complex motion graphics. Because it integrates with the...

Downloads: 30 This Week

Last Update: 4 days ago
18

OpenCorePkg

OpenCore bootloader

OpenCorePkg is an open-source, modular UEFI (Unified Extensible Firmware Interface) bootloader and development framework, primarily designed to enable macOS booting on non-Apple hardware (Hackintosh). It includes Apple-specific UEFI drivers, utilities for macOS installation support, and shared libraries used across Acidanthera projects. Apple disk image loading support. Apple keyboard input aggregation. Apple PE image signature verification. Apple UEFI secure boot supplemental code. Audio...

Downloads: 178 This Week

Last Update: 2026-03-20
19

Ultravox

Fast multimodal LLM for real-time voice interaction and AI apps

Ultravox is an open source multimodal large language model designed specifically for real-time voice-based interactions. It is built to process both text and spoken audio directly, eliminating the need for a separate speech recognition stage and enabling more seamless conversational experiences. Ultravox works by combining text prompts with encoded audio inputs, allowing it to understand spoken language alongside written instructions in a unified pipeline. Internally, it leverages pretrained...

Downloads: 1 This Week

Last Update: 2026-03-18
20

ffmpeg-normalize

Audio Normalization for Python/ffmpeg

ffmpeg-normalize is a command-line utility designed to normalize audio levels in media files using FFmpeg, ensuring consistent volume across multiple tracks. It supports both EBU R128 loudness normalization and peak normalization methods, allowing users to choose the appropriate standard for their needs. The tool analyzes audio streams and applies adjustments to achieve target loudness levels without introducing distortion. It can process multiple files in batch mode, making it suitable for...

Downloads: 0 This Week

Last Update: 1 day ago
21

Kaset

The missing YouTube Music macOS app

Kaset is a social audio platform framework that allows users to host, share, and interact with audio content in community-oriented spaces, combining elements of podcasting, voice rooms, and feedback-driven discovery. It provides an interface where creators can upload episodes, host live or scheduled voice sessions, and cultivate listener communities through comments, reactions, and follow systems. The platform emphasizes audio discovery with playlists, curated channels, and trending audio...

Downloads: 0 This Week

Last Update: 2026-03-28
22

SerenityOS

The Serenity Operating System

SerenityOS is an open source Unix-like operating system project with its own custom kernel, graphical user interface, system libraries, and userland tools. It combines a nostalgic “90s UI aesthetic” with modern system capabilities: a preemptive, multi-threaded kernel, its own browsers, network stack, file systems, IPC, security features, and a suite of graphical and developer applications. The project is both a hobbyist OS and a polished engineering sandbox.

Downloads: 22 This Week

Last Update: 2026-04-18
23

Whisper

Robust Speech Recognition via Large-Scale Weak Supervision

OpenAI Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multitasking model that can perform multilingual speech recognition, speech translation, and language identification. A Transformer sequence-to-sequence model is trained on various speech processing tasks, including multilingual speech recognition, speech translation, spoken language identification, and voice activity detection. These tasks are jointly represented...

Downloads: 92 This Week

Last Update: 2025-06-26
24

JSNES

A JavaScript NES emulator

JSNES is a JavaScript-based emulator that replicates the functionality of the Nintendo Entertainment System (NES), enabling classic games to run directly in web browsers or Node.js environments. It implements the core components of NES hardware, including the CPU, graphics processing unit, and audio system, to deliver an accurate emulation experience. The project is designed as a library, allowing developers to embed emulation capabilities into web applications or custom interfaces. It...

Downloads: 3 This Week

Last Update: 2026-04-12
25

Markdownify MCP Server

Convert files and web content into clean, usable Markdown easily

Markdownify MCP is a Model Context Protocol server that converts many types of files and web content into clean Markdown. It supports formats such as PDFs, images, audio with transcription, DOCX, XLSX, and PPTX, along with web sources like YouTube transcripts, Bing results, and general webpages. Markdownify MCP is designed to simplify content extraction and make data easier to read, share, and reuse in structured workflows. Developers can install dependencies, build, and run the server...

Downloads: 4 This Week

Last Update: 2026-04-18