Search Results for "audio linux" - Page 9

Sort By:

Showing 6662 open source projects for "audio linux"

View related business solutions

MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
1

linux-file-converter-addon

Convert various image, audio and video formats from your context menu.

Convert between various image, audio and video formats using the context menu. The addon is written in Python and available for Nautilus, Nemo, Thunar and Dolphin file viewers. It adds a new option to the context menu to create an easy way to convert between a huge amount of file types. The program offers many options to customize the appearance of its context menu. There are also a few extra formats which can be added by installing optional dependencies. The tool has a built-in auto-update...

Downloads: 13 This Week

Last Update: 2025-11-13
See Project
2

notebooklm-py

Unofficial Python API and agentic skill for Google NotebookLM

notebooklm-py is an unofficial Python API and agent-ready integration layer for Google NotebookLM that exposes NotebookLM functionality through code, the command line, and AI agent workflows. Its goal is to provide programmatic access not just to standard notebook operations, but also to many capabilities that are either limited or unavailable in the web interface, making it especially useful for automation and custom pipelines. The project covers notebook management, source ingestion,...

Downloads: 7 This Week

Last Update: 2026-03-17
See Project
3

mpv

Command line video player

mpv is a free (as in freedom) media player for the command line. It supports a wide variety of media file formats, audio and video codecs, and subtitle types. Powerful scripting capabilities can make the player do almost anything. There is a large selection of user scripts on the wiki. While mpv strives for minimalism and provides no real GUI, it has a small controller on top of the video for basic control. mpv has an OpenGL, Vulkan, and D3D11 based video output that is capable of many...

Downloads: 93 This Week

Last Update: 2025-12-22
See Project
4

TorchAudio

Data manipulation and transformation for audio signal processing

The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the same philosophy of providing strong GPU acceleration, having a focus on trainable features through the autograd system, and having consistent style (tensor names and dimension names). Therefore, it is primarily a machine learning library and not a general signal processing library. The benefits of PyTorch can be seen in torchaudio through having all the computations be through PyTorch...

Downloads: 0 This Week

Last Update: 2026-02-17
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
5

Kooha

Elegantly record your screen

Capture your screen in an intuitive and straightforward way without distractions. Kooha is a simple screen recorder with a minimal interface. You can simply click the record button without having to configure a bunch of settings.

Downloads: 19 This Week

Last Update: 2025-08-23
See Project
6

Shaka Player

JavaScript player library / DASH & HLS client / MSE-EME player

Shaka Player is an open-source JavaScript library for adaptive media. It plays adaptive media formats (such as DASH and HLS) in a browser, without using plugins or Flash. Instead, Shaka Player uses the open web standards MediaSource Extensions and Encrypted Media Extensions. Shaka Player also supports offline storage and playback of media using IndexedDB. Content can be stored on any browser. Storage of licenses depends on browser support. Our main goal is to make it as easy as possible to...

Downloads: 12 This Week

Last Update: 1 day ago
See Project
7

Remotion

Make videos programmatically with React

Remotion is a cutting-edge library that lets developers create real videos programmatically using React components, transforming familiar UI paradigms into a flexible, code-driven video production workflow. Instead of traditional timeline editors, Remotion leverages HTML, CSS, and JavaScript to define video frames, animations, and transitions, which means developers can use states, props, loops, and component hierarchies to automate complex motion graphics. Because it integrates with the...

Downloads: 6 This Week

Last Update: 1 day ago
See Project
8

clone-voice

A sound cloning tool with a web interface, using your voice

Clone-voice is a local voice-cloning tool that lets you synthesize speech in any target voice or convert one recording into another voice using the same timbre. It is built around Coqui’s XTTS-v2 model, so it inherits multilingual support and modern neural TTS quality while wrapping it in a user-friendly desktop workflow. The app is designed to be very easy to use: you download a precompiled package, double-click app.exe, and it launches a browser-based web interface where you control...

Downloads: 15 This Week

Last Update: 2025-11-28
See Project
9

projectM

Cross-platform Music Visualization Library

Cross-platform Music Visualization Library. Open-source and Milkdrop-compatible. Experience psychedelic and mesmerizing visuals by transforming music into equations that render a limitless array of user-contributed visualizations. projectM is an open-source project that reimplements the esteemed Winamp Milkdrop by Geiss in a more modern, cross-platform reusable library. Its purpose in life is to read an audio input and to produce mesmerizing visuals, detecting tempo, and rendering advanced...

Downloads: 46 This Week

Last Update: 2025-11-28
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
10

Plyr

Simple HTML5, YouTube and Vimeo player

A simple, accessible and customizable media player for HTML5 Video, HTML5 Audio, YouTube and Vimeo. Premium video monetization from Video Intelligence. Plyr is a simple, lightweight, accessible and customizable HTML5, YouTube and Vimeo media player that supports modern browsers. Accessible - full support for VTT captions and screen readers. Customizable - make the player look how you want with the markup you want. Responsive - works with any screen size. Monetization - make money from your...

Downloads: 6 This Week

Last Update: 2026-01-03
See Project
11

edge-tts

Use Microsoft Edge's online text-to-speech service from Python

edge-tts is a Python module and command-line tool that gives you direct access to Microsoft Edge’s online text-to-speech service without needing the Edge browser, Windows, or any API key. It wraps the same cloud voices used by Edge, exposing them through a simple CLI (edge-tts, edge-playback) and a Python API, so you can script high-quality speech generation in your own applications. The tool lets you list available voices, specify locale and voice name, and generate audio files in common...

Downloads: 19 This Week

Last Update: 2025-12-12
See Project
12

SameBoy

Game Boy and Game Boy Color emulator written in C

SameBoy is a user friendly, powerful and open source Game Boy, Game Boy Color and Super Game Boy emulator for macOS, Windows and Unix-like platforms. SameBoy is extremely accurate and includes a wide range of both powerful debugging features and user-facing features, making it ideal for both casual players and developers. Of course, SameBoy also has every feature one would expect from an emulator – from save states to scaling filters. Supports Game Boy (DMG), Game Boy Pocket and Light (MGB),...

Downloads: 11 This Week

Last Update: 2026-03-04
See Project
13

NanoBoyAdvance

A cycle-accurate Nintendo Game Boy Advance emulator

NanoBoyAdvance is a cycle-accurate Game Boy Advance emulator that prioritizes precision and correctness in replicating original hardware behavior. It is designed to emulate the GBA at a very low level, including CPU timing, DMA operations, graphics processing, and memory behavior, ensuring that even edge cases and obscure hardware quirks are faithfully reproduced. The emulator achieves extremely high compatibility, passing multiple hardware test suites and accurately running games that rely...

Downloads: 8 This Week

Last Update: 2026-04-07
See Project
14

Orpheus TTS

Towards Human-Sounding Speech

Orpheus TTS is a state-of-the-art open-source text-to-speech system built on a Llama-3B backbone, treating speech synthesis as a large language model problem instead of a traditional TTS pipeline. It is designed to produce human-like speech with natural intonation, emotion, and rhythm, targeting quality comparable to or better than many closed-source systems. The project ships both pretrained and finetuned English models, as well as a family of multilingual models released as a research...

Downloads: 3 This Week

Last Update: 2025-12-05
See Project
15

RealtimeTTS

Converts text to speech in realtime

RealtimeTTS is a low-latency text-to-speech library built for real-time applications such as voice chat with LLMs, assistants, and interactive tools. It is designed around a streaming model: you can feed it text incrementally (for example, as an LLM responds) and get audio output almost immediately, which keeps end-to-end latency very low. The library is engine-agnostic and plugs into a wide range of cloud and local TTS systems, including OpenAI, ElevenLabs, Azure, Coqui, Piper, StyleTTS2,...

Downloads: 3 This Week

Last Update: 2026-03-28
See Project
16

Agili Hacker Podcast

AI tool that turns Hacker News posts into daily podcast updates

Hacker Podcast is an AI-powered project that turns top Hacker News stories into a Chinese podcast. It automatically fetches trending posts each day, processes the content with AI, and generates concise summaries before converting them into audio. This creates a hands-free way to stay updated on tech, startups, and developer discussions without reading long threads. Hacker Podcast combines content aggregation, natural language processing, and text-to-speech to deliver clear and digestible...

Downloads: 2 This Week

Last Update: 5 days ago
See Project
17

kew

Music for the Shell

KEW (short for Key-Enabled Wallet) is an open-source command-line wallet and key management tool built for modern blockchain and Web3 workflows, designed to give developers and active users a secure and flexible way to manage cryptographic keys, accounts, and signing operations from the terminal. It focuses on simplicity, reproducibility, and composability, letting users manage multiple wallets, derive keys from mnemonics, and perform signing for a variety of chain-specific transaction...

Downloads: 2 This Week

Last Update: 2026-02-09
See Project
18

GLM-TTS

Controllable & emotion-expressive zero-shot TTS

GLM-TTS is an advanced text-to-speech synthesis system built on large language model technologies that focuses on producing high-quality, expressive, and controllable spoken output, including features like emotion modulation and zero-shot voice cloning. It uses a two-stage architecture where a generative LLM first converts text into intermediate speech token sequences and then a Flow-based neural model converts those tokens into natural audio waveforms, enabling rich prosody and voice...

Downloads: 2 This Week

Last Update: 2026-04-10
See Project
19

ElevenLabs Python

The official Python SDK for the ElevenLabs API

elevenlabs-python is the official Python SDK for the ElevenLabs API, giving developers a convenient way to access ElevenLabs’ high-quality, lifelike voices. The library wraps the HTTP API into a typed Python client, so you can perform text-to-speech, streaming, voice cloning, voice management, and agents-related operations with simple method calls. It exposes ElevenLabs’ main models such as Eleven Multilingual v2, Eleven Flash v2.5, and Eleven Turbo v2.5, each targeting different trade-offs...

Downloads: 2 This Week

Last Update: 12 hours ago
See Project
20

Furnace

A multi-system chiptune tracker compatible with DefleMask modules

Furnace is a powerful multi-system chiptune tracker that enables users to compose music using the sound chips of classic computers, consoles, and arcade hardware. It supports an extensive range of audio chips, including FM synthesis, wavetable synthesis, and sample-based systems, making it one of the most versatile trackers available. The software is compatible with multiple operating systems and can be used both as a standalone application and as a development tool for retro-style audio...

Downloads: 0 This Week

Last Update: 2026-04-07
See Project
21

Metadata Extractor

Extracts Exif, IPTC, XMP, ICC and other metadata from image and video

metadata-extractor is a Java library for reading metadata from media files. The library understands several formats of metadata, many of which may be present in a single image.

Downloads: 13 This Week

Last Update: 2026-04-08
See Project
22

Rackula

Drag and drop rack visualizer

Rackula is a browser-based rack layout designer aimed at homelabbers, audio/video technicians, and equipment organizers who want a visual way to plan and document physical device racks. It runs entirely client-side with no backend server required, making it lightweight, fast, and easy to self-host or run locally without external dependencies. Users can drag and drop devices into customizable rack spaces, annotate equipment, set unit sizes, and manage complex layouts as their setup evolves....

Downloads: 69 This Week

Last Update: 2026-03-26
See Project
23

LTX-2

Python inference and LoRA trainer package for the LTX-2 audio–video

LTX-2 is a powerful, open-source toolkit developed by Lightricks that provides a modular, high-performance base for building real-time graphics and visual effects applications. It is architected to give developers low-level control over rendering pipelines, GPU resource management, shader orchestration, and cross-platform abstractions so they can craft visually compelling experiences without starting from scratch. Beyond basic rendering scaffolding, LTX-2 includes optimized math libraries,...

Downloads: 83 This Week

Last Update: 2026-04-13
See Project
24

GenAI Processors

GenAI Processors is a lightweight Python library

GenAI Processors is a lightweight Python library for building modular, asynchronous, and composable AI pipelines around Gemini. Its central abstraction is the Processor, a unit of work that consumes an asynchronous stream of parts (text, images, audio, JSON) and produces another stream, making it natural to chain operations and keep everything streaming end-to-end. Processors can be composed sequentially (to build multi-step flows) or in parallel (to fan-out work and merge results), which...

Downloads: 1 This Week

Last Update: 2026-03-10
See Project
25

Tagify

Lightweight, efficient Tags input component in Vanilla JS

Transforms an input field or a textarea into a Tags component, in an easy, customizable way, with great performance and a small code footprint, exploded with features. Customizable HTML templates for the different areas of the component (wrapper, tags, dropdown, dropdown item, dropdown header, dropdown footer) Shows suggestions list (flexible settings & styling) at full (component) width or next to the typed texted (caret) Allows setting suggestions' aliases for easier fuzzy-searching....

Downloads: 1 This Week

Last Update: 3 days ago
See Project