Showing 6662 open source projects for "audio linux"

  • 1
    ioquake3

    The ioquake3 community effort to continue supporting/developing id's

    ...It is designed to let players run Quake 3, its expansion Team Arena, and community mods on contemporary systems while also serving as a solid base for new projects. The engine modernizes the original codebase with a CMake build system, an SDL2 backend for cross-platform windowing and input, and OpenAL sound for better audio quality and multi-speaker setups. It adds numerous quality-of-life improvements such as VoIP support, AVI demo capture, improved console completion and history, and optional Ogg Vorbis support. ioquake3 also improves portability and maintainability by supporting x86_64 on Linux, MinGW builds on Windows, and various other operating systems, and even provides web support via Emscripten.
    Downloads: 4 This Week
  • 2
    ALVR - Air Light VR

    Stream VR games from your PC to your headset via Wi-Fi

    ALVR (Air Light VR) is an open-source system that enables wireless virtual reality streaming by transmitting VR content from a PC to a standalone headset over a network connection. It allows users to run PC-based VR applications while using devices such as standalone headsets, effectively bridging the gap between high-performance desktop VR and portable hardware. The system works by encoding video output from the PC, streaming it over Wi-Fi, and decoding it on the headset in real time, while...
    Downloads: 38 This Week
  • 3
    Nextcloud Server

    A safe home for all your data

    Nextcloud Server is free and open source server software that lets you store all of your data on a server of your choosing. With Nextcloud you can easily access and store data in the data center you trust, sync data among various devices, and share your data for collaboration purposes. It offers the best security in the self-hosted file sync and share world, and is expandable with hundreds of apps.
    Downloads: 35 This Week
  • 4
    Allegro

    The official Allegro 5 git repository. Pull requests welcome

    Allegro 5 is the latest major revision of the Allegro library, designed to take advantage of modern hardware, including hardware acceleration using 3D cards.
    Downloads: 4 This Week
  • 5
    red5-server

    Red5 Server core

    Red5 is an open source Flash server written in Java. It supports streaming video (FLV, F4V, MP4, 3GP), streaming audio (MP3, F4A, M4A, AAC), recording client streams (FLV and AVC+AAC in FLV container), shared objects, live stream publishing, remoting, and the protocols RTMP, RTMPT, RTMPS, and RTMPE.
    Downloads: 7 This Week
  • 6
    LuxTTS

    A high-quality rapid TTS voice cloning model

    LuxTTS is an open-source text-to-speech (TTS) system focused on delivering high-quality, rapid voice synthesis and voice cloning that runs extremely fast and efficiently on consumer hardware. It implements a lightweight architecture based on ZipVoice and optimized sampling techniques so that it can generate speech at speeds up to roughly 150 times real-time on a single GPU and faster than real-time on CPU, all while producing audio at high fidelity with 48 kHz quality. The project supports...
    Downloads: 4 This Week
  • 7
    Audiblez

    Generate audiobooks from e-books

    Audiblez is a tool for generating high-quality .m4b audiobooks directly from .epub e-books using the Kokoro-82M neural text-to-speech model. It focuses on making audiobook creation easy and fast: from a single command, the tool splits an e-book into chapters, synthesizes audio for each section, and then merges the results into a structured audiobook with chapter-based WAV files and a final .m4b container. The Kokoro-82M model it uses is compact (82M parameters) yet natural sounding, trained...
    Downloads: 7 This Week
  • 8
    ffmpeg.wasm

    FFmpeg for browser, powered by WebAssembly

    ffmpeg.wasm is a pure WebAssembly (and JavaScript/TypeScript) port of FFmpeg that enables in-browser media recording, conversion, and streaming, letting developers perform video/audio processing entirely client-side without server uploads. FFmpeg and its codecs are transpiled into WebAssembly via Emscripten. The port supports both single-threaded and multi-threaded cores using web workers, and is written in TypeScript for an improved developer experience.
    Downloads: 19 This Week
  • 9
    PowerPoint-ist

    Web presentation editor replicating many PowerPoint features online

    PPTist is a web-based presentation editing application designed to replicate many of the commonly used features found in traditional slide presentation software. It allows users to create, edit, and present slide decks directly within a web browser while maintaining a desktop-like editing experience. PPTist is built with Vue 3 and TypeScript and focuses on providing a highly interactive slide editing environment with extensive customization and extension potential. PPTist supports a wide...
    Downloads: 5 This Week
  • 10
    FastKoko

    Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model

    FastKoko is a self-hosted text-to-speech server built around the Kokoro-82M model and exposed through a FastAPI backend. It is designed to be easy to deploy via Docker, with separate CPU and GPU images so that users can choose between pure CPU inference and NVIDIA GPU acceleration. The project exposes an OpenAI-compatible speech endpoint, which means existing code that talks to the OpenAI audio API can often be pointed at a Kokoro-FastAPI instance with minimal changes. It supports multiple...
    Downloads: 5 This Week
  • 11
    linux-file-converter-addon

    Convert various image, audio and video formats from your context menu.

    Convert between various image, audio and video formats using the context menu. The addon is written in Python and available for the Nautilus, Nemo, Thunar and Dolphin file managers. It adds a new option to the context menu that makes it easy to convert between a large number of file types. The program offers many options to customize the appearance of its context menu. A few extra formats can be added by installing optional dependencies. The tool has a built-in auto-update...
    Downloads: 4 This Week
  • 12
    Qwen3-ASR

    Qwen3-ASR is an open-source series of ASR models

    Qwen3-ASR is an automatic speech recognition system in the QwenLM family, developed to convert spoken language into text with strong accuracy and real-time performance. As a specialized ASR variant of the broader Qwen language model ecosystem, it focuses on capturing reliable transcriptions from audio sources such as recordings, live streams, or conversational inputs while supporting low latency use cases. The architecture combines advanced neural acoustic modeling with context-aware...
    Downloads: 2 This Week
  • 13
    Descent 3

    Descent 3 by Outrage Entertainment

    ...It provides the full C and C++ engine source code, including the historically significant “1.5” patch that was previously created by developers and later stabilized by fans. The codebase covers the game’s rendering, physics, audio, networking, tools, and editor components, allowing enthusiasts to build, run, and modify the classic 6-degrees-of-freedom space shooter on modern systems. To actually play the game, users must supply their own original game assets, following instructions in the repository’s usage documentation. The project uses CMake and related modern tooling for cross-platform builds, with support for Linux and Windows among other environments. ...
    Downloads: 4 This Week
  • 14
    OpenAI Python

    The official Python library for the OpenAI API

    The OpenAI Python library provides convenient access to the OpenAI REST API from any Python 3.7+ application. The library includes type definitions for all request params and response fields, and offers both synchronous and asynchronous clients powered by httpx.
    Downloads: 9 This Week
  • 15
    NExT-GPT

    Code and models for ICML 2024 paper, NExT-GPT

    NExT-GPT is an open-source research framework that implements an advanced multimodal large language model capable of understanding and generating content across multiple modalities. Unlike traditional models that primarily handle text, NExT-GPT supports input and output combinations involving text, images, video, and audio in a unified architecture. The system connects a large language model with multimodal encoders and diffusion-based decoders so it can interpret information from different...
    Downloads: 0 This Week
  • 16
    DreamShell

    Operating system for the Sega Dreamcast

    DreamShell is an operating system for the Sega Dreamcast based on the KallistiOS kernel. It has a dynamically loadable module system and an interface for creating applications with an XML UI and scripts in C/C++ and Lua. The ready-made applications and modules serve as examples, including drivers for various devices, formats and interfaces, as well as examples of audio and video decoding, compression, packaging, binding, networking, emulation, scripting and more. From hardcore low-level assembler to high-level...
    Downloads: 11 This Week
  • 17
    rtmp-rtsp-stream-client-java

    Library to stream in rtmp and rtsp for Android. All code in Java

    Library for streaming in RTMP and RTSP. All code in Java.
    Downloads: 4 This Week
  • 18
    OpenAI Go

    The official Go library for the OpenAI API

    OpenAI Go is the official Go client library for accessing the OpenAI API. It enables developers to integrate OpenAI’s models and features into Go applications with a clean and idiomatic interface. The library provides support for a wide range of API endpoints including chat completions, assistants, embeddings, image generation, audio processing, and batch jobs. It includes built-in tools for handling authentication, managing API requests, and parsing structured responses. The repository also...
    Downloads: 1 This Week
  • 19
    IndexTTS2

    Industrial-level controllable zero-shot text-to-speech system

    IndexTTS is a modern, zero-shot text-to-speech (TTS) system engineered to deliver high-quality, natural-sounding speech synthesis with few requirements and strong voice-cloning capabilities. It builds on state-of-the-art models such as XTTS and other modern neural TTS backbones, improving them with a conformer-based speech conditional encoder and upgrading the decoder to a high-quality vocoder (BigVGAN2), leading to clearer and more natural audio output. The system supports zero-shot voice...
    Downloads: 4 This Week
  • 20
    AWS CDI SDK

    AWS Cloud Digital Interface (CDI) SDK

    The AWS Cloud Digital Interface (CDI) SDK is the library which implements the low-latency reliable transport of data between systems in the cloud including AWS Media Services and Amazon EC2 instances. The SDK takes care of the complex task of ensuring that large amounts of data are seamlessly transferred across the AWS network using multi-path technology to ensure lossless quality without the need for extensive network buffering and delay. The SDK consists of two main API layers, depending...
    Downloads: 0 This Week
  • 21
    Expo Orbit

    Accelerate your development workflow with one-click build launches

    Expo Orbit is a desktop application developed by the Expo team to streamline the development workflow for React Native and Expo projects. It offers a user-friendly interface that allows developers to manage simulators and devices, install and launch builds, and handle updates with ease. Orbit supports various platforms, including macOS, Windows, and Linux, and integrates seamlessly with Expo Application Services (EAS) to facilitate efficient testing and deployment of applications.
    Downloads: 12 This Week
  • 22
    D++

    C++ Discord API Bot Library - D++ is Lightweight and scalable

    D++ is a lightweight and simple library for Discord written in modern C++. It is designed to cover as much of the API specification as possible and to have an incredibly small memory footprint, even when caching large amounts of data. It is created by the developer of TriviaBot and contributed to by a dedicated team of developers.
    Downloads: 10 This Week
  • 23
    Translate-Subtitle-File

    Subtitle Creation Assistant

    A machine translation assistant for subtitle groups. Function 1: translate subtitle files (.srt, .ass, .vtt). Function 2: voice to text (drag in a video or audio file to recognize subtitles). (The latest version v4.1.0 Update time 2021 2 May 23) Twelve translation service providers can be configured, such as Google, Baidu, Tencent, Caiyun, IBM, Azure, and Amazon, along with six voice service providers: Alibaba Cloud, Xunfei, Tencent Cloud, IBM, Azure, and Amazon. Advantages: 1. You can use multiple service...
    Downloads: 7 This Week
  • 24
    Open Vision Agents by Stream

    Build Vision Agents quickly with any model or video provider

    Open Vision Agents by Stream is an open source framework from Stream for building real time, multimodal AI agents that watch, listen, and respond to live video streams. It focuses on combining video understanding models, such as YOLO and Roboflow based detectors, with real time large language models like OpenAI Realtime and Gemini Live to create interactive experiences. The framework uses Stream’s ultra low latency edge network so agents can join sessions quickly and maintain very low audio...
    Downloads: 5 This Week
  • 25
    MoviePy

    Video editing with Python

    MoviePy is a Python module for video editing, which can be used for basic operations (like cuts, concatenations, title insertions), video compositing (a.k.a. non-linear editing), video processing, or to create advanced effects. It can read and write the most common video formats, including GIF. MoviePy is open source software originally written by Zulko and released under the MIT licence. It works on Windows, Mac, and Linux, with Python 2 or Python 3. The code is hosted on GitHub, where...
    Downloads: 27 This Week