Showing 89 open source projects for "audio development"

  • 1
    pyglet

    pyglet is a cross-platform windowing and multimedia library for Python

    Pyglet is a cross-platform windowing and multimedia library for Python, intended for developing games and other visually rich applications. It supports windowing, input event handling, OpenGL graphics, loading images and videos, and playing sounds and music.
    Downloads: 0 This Week
  • 2
    Groq Python

    The official Python Library for the Groq API

    ...For those building AI-based tooling, automation scripts, or ML-backed backends, groq-python abstracts away HTTP request plumbing and exposes a clean API, accelerating development and reducing boilerplate.
    Downloads: 0 This Week
  • 3
    Android Developer Roadmap

    The Android Developer Roadmap offers comprehensive learning paths

    Android Developer Roadmap suggests learning paths for understanding Android development. It is designed to give you a comprehensive understanding of the current Android ecosystem and provides suggested learning paths along the way to help you grasp the concepts. Each node represents a concept from Android systems, the Android SDK, or commonly used libraries. For specific terminology, refer to the Android Developers Reference or GitHub. In...
    Downloads: 2 This Week
  • 4
    LLM Tornado

    The .NET library to build AI agents with 30+ built-in connectors

    LLM Tornado is a provider-agnostic .NET SDK designed to build, orchestrate, and deploy AI agents and workflows with a strong focus on flexibility and integration. It provides a unified interface that connects to more than 30 AI providers and vector databases, allowing developers to switch between models and services without rewriting application logic. The framework introduces a powerful orchestration system based on graph-like structures, where agents, tasks, and transitions can be defined...
    Downloads: 3 This Week
  • 5
    YoutubeExplode

    Abstraction layer over YouTube's internal API

    YoutubeExplode is a .NET library that provides a high-level abstraction for interacting with YouTube data, enabling developers to retrieve metadata and download media streams programmatically. The project exposes a clean API that allows applications to query videos, playlists, channels, and search results without relying on the official YouTube Data API. Under the hood, the library parses raw page data and leverages reverse-engineered internal endpoints to obtain structured information and...
    Downloads: 1 This Week
  • 6
    Cookbook (Google Gemini)

    Examples and guides for using the Gemini API

    The Gemini Cookbook is an official repository of examples and guides for using Google’s Gemini API. It provides a structured learning path with quick-start tutorials for beginners and practical examples for advanced users. The repository covers a wide range of Gemini capabilities, including text, images, video, speech, robotics, and multimodal interactions. It highlights newly introduced features such as Gemini 2.5 models (Flash and Pro), Gemini’s native image generation, Veo for video...
    Downloads: 1 This Week
  • 7
    Multimodal

    TorchMultimodal is a PyTorch library

    This project, also known as TorchMultimodal, is a PyTorch library for building, training, and experimenting with multimodal, multi-task models at scale. The library provides modular building blocks such as encoders, fusion modules, loss functions, and transformations that support combining modalities (vision, text, audio, etc.) in unified architectures. It includes a collection of ready model classes—like ALBEF, CLIP, BLIP-2, COCA, FLAVA, MDETR, and Omnivore—that serve as reference...
    Downloads: 0 This Week
  • 8
    Ruma

    A set of Rust crates for interacting with the Matrix chat network

    Matrix is an open specification for an online communication protocol. It includes all the features you'd expect from a modern chat platform including instant messaging, group chats, audio and video calls, searchable message history, synchronization across all your devices, and end-to-end encryption. Matrix is federated, so no single company controls the system or your data. You can use an existing server you trust or run your own, and the servers synchronize messages seamlessly. Learn more...
    Downloads: 0 This Week
  • 9
    opencore-amr

    Audio codecs extracted from Android Open Source Project

    A library of the OpenCORE Framework implementation of the Adaptive Multi Rate Narrowband and Wideband (AMR-NB and AMR-WB) speech codecs. A library of the VisualOn implementation of the Adaptive Multi Rate Wideband (AMR-WB) encoder and Advanced Audio Coding (AAC) encoder. A modified library of the Fraunhofer AAC decoder and encoder.
    Downloads: 6,684 This Week
  • 10
    Flutter Rust Bridge

    Rust binding generator, feature-rich, but seamless and simple

    flutter_rust_bridge is a toolchain that generates ergonomic bindings between Rust and Dart/Flutter so you can write performance-critical logic in Rust while building UI in Flutter. It abstracts away the low-level FFI details, letting developers define Rust functions and data structures that are safely callable from Dart. The project supports passing complex types, handling async operations and streams, and integrating with Flutter across mobile and desktop targets. By leaning on Rust’s...
    Downloads: 1 This Week
  • 11
    Mikmod Sound System
    MikMod is a module player and library supporting many formats, including MOD, S3M, IT, and XM. Originally a player for MS-DOS, MikMod has been ported to other platforms, such as Unix, Macintosh, BeOS, and Java(!!).
    Downloads: 300 This Week
  • 12
    Munt
    A multi-platform software synthesiser emulating (somewhat inaccurately) pre-GM MIDI devices such as the Roland MT-32, CM-32L, CM-64 and LAPC-I. In no way endorsed by or affiliated with Roland Corp. Source hosted at: http://www.github.com/munt/
    Downloads: 277 This Week
  • 13
    Extended Module Player

    A multi-format module player and library

    libxmp is a library that renders module files to PCM data. xmp is a module player for Unix-like systems, based on libxmp, that plays over 90 mainstream and obscure module formats from Amiga, Atari, Acorn, Apple IIgs and PC, including Protracker (MOD), Scream Tracker 3 (S3M), Fast Tracker II (XM) and Impulse Tracker (IT) files.
    Downloads: 330 This Week
  • 14
    Drumstick Libraries

    MIDI libraries for Qt/C++

    Drumstick is a tool to play music. This is a set of C++ MIDI libraries using Qt5 objects, idioms and style. It contains a C++ wrapper around the ALSA library sequencer interface; the ALSA sequencer provides software support for MIDI on Linux. A complementary library provides classes for processing SMF (Standard MIDI Files: .MID/.KAR) and Cakewalk (.WRK) file formats. A multiplatform realtime MIDI I/O library is also provided.
    Downloads: 6 This Week
  • 15
    JSpeech

    Java library designed to integrate Speech-to-Text

    jSpeech is a Java library designed to integrate Speech-to-Text (STT) capabilities, command control, and diarization (speaker identification) into applications in a simple, modular, and decoupled way.
    Downloads: 0 This Week
  • 16
    Vanilla Web Projects

    Mini projects built with HTML5, CSS & JavaScript

    vanillawebprojects is a collection of small, hands-on web apps built with only HTML, CSS, and plain JavaScript to teach core front-end skills without frameworks. Each project targets a concrete concept—DOM manipulation, events, localStorage, fetch, form validation, audio/video APIs, and more—so learners see ideas applied in working code. The codebases are intentionally compact and readable, making them easy to clone, tweak, and extend. Projects progress from beginner-friendly to moderately...
    Downloads: 0 This Week
  • 17
    Portable OpenAL Sound

    Concurrent, asynchronous sounds package for Ada apps.

    This is an Ada package that can play WAV files on Windows, OS X, and Linux, using Ada tasking and OpenAL libraries. It includes a partial Ada binding to OpenAL. It lets Ada apps asynchronously start and stop music/sound loops, initiate transient sounds, and play an unlimited number of sounds concurrently. It is suitable for any Ada application that needs music, sound loops or transient sound effects, e.g. games. The proper command to extract the...
    Downloads: 1 This Week
  • 18
    howler.js

    JavaScript audio library for the modern web

    howler.js is an audio library for the modern web. It defaults to Web Audio API and falls back to HTML5 Audio. This makes working with audio in JavaScript easy and reliable across all platforms. Additional information, live demos and a user showcase are available at howlerjs.com. Single API for all audio needs, defaults to Web Audio API and falls back to HTML5 Audio, handles edge cases and bugs across environments, supports all codecs for full cross-browser support, automatic caching for...
    Downloads: 0 This Week
  • 19
    Lullaby

    A collection of C++ libraries designed to help teams

    Lullaby is a modular collection of high-performance C++ libraries developed by Google for creating immersive virtual and augmented reality (VR/AR) experiences. It provides a flexible framework built around an Entity-Component-System (ECS) architecture, enabling developers to design efficient, scalable, and data-driven 3D applications. The framework includes tools and APIs for rendering full 3D environments, managing spatial audio, handling animations, and constructing interactive UI elements...
    Downloads: 2 This Week
  • 20
    AssetStudio

    AssetStudio is a tool for exploring, extracting and exporting assets

    AssetStudio is a cross-platform tool for exploring, extracting, and exporting assets from Unity games—supporting assetbundles and built-in assets. It handles textures, sprites, audio, meshes, shaders, and more, exporting to formats like png, bmp, mp3, wav. The original is archived (supports Unity ≤2022.1); forks like AssetStudio2024 add support for newer Unity versions and Lua asset decompiling.
    Downloads: 681 This Week
  • 21
    AugLy

    A data augmentations library for audio, image, text, and video

    AugLy is a data augmentations library that currently supports four modalities (audio, image, text & video) and over 100 augmentations. Each modality’s augmentations are contained within its own sub-library. These sub-libraries include both function-based and class-based transforms, composition operators, and have the option to provide metadata about the transform applied, including its intensity. AugLy is a great library to utilize for augmenting your data in model training, or to evaluate...
    Downloads: 0 This Week
  • 22
    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    SVoice is a PyTorch-based implementation of Facebook Research’s study on speaker voice separation as described in the paper “Voice Separation with an Unknown Number of Multiple Speakers.” This project presents a deep learning framework capable of separating mixed audio sequences where several people speak simultaneously, without prior knowledge of how many speakers are present. The model employs gated neural networks with recurrent processing blocks that disentangle voices over multiple...
    Downloads: 0 This Week
  • 23
    QB64

    BASIC for the modern era

    QB64 is a modern self-hosting BASIC compiler that brings classic QBasic/QuickBASIC programs into the modern era. It enables legacy code to run on today’s operating systems with minimal modifications, while also introducing new commands and libraries for multimedia, networking, and 64-bit computing. QB64 compiles to C++ and includes a built-in IDE, making it accessible for beginners, hobbyists, and retro programming enthusiasts. It aims to preserve the ease and nostalgia of BASIC while...
    Downloads: 112 This Week
  • 24
    AmplitudeJS

    Open Source HTML5 Web Audio Library

    Design your web audio player the way you want. AmplitudeJS is a lightweight JavaScript library that lets you, not the browser, control the design of the media controls in your webpage. No dependencies are required (jQuery is not needed). AmplitudeJS is available under the MIT License. Open-source software is not sustainable without proper financial support. Maintaining an open-source project takes a lot of effort and resources. If...
    Downloads: 0 This Week
  • 25
    CHOW Phaser

    Phaser effect based loosely on the Schulte Compact Phasing 'A'

    ChowPhaser is an open-source audio plugin that emulates the classic Schulte Compact Phasing 'A' effect. It offers a unique phasing effect with nonlinear feedback and modulation capabilities, suitable for various audio processing applications.
    Downloads: 0 This Week