Showing 588 open source projects for "audio"

  • 1
    Git Large File Storage

    Git extension for versioning large files

    An open source Git extension for versioning large files. Git Large File Storage (LFS) replaces large files such as audio samples, videos, datasets, and graphics with text pointers inside Git, while storing the file contents on a remote server like GitHub.com or GitHub Enterprise. Download and install the Git command line extension. Once downloaded and installed, set up Git LFS for your user account. In each Git repository where you want to use Git LFS, select the file types you'd like Git LFS to manage (or directly edit your .gitattributes). ...
    Downloads: 9 This Week
  • 2
    Silk.NET

    The high-speed OpenGL, OpenCL bindings library

    ...Silk.NET caters for anything you could need in the swift development of multimedia, graphics, and compute applications. Silk.NET is an all-in-one solution, complete with Graphics, Compute, Audio, Input, and Windowing.
    Downloads: 0 This Week
  • 3
    AMP

    Web component framework for building ads, emails, websites and more

    AMP is an open source web component framework that allows you to easily create user-first websites, ads, emails, stories and more. AMP creates fast, smooth-loading web pages that prioritize the user-experience, consistently providing a fast experience across all devices and platforms.
    Downloads: 4 This Week
  • 4
    DocsGPT

    Private AI platform for agents, enterprise search and RAG pipelines

    DocsGPT is an open-source AI platform for deploying private RAG pipelines, AI agents, and enterprise search on your own infrastructure. Connect any data source (PDFs, DOCX, CSV, Excel, HTML, audio, GitHub, databases, URLs) and get accurate, hallucination-free answers with source citations. Choose your LLM: OpenAI, Anthropic, Google Gemini, or local models. Works with Qdrant, MongoDB, Elasticsearch, and more. Deploy via Docker or Kubernetes with full data sovereignty. Build embeddable chat and search widgets, automate multi-step workflows with AI agents, and integrate via Slack, Telegram, Discord, or REST API. ...
    Downloads: 2 This Week
  • 5
    Membrane Core

    The core of Membrane Framework, multimedia processing framework

    membrane_core is the foundation of the Membrane multimedia framework for Elixir, providing the abstractions and runtime needed to build real-time audio and video pipelines. It models media processing as a graph of lightweight, supervised OTP processes—elements connected by links—so work is isolated, fault-tolerant, and easy to scale or reconfigure at runtime. The core defines a clear lifecycle and callback API for elements, plus concepts like buffers, events, and capabilities/format negotiation to keep components interoperable and type-safe. ...
    Downloads: 0 This Week
  • 6
    KeyboardKit

    KeyboardKit is a Swift library that helps you build custom keyboard

    ...KeyboardKit comes with support for 50+ locales, with more being added over time. KeyboardKit has tools for customizing everything from keys, key behavior, layout, callout actions, audio & haptic feedback, colors and styling, autocomplete, etc. The KeyboardKit app lets you create keyboards directly on your iPhone and iPad.
    Downloads: 0 This Week
  • 7
    mpvacious

    Adds mpv keybindings to create Anki cards from movies and TV shows

    mpvacious is your semi-automatic subs2srs for mpv. It supports multiple workflows and allows you to quickly create Anki cards while watching your favorite TV show.
    Downloads: 0 This Week
  • 8
    Mattermost

    Mattermost is an open source platform for secure collaboration

    ...Keep everyone on the same page while prototyping your latest innovation, or simply planning sprints or managing production incidents. Execute and automate workflows with flexible, custom integrations with popular technical tools like GitHub, GitLab and ServiceNow. Seamlessly switch from chat to audio calls and screenshare without switching tools. Checklist-based process playbooks with workflow orchestration in one unified location. Orchestrate and execute repeatable processes with better command. Accelerate productivity and reduce risk without sacrificing security.
    Downloads: 3 This Week
  • 9
    MLT Multimedia Framework

    MLT Multimedia Framework

    Author, manage, and run multitrack audio/video compositions. The engine of a non-linear video editor that can be used in all sorts of apps, not just desktop video editors. MLT is an open source multimedia framework, designed and developed for television broadcasting. It provides a toolkit for broadcasters, video editors, media players, transcoders, web streamers and many more types of applications.
    Downloads: 3 This Week
  • 10
    DocArray

    The data structure for multimodal data

    DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. Door to multimodal world: super-expressive data structure for representing complicated/mixed/nested text, image, video, audio, 3D mesh data. The foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt etc. ...
    Downloads: 0 This Week
  • 11
    txtai

    Build AI-powered semantic search applications

    ...Backed by state-of-the-art machine learning models, data is transformed into vector representations for search (also known as embeddings). Innovation is happening at a rapid pace: models can understand concepts in documents, audio, images, and more. Machine-learning pipelines to run extractive question-answering, zero-shot labeling, transcription, translation, summarization and text extraction. Cloud-native architecture that scales out with container orchestration systems (e.g. Kubernetes). Applications range from similarity search to complex NLP-driven data extractions to generate structured databases. ...
    Downloads: 0 This Week
  • 12
    pyglet

    pyglet is a cross-platform windowing and multimedia library for Python

    Pyglet is a cross-platform windowing and multimedia library for Python, intended for developing games and other visually rich applications. It supports windowing, input event handling, OpenGL graphics, loading images and videos, and playing sounds and music.
    Downloads: 0 This Week
  • 13
    ROC Engine

    Simple and light-weight 3D game engine

    ROC Engine is a simple and lightweight 3D game engine. Keyframed skeletal animation with hardware skinning. Audio playback with spatialization (Ogg, WAV, FLAC). TrueType font rendering with Unicode character support. All information about modeling and scripting is available on the wiki. Check out the demo scene resources for an example of engine usage.
    Downloads: 0 This Week
  • 14
    VERT.sh

    The next-generation file converter

    VERT is a modern, privacy-focused file conversion platform that leverages WebAssembly to perform conversions entirely on the user’s device rather than relying on cloud-based processing. Built with Svelte and TypeScript, it provides a clean and responsive interface for converting a wide variety of file types, including images, audio, video, and documents. One of its defining characteristics is its local-first approach, which eliminates the need to upload files to external servers, thereby improving both privacy and performance. The system supports over 250 file formats and includes customizable conversion settings, allowing users to fine-tune output parameters. ...
    Downloads: 3 This Week
  • 15
    AriaNg Native

    A better aria2 desktop frontend than AriaNg, containing all features

    ...Retry tasks. Adjust task order by dragging. More information on tasks (health percentage, client information of BT peers, etc.). Filter files by specified file types (videos, audio, pictures, documents, applications, archives, etc.) or file extensions. Tree view for multi-directory tasks. Download/upload speed chart for aria2 or a single task. Full support for aria2 settings.
    Downloads: 4 This Week
  • 16
    Venom

    Venom is the most complete JavaScript library for WhatsApp

    Venom is a high-performance system developed with JavaScript to create a bot for WhatsApp, with support for creating any interaction, such as customer service, media sending, sentence recognition based on artificial intelligence, and all types of design architecture for WhatsApp. It's a high-performance alternative API to WhatsApp; you can send text messages, files, images, videos, and more. Remember, the API was developed on a platform called RESTful Web services, providing interoperability...
    Downloads: 2 This Week
  • 17
    wa-automate-nodejs

    WhatsApp tool for chatbots with advanced features

    wa-automate-nodejs is the most advanced NodeJS library which provides a high-level API to control WA. Want to convert your WA account to an API instantly? You can now with the CLI. For more details see Easy API. After executing create() function, @open-wa/wa-automate will create an instance of WA web. If you are not logged in, it will print a QR code in the terminal. Scan it with your phone and you are ready to go! @open-wa/wa-automate will remember the session so there is no need to...
    Downloads: 0 This Week
  • 18
    YoutubeExplode

    Abstraction layer over YouTube's internal API

    ...Under the hood, the library parses raw page data and leverages reverse-engineered internal endpoints to obtain structured information and stream manifests. Developers can use it to access details such as titles, authors, durations, captions, and available media formats, as well as to download audio or video streams for further processing. The library is designed to be intuitive and cross-platform through .NET Standard compatibility, making it suitable for desktop tools, automation pipelines, and media utilities.
    Downloads: 1 This Week
  • 19
    Apple Music-like Lyrics

    A lyric player component library that aims to look similar to iPad

    ...The project demonstrates how modern web technologies can be used to replicate complex media UI patterns that were traditionally limited to native applications. It supports parsing lyric formats with timestamp metadata and synchronizing them with playback, enabling accurate alignment between audio and text. The implementation is highly customizable, allowing developers to adapt the visual style, animation timing, and layout to fit different applications.
    Downloads: 2 This Week
  • 20
    Cookbook (Google Gemini)

    Examples and guides for using the Gemini API

    The Gemini Cookbook is an official repository of examples and guides for using Google’s Gemini API. It provides a structured learning path with quick-start tutorials for beginners and practical examples for advanced users. The repository covers a wide range of Gemini capabilities, including text, images, video, speech, robotics, and multimodal interactions. It highlights newly introduced features such as Gemini 2.5 models (Flash and Pro), Gemini’s native image generation, Veo for video...
    Downloads: 2 This Week
  • 21
    Cinder

    Community-developed library for professional-quality creative coding

    ...To keep up-to-date with Cinder’s development, consider working from the GitHub repository directly. Cinder is a C++ library for programming with aesthetic intent, the sort of development often called creative coding. This includes domains like graphics, audio, video, and computational geometry. Cinder is cross-platform, with official support for macOS, Windows, Linux, iOS, and Windows UWP. Cinder is production-proven, powerful enough to be the primary tool for professionals, but still suitable for learning and experimentation. Full keyboard, mouse (including scroll wheel), window, and file drag and drop.
    Downloads: 2 This Week
  • 22
    Android Developer Roadmap

    The Android Developer Roadmap offers comprehensive learning paths

    Android Developer Roadmap suggests learning paths to understanding Android development. The Android Developer Roadmap is designed to give you a comprehensive understanding of the current Android ecosystem and provides suggested learning paths along the way to help you grasp the concepts. Each node indicates concepts of Android systems, Android SDK, and generally used libraries. It would be helpful to refer to the Android Developers Reference or GitHub for the specific terminologies. In...
    Downloads: 1 This Week
  • 23
    Wire Server

    Wire back-end services

    Modern day communication meets the most advanced security and superior user experience. Protect your privacy and data like never before. Secure messaging, conferencing, file-sharing and more through end-to-end encryption for cloud, private cloud and On-Premises. All messaging on Wire uses end-to-end encryption (E2EE), giving users a strong degree of privacy and security. Wire is 100% open source with its code available on GitHub, independently audited and ISO, CCPA, GDPR, SOX-compliant. Wire...
    Downloads: 1 This Week
  • 24
    Universal Windows Platform app samples

    API samples for the Universal Windows Platform.

    This repo contains the samples that demonstrate the API usage patterns for the Universal Windows Platform (UWP) in the Windows Software Development Kit (SDK) for Windows 10. These code samples were created with the Universal Windows Platform templates available in Visual Studio, and are designed to run on desktop, mobile, and future devices that support the Universal Windows Platform. If you are unfamiliar with Git and GitHub, you can download the entire collection as a ZIP file, but be sure...
    Downloads: 1 This Week
  • 25
    react-use

    Component for React

    Tracks device battery state. Plays audio and exposes its controls. Tracks the geolocation state of the user's device. Triggers a callback when the user clicks outside the target area. Tracks the mouse hover state of an element. Displays an element or video full-screen. Tracks the location hash value. Tracks whether the user is inactive. Tracks an HTML element's intersection. Synthesizes speech from a text string.
    Downloads: 1 This Week