Showing 452 open source projects for "audio"

  • 1
    Apple Music-like Lyrics

    A lyric player component library that aims to look similar to iPad

    ...The project demonstrates how modern web technologies can be used to replicate complex media UI patterns that were traditionally limited to native applications. It supports parsing lyric formats with timestamp metadata and synchronizing them with playback, enabling accurate alignment between audio and text. The implementation is highly customizable, allowing developers to adapt the visual style, animation timing, and layout to fit different applications.
    Downloads: 5 This Week
  • 2
    DocArray

    The data structure for multimodal data

    DocArray is a library for nested, unstructured, multimodal data in transit, including text, image, audio, video, 3D mesh, etc. It allows deep-learning engineers to efficiently process, embed, search, recommend, store, and transfer multimodal data with a Pythonic API. It offers an expressive data structure for representing complicated, mixed, or nested text, image, video, audio, and 3D mesh data, and serves as the foundation data structure of Jina, CLIP-as-service, DALL·E Flow, DiscoArt, etc. ...
    Downloads: 0 This Week
  • 3
    pyglet

    pyglet is a cross-platform windowing and multimedia library for Python

    Pyglet is a cross-platform windowing and multimedia library for Python, intended for developing games and other visually rich applications. It supports windowing, input event handling, OpenGL graphics, loading images and videos, and playing sounds and music.
    Downloads: 0 This Week
  • 4
    VERT.sh

    The next-generation file converter

    VERT is a modern, privacy-focused file conversion platform that leverages WebAssembly to perform conversions entirely on the user’s device rather than relying on cloud-based processing. Built with Svelte and TypeScript, it provides a clean and responsive interface for converting a wide variety of file types, including images, audio, video, and documents. One of its defining characteristics is its local-first approach, which eliminates the need to upload files to external servers, thereby improving both privacy and performance. The system supports over 250 file formats and includes customizable conversion settings, allowing users to fine-tune output parameters. ...
    Downloads: 3 This Week
  • 5
    AriaNg Native

    A better aria2 desktop frontend than AriaNg, containing all features

    ...Retry tasks. Adjust task order by dragging. More information on tasks (health percentage, client information of bt peers, etc.) Filter files by specified file types (videos, audio, pictures, documents, applications, archives, etc.) or file extensions. Tree view for the multi-directory task. Download/upload speed chart for aria2 or single task. Full support for aria2 settings.
    Downloads: 3 This Week
  • 6
    Android Developer Roadmap

    The Android Developer Roadmap offers comprehensive learning paths

    Android Developer Roadmap suggests learning paths for understanding Android development. It is designed to give you a comprehensive understanding of the current Android ecosystem and to help you grasp the concepts along the way. Each node represents a concept from Android systems, the Android SDK, or commonly used libraries. For specific terminology, refer to the Android Developers Reference or GitHub. In...
    Downloads: 2 This Week
  • 7
    wa-automate-nodejs

    WhatsApp tool for chatbots with advanced features

    wa-automate-nodejs is a NodeJS library that provides a high-level API to control WA. To convert your WA account to an API, use the CLI; for more details see Easy API. After the create() function is executed, @open-wa/wa-automate creates an instance of WA web. If you are not logged in, it prints a QR code in the terminal; scan it with your phone and you are ready to go. @open-wa/wa-automate will remember the session so there is no need to...
    Downloads: 0 This Week
  • 8
    YoutubeExplode

    Abstraction layer over YouTube's internal API

    ...Under the hood, the library parses raw page data and leverages reverse-engineered internal endpoints to obtain structured information and stream manifests. Developers can use it to access details such as titles, authors, durations, captions, and available media formats, as well as to download audio or video streams for further processing. The library is designed to be intuitive and cross-platform through .NET Standard compatibility, making it suitable for desktop tools, automation pipelines, and media utilities.
    Downloads: 1 This Week
  • 9
    Laravel FFMpeg

    This package provides an integration with FFmpeg for Laravel

    ...You can easily add a watermark using the addWatermark method. With the WatermarkFactory, you can open your watermark file from a specific disk, just like opening an audio or video file. When you omit the fromDisk method, it uses the default disk specified in the filesystems.php configuration file.
    Downloads: 1 This Week
  • 10
    Mattermost

    Mattermost is an open source platform for secure collaboration

    ...Keep everyone on the same page while prototyping your latest innovation, or simply planning sprints or managing production incidents. Execute and automate workflows with flexible, custom integrations with popular technical tools like GitHub, GitLab and ServiceNow. Seamlessly switch from chat to audio calls and screenshare without switching tools. Checklist-based process playbooks with workflow orchestration in one unified location. Orchestrate and execute repeatable processes with better command. Accelerate productivity and reduce risk without sacrificing security.
    Downloads: 1 This Week
  • 11
    Cookbook (Google Gemini)

    Examples and guides for using the Gemini API

    The Gemini Cookbook is an official repository of examples and guides for using Google’s Gemini API. It provides a structured learning path with quick-start tutorials for beginners and practical examples for advanced users. The repository covers a wide range of Gemini capabilities, including text, images, video, speech, robotics, and multimodal interactions. It highlights newly introduced features such as Gemini 2.5 models (Flash and Pro), Gemini’s native image generation, Veo for video...
    Downloads: 2 This Week
  • 12
    Cinder

    Community-developed library for professional-quality creative coding

    ...To keep up-to-date with Cinder’s development, consider working from the github repository directly. Cinder is a C++ library for programming with aesthetic intent - the sort of development often called creative coding. This includes domains like graphics, audio, video, and computational geometry. Cinder is cross-platform, with official support for macOS, Windows, Linux, iOS, and Windows UWP. Cinder is production-proven, powerful enough to be the primary tool for professionals, but still suitable for learning and experimentation. Full keyboard, mouse (including scroll wheel), window, and file drag and drop.
    Downloads: 2 This Week
  • 13
    Wire Server

    Wire back-end services

    Modern day communication meets the most advanced security and superior user experience. Protect your privacy and data like never before. Secure messaging, conferencing, file-sharing and more through end-to-end encryption for cloud, private cloud and On-Premises. All messaging on Wire uses end-to-end encryption (E2EE), giving users a strong degree of privacy and security. Wire is 100% open source with its code available on GitHub, independently audited and ISO, CCPA, GDPR, SOX-compliant. Wire...
    Downloads: 1 This Week
  • 14
    react-use

    Component for React

    Tracks device battery state. Plays audio and exposes its controls. Tracks geo location state of user's device. Triggers callback when user clicks outside target area. Tracks mouse hover state of some element. Display an element or video full-screen. Tracks location hash value. Tracks whether user is being inactive. Tracks an HTML element's intersection. Synthesizes speech from a text string.
    Downloads: 1 This Week
  • 15
    Daptin

    Daptin - Backend As A Service - GraphQL/JSON-API Headless CMS

    Take the API overhead out of your brain with Daptin. Complete granular access to your data over the network. A single binary to power up your application. Database-backed persistence, 3NF normalized tables. JSON API/GraphQL for CRUD APIs. User and group management and access control. Social login with OAuth: tested with Google, GitHub, LinkedIn. Actions for abstracting out business flows. Extensive state tracking APIs. Enable Data Auditing from a single toggle. Synchronous Data Exchange with...
    Downloads: 2 This Week
  • 16
    DALI

    A GPU-accelerated library containing highly optimized building blocks

    The NVIDIA Data Loading Library (DALI) is a library for data loading and pre-processing to accelerate deep learning applications. It provides a collection of highly optimized building blocks for loading and processing image, video and audio data. It can be used as a portable drop-in replacement for built-in data loaders and data iterators in popular deep learning frameworks. Deep learning applications require complex, multi-stage data processing pipelines that include loading, decoding, cropping, resizing, and many other augmentations. These data processing pipelines, which are currently executed on the CPU, have become a bottleneck, limiting the performance and scalability of training and inference. ...
    Downloads: 1 This Week
  • 17
    Networked-Aframe

    Web framework for building multi-user virtual reality experiences

    Multi-user VR on the Web. A framework for writing multi-user VR apps in HTML and JS. Built on top of A-Frame. Follow the NAF Getting Started tutorial to build your own example from scratch, including setting up a local server. Networked-Aframe works by syncing entities and their components to connected users. To connect to a room you need to add the networked-scene component to the a-scene element. For an entity to be synced, add the networked component to it. By default the position and...
    Downloads: 2 This Week
  • 18
    Changelog.com

    Changelog makes world-class developer pods

    ...Because the repository is open, developers can study its architecture to learn how Phoenix is used in practice for a high-traffic, media-centric website. It also showcases integration with external services for things like audio hosting, search, and analytics, making it an instructive case study for full-stack Elixir development.
    Downloads: 0 This Week
  • 19
    Node.js Telegram Bot API

    Telegram Bot API for NodeJS

    TelegramBot is an EventEmitter that emits several events. The message event is emitted when a new incoming Message of any kind is received. Depending on the properties of the Message, one of these events may also be emitted: text, audio, document, photo, sticker, video, voice, contact, location, new_chat_members, left_chat_member, new_chat_title, new_chat_photo, delete_chat_photo, group_chat_created, game, pinned_message, poll, dice, migrate_from_chat_id, migrate_to_chat_id, channel_chat_created, supergroup_chat_created, successful_payment, invoice, video_note, etc. ...
    Downloads: 1 This Week
  • 20
    Static Web Audio Player Maker

    Create an HTML audio player from your music directories

    On Linux and Windows
    Downloads: 1 This Week
  • 21
    Multimodal

    TorchMultimodal is a PyTorch library

    ...The library provides modular building blocks such as encoders, fusion modules, loss functions, and transformations that support combining modalities (vision, text, audio, etc.) in unified architectures. It includes a collection of ready model classes—like ALBEF, CLIP, BLIP-2, COCA, FLAVA, MDETR, and Omnivore—that serve as reference implementations you can adopt or adapt. The design emphasizes composability: you can mix and match encoder, fusion, and decoder components rather than starting from monolithic models. ...
    Downloads: 0 This Week
  • 22
    Ruma

    A set of Rust crates for interacting with the Matrix chat network

    Matrix is an open specification for an online communication protocol. It includes all the features you'd expect from a modern chat platform including instant messaging, group chats, audio and video calls, searchable message history, synchronization across all your devices, and end-to-end encryption. Matrix is federated, so no single company controls the system or your data. You can use an existing server you trust or run your own, and the servers synchronize messages seamlessly. Learn more in the Introduction to Matrix. ...
    Downloads: 0 This Week
  • 23
    DiscordGo

    (Golang) Go bindings for Discord

    DiscordGo is a Go package that provides low-level bindings to the Discord chat client API. DiscordGo has nearly complete support for all of the Discord API endpoints, websocket interface, and voice interface. If you would like to help the DiscordGo package, please use this link to add the official DiscordGo test bot dgo to your server. This provides indispensable help to this project. Construct a new Discord client which can be used to access the variety of Discord API functions and to set...
    Downloads: 0 This Week
  • 24
    Jina

    Build cross-modal and multimodal applications on the cloud

    ...Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP, GraphQL protocols with TLS. Intuitive design pattern for high-performance microservices. Seamless Docker container integration: sharing, exploring, sandboxing, versioning and dependency control via Jina Hub. Fast deployment to Kubernetes, Docker Compose and Jina Cloud. ...
    Downloads: 0 This Week
  • 25
    Flutter Rust Bridge

    Rust binding generator, feature-rich, but seamless and simple

    ...The project supports passing complex types, handling async operations and streams, and integrating with Flutter across mobile and desktop targets. By leaning on Rust’s memory safety and zero-cost abstractions, it enables compute-heavy tasks—parsing, crypto, image/audio processing, and more—without sacrificing Flutter’s developer experience. Build scripts and templates streamline packaging and distribution so the Rust side fits cleanly into CI and multi-platform releases. In practice, teams gain a maintainable way to share one performant Rust core across multiple Flutter apps while keeping the UI reactive and fast.
    Downloads: 0 This Week