Search Results for "multi channel audio capture"

Showing 61 open source projects for "multi channel audio capture"

  • 1
    audio_video_streaming

    Compilation of authoritative information on audio and video streaming

    audio_video_streaming is a comprehensive curated repository that aggregates hundreds of resources related to audio and video streaming technologies, including articles, research papers, protocols, and practical projects. It serves as a learning hub for developers interested in multimedia systems, covering topics such as encoding, decoding, transmission protocols, and real-time communication frameworks. The repository includes example implementations like multi-user video chat systems, WebRTC demos, and cross-platform media players to provide hands-on learning opportunities. ...
    Downloads: 0 This Week
  • 2
    Step-Audio 2

    Multi-modal large language model designed for audio understanding

    Step-Audio2 is an advanced, end-to-end multimodal large language model designed for high-fidelity audio understanding and natural speech conversation: unlike many pipelines that separate speech recognition, processing, and synthesis, Step-Audio2 processes raw audio, reasons about semantic and paralinguistic content (like emotion, speaker characteristics, non-verbal cues), and can generate contextually appropriate responses — including potentially generating or transforming audio output. It...
    Downloads: 0 This Week
  • 3
    LiveKit

    End-to-end stack for WebRTC. SFU media server and SDKs

    LiveKit is an open-source project that provides a scalable, multi-user conferencing system based on WebRTC, designed to offer real-time video, audio, and data capabilities for developers.
    Downloads: 10 This Week
  • 4
    Scanopy

    Clean network diagrams, One-time setup, zero upkeep

    Scanopy is a powerful multi-modal data capture and analysis toolkit that enables users to collect, process, and visualize structured and unstructured information from a variety of sources in a flexible pipeline. It is built to handle complex scanning tasks — such as OCR, document analysis, audio transcription, network data capture, and image extraction — while providing unified APIs and workflows that make managing heterogeneous data sources seamless. ...
    Downloads: 5 This Week
  • 5
    VibeVoice

    Open-source multi-speaker long-form text-to-speech model

    VibeVoice-1.5B is Microsoft’s frontier open-source text-to-speech (TTS) model designed for generating expressive, long-form, multi-speaker conversational audio such as podcasts. Unlike traditional TTS systems, it excels in scalability, speaker consistency, and natural turn-taking for up to 90 minutes of continuous speech with as many as four distinct speakers. A key innovation is its use of continuous acoustic and semantic speech tokenizers operating at an ultra-low frame rate of 7.5 Hz, enabling high audio fidelity with efficient processing of long sequences. ...
    Downloads: 31 This Week
  • 6
    ioquake3

    The ioquake3 community effort to continue supporting/developing id's

    ...It is designed to let players run Quake 3, its expansion Team Arena, and community mods on contemporary systems while also serving as a solid base for new projects. The engine modernizes the original codebase with a CMake build system, an SDL2 backend for cross-platform windowing and input, and OpenAL sound for better audio quality and multi-speaker setups. It adds numerous quality-of-life improvements such as VoIP support, AVI demo capture, improved console completion and history, and optional Ogg Vorbis support. ioquake3 also improves portability and maintainability by supporting x86_64 on Linux, MinGW builds on Windows, and various other operating systems, and even provides web support via Emscripten.
    Downloads: 7 This Week
  • 7
    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    Supports microphone recording and real-time processing in most mobile and PC browsers that implement getUserMedia, mainly Chrome, Firefox, Safari, iOS 14.3+, Android WebView, the Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView), uni-app (App, H5), and the built-in browsers of most Android phones updated after 2021. Not supported: UC-based kernels (typically Alipay), the built-in browsers of most older domestic mobile phones that have not been updated, and any other...
    Downloads: 0 This Week
  • 8
    DistroAV

    DistroAV (formerly OBS-NDI): NDI integration for OBS Studio

    DistroAV, formerly known as the OBS-NDI project, is an open-source integration plugin for OBS Studio that provides Network Device Interface (NDI) support, so users can send and receive live audio and video over IP networks directly within OBS. By implementing NDI input sources, dedicated output transports, and special filter modes, it enables creative workflows such as capturing remote cameras, sharing scenes between machines, or distributing live feeds without capture cards or physical cabling. ...
    Downloads: 86 This Week
  • 9
    MuseGAN

    An AI for Music Generation

    MuseGAN is a deep learning research project designed to generate symbolic music using generative adversarial networks. The system focuses specifically on generating multi-track polyphonic music, meaning that it can simultaneously produce multiple instrument parts such as drums, bass, piano, guitar, and strings. Instead of generating raw audio, the model operates on piano-roll representations of music, which encode notes as time-pitch matrices for each instrument track. This representation allows the neural network to capture rhythmic patterns, harmonic relationships, and structural dependencies across instruments. ...
    Downloads: 1 This Week
  • 10
    airgeddon

    This is a multi-use bash script for Linux systems

    airgeddon is an actively developed project that grows day by day. Interface mode switcher (Monitor-Managed) that keeps the selection even when the interface name changes. DoS over wireless networks using different methods (mdk3, mdk4, aireplay-ng). "DoS Pursuit mode" is available to avoid AP channel hopping (also available for DoS performed during Evil Twin attacks). Full support for the 2.4 GHz and 5 GHz bands. Assisted capture of handshake files and PMKIDs for WPA/WPA2 personal networks. Cleaning and optimizing Handshake captured...
    Downloads: 36 This Week
  • 11
    annyang!

    Speech recognition for your site

    annyang is a tiny JavaScript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2 KB, and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one-word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays...
    Downloads: 1 This Week
  • 12
    yt-dlp

    A youtube-dl fork with additional features and fixes

    yt-dlp is a youtube-dl fork based on the now inactive youtube-dlc. The main focus of this project is adding new features and patches while also keeping up to date with the original project.
    Downloads: 530 This Week
  • 13
    comfyui-mixlab-nodes

    Workflow and speech recognition app

    comfyui-mixlab-nodes is a large collection of custom nodes for ComfyUI that turns workflows into interactive apps and adds real-time multimedia, LLM, and TTS capabilities. It introduces a “Workflow-to-APP” concept, where a ComfyUI graph can be transformed into a Web App through an AppInfo node, complete with categories, batch prompts, and editable configurations. The project also brings Real-time Design features like screen capture and floating video nodes, enabling creative pipelines that...
    Downloads: 3 This Week
  • 14

    Audio Mechanica 2025 - Sony PSP

    Audio Effects Software for Sony PSP

    ...It's little more than an interesting demo, however: the lag between filling the audio input buffer and populating the output buffer adds enough delay to put anyone off using it for anything rhythmic in real time. It could potentially serve as a wet delay channel in a multi-amp rig, or inline for recording guitar to a PC.
    Downloads: 0 This Week
  • 15
    Shotcut

    Shotcut is a free, open source, cross-platform video editor.

    Shotcut is a free, open source, cross-platform video editor for Windows, Mac and Linux. Major features include support for a wide range of formats; native timeline editing with no import required; Blackmagic Design support for input and preview monitoring; and resolution support up to 4K. Copyright © 2011-2023 by Meltytech, LLC. Shotcut is a trademark of Meltytech, LLC.
    Downloads: 50,160 This Week
  • 16
    AzioVoice Recorder

    AzioVoice Recorder is an app designed to record audio

    Description

    Starting from version 1.3.1.0, the project has been renamed to AzioVoice Recorder and is officially published in the Microsoft Store at: https://apps.microsoft.com/detail/9PP795T0KSFP

    The app supports recording in WAV format with adjustable settings like sample rate, bit depth, and channels. It includes several audio filters for basic enhancement and features a simple file explorer for playback and management. Themes can be switched between dark and light, with settings saved persistently.

    Core Functionality
      * Multi-Device Support: Record from any available audio input device
      * High-Quality Recording: WAV format output with configurable audio settings
      * File Management: Browse, play, and delete recordings with metadata display
      * Customizable Settings: Configure sample rate, bit depth, and channel count
    Downloads: 0 This Week
  • 17
    OmniPull

    Just pull anything

    OmniPull is a powerful, cross-platform download manager built with Python and PySide6. It provides a modern, intuitive interface for managing downloads with advanced features like multi-threading, queue management, and media extraction.
    Downloads: 5 This Week
  • 18
    lamco-rdp-server

    A full-featured Wayland-native RDP server for Linux written in Rust.

    lamco-rdp-server is a native Wayland RDP server for Linux desktops. It provides remote desktop access using the standard RDP protocol, so you can connect from any RDP client: Windows Remote Desktop, FreeRDP, Remmina, or others. It supports multiple screen capture and input backends:
      * xdg-desktop-portal: works with GNOME, KDE Plasma, and other portal-compatible compositors
      * Native wlroots protocols: works with Sway, Hyprland, and other wlroots-based compositors
    The server detects...
    Downloads: 2 This Week
  • 19
    Cynthia v1.0.6187

    Reliably play MIDI music files from a folder or ".m3u" playlist

    Reliably play MIDI music files from a folder or ".m3u" playlist. Adjust playback speed, volume, and output device on the fly during playback. A large playback progress bar makes jumping forward and backward in time a breeze with just a single click or tap. Supports ".mid", ".midi" and ".rmi" files in format 0 (single track) and format 1 (multi-track). Comes complete with 24 sample MIDI files ready to play.
    Downloads: 5 This Week
  • 20
    FileCentipede

    File centipede is an internet file download manager

    File centipede is an internet file download/upload manager, Torrent client, WebDAV client, FTP client, and SSH client. It's fast, customizable, user-friendly, supports multiple protocols, and is free with no ads. It also contains many useful auxiliary tools such as an HTTP requester, file merge, and encoders. With the browser integration, you can download audio and videos from websites, even encrypted videos. File centipede makes everything easy and efficient; it's the best internet download...
    Downloads: 47 This Week
  • 21
    Image16

    Image viewer/editor specialized in 16-bit astrophotography processing

    Opens/saves TIF, FITS, PNG, XISF, PXM, RAW, JPG, XCF, PSD, WEBP. AsinH, Sqrt, Ln, Exp and Sqr stretching, HSL and RGB adjusting, Levels and Curves adjusting and Background Extraction for Color Balance, Flatness and Gradient Correction. Arithmetic between images. Resizing and cropping. Color Balance based on background, selection or stars B-V (after platesolving). Debayering/Bayering. Inversion, mirroring, rotation, barrel/pincushion correction. RGB alignment/shifting. Erode, Dilate, ...
    Downloads: 12 This Week
  • 22
    OpenClinic GA

    Open Source Integrated Hospital Information Management System

    OpenClinic GA is an open source integrated hospital information management system covering management of administrative, financial, clinical, lab, x-ray, pharmacy, meals distribution and other data. Extensive statistical and reporting capabilities. OpenClinic GA and OpenClinic GMAO are owned by Frank Verbeke, MD, PhD at Post-Factum BV (https://www.post-factum.be)
    Downloads: 135 This Week
  • 23
    Demucs

    Code for the paper Hybrid Spectrogram and Waveform Source Separation

    ...Demucs supports GPU-accelerated inference and can process multi-channel audio with chunked streaming for real-time or batch operation. It also provides training scripts and utilities to fine-tune on custom datasets, along with remixing and enhancement tools.
    Downloads: 93 This Week
  • 24
    Advision

    Advision is a cloud cross-platform Digital Signage software.

    Advision is a cloud cross-platform Digital Signage software. We help offline business and advertising companies to implement reliable Digital Signage infrastructure using our own software and hardware solutions. Advision is used for DOOH advertising networks, digital signage at points of sale (retail), in the fast food industry for digital menus, and corporate TV.
    Downloads: 2 This Week
  • 25
    HiFi-GAN

    Generative Adversarial Networks for Efficient and High Fidelity Speech

    HiFi-GAN is a GAN-based neural vocoder designed to generate high-fidelity speech waveforms from mel spectrograms with exceptional efficiency. It introduces a generator architecture tailored to model the periodic structure of speech and a set of discriminators that focus on different scales and periods of the waveform to better capture naturalness. The model targets a sweet spot between sample quality and generation speed, outperforming many previous GAN vocoders while being far faster than...
    Downloads: 0 This Week