Showing 467 open source projects for "audio capture usb"

View related business solutions
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 1
    USB_FORMAT

    USB_FORMAT

    Make Bootable USB Drive with MBR and 2 Partitions

    USB Format Tool - Make Bootable USB Drive with MBR and 2 Partitions. USB Multi-Boot of Linux ISO + Windows 10 VHD + Win10XPE in BIOS Or UEFI Secure mode.
    Downloads: 138 This Week
    Last Update:
    See Project
  • 2
    Kimi-Audio

    Kimi-Audio

    Audio foundation model excelling in audio understanding

    ...It uses a novel model setup that combines continuous acoustic features with discrete semantic tokens to richly capture sound and meaning across speech, music, and environmental audio.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 3
    OBS Studio

    OBS Studio

    Open source software for live streaming and recording

    OBS Studio, also known as Open Broadcaster Software, is a free and open source software program for live streaming and video recording. Features of the software include device/source capture, recording, encoding and broadcasting. Stream on Windows, Mac or Linux. This software is commonly used by video game streamers on the popular streaming platform Twitch.
    Downloads: 236 This Week
    Last Update:
    See Project
  • 4
    scrcpy

    scrcpy

    Display and control your Android device

    scrcpy is an application for displaying and controlling your Android device through USB connection (or over TCP/IP). It is cross-platform (GNU/Linux, macOS and Windows) and does not require any root access. scrcpy displays only the device screen but offers great performance (30~60fps) and quality (1920×1080 or above). It’s got low latency (35~70ms) and a very low startup time (less than a second). It offers plenty of great features and is non-intrusive, with nothing left installed inside...
    Downloads: 10,917 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 5
    SysDVR

    SysDVR

    Stream switch games to your PC via USB or network

    This is a sysmodule that allows capturing the running game output to a pc via USB or network connection. Stream and switch games to your PC via USB or network. Cross-platform, can stream to Windows, Mac and Linux. Stream via USB or Wifi. Video quality is fixed to 720p @ 30fps with h264 compression, this is a hardware limit. Audio quality is fixed to 16bit PCM @ 48kHz stereo. Not compressed. Very low latency with an optimal setup, most games are playable.
    Downloads: 118 This Week
    Last Update:
    See Project
  • 6
    audio_video_streaming

    audio_video_streaming

    Compilation of authoritative information on audio and video streaming

    ...It also documents widely used technologies such as RTP, RTMP, HLS, and WebRTC, helping users understand the full lifecycle of streaming pipelines from capture to rendering. In addition to educational materials, it references industry tools, libraries, and frameworks, making it a valuable roadmap for both beginners and advanced engineers. The project emphasizes structured learning paths and practical experimentation,
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    Step-Audio 2

    Step-Audio 2

    Multi-modal large language model designed for audio understanding

    ...It integrates a latent-space audio encoder, discrete acoustic tokens, and reinforcement-learning–based training (CoT + RL) to enhance its ability to capture and reproduce voice styles, intonations, and subtle vocal cues. Moreover, Step-Audio2 supports tool-calling and retrieval-augmented generation (RAG), allowing it to access external knowledge sources or audio/text databases, thus reducing hallucinations and improving coherence in complex dialogues.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    Kooha

    Kooha

    Elegantly record your screen

    Capture your screen in an intuitive and straightforward way without distractions. Kooha is a simple screen recorder with a minimal interface. You can simply click the record button without having to configure a bunch of settings.
    Downloads: 21 This Week
    Last Update:
    See Project
  • 9
    Snapcast

    Snapcast

    Synchronous multiroom audio player

    Snapcast is a multiroom client-server audio player, where all clients are time synchronized with the server to play perfectly synced audio. It's not a standalone player, but an extension that turns your existing audio player into a Sonos-like multiroom solution. Audio is captured by the server and routed to the connected clients. Several players can feed audio to the server in parallel and clients can be grouped to play the same audio stream. One of the most generic ways to use Snapcast is...
    Downloads: 39 This Week
    Last Update:
    See Project
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 10
    You-Get

    You-Get

    Dumb downloader that scrapes the web

    You-Get is a small command-line utility for downloading media (video, audio and images) from the Web when there are no other means to do so. It can download video and audio files from such popular web sites as YouTube, Twitter, Niconico, Vimeo, Flickr, Instagram and a whole lot more. You-Get is a great option for when you want to enjoy your favorite videos, audio or images from the internet without having to open any web browsers or get interrupted by ads. It’s also a good choice for...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    Screenity

    Screenity

    The most powerful screen recorder & annotation tool for Chrome

    Screenity is a feature-packed screen and camera recorder for Chrome. Annotate your screen to give feedback, emphasize your clicks, edit your recording, and much more. Make unlimited recordings of your tab, desktop, any application, and camera. Annotate by drawing anywhere on the screen, adding text, and creating arrows. Highlight your clicks, focus on your mouse, or hide it from the recording. Individual microphone and computer audio controls, push to talk, and more. Custom countdowns, show...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 12
    Note67

    Note67

    A private, local meeting notes assistant

    note67 is a private, local meeting notes assistant application that combines audio capture, transcription, and AI-powered summarization to help users document conversations and meetings on their own devices without relying on cloud services. Built with a cross-platform architecture using Rust (via Tauri) for backend logic and a TypeScript/React frontend, it prioritizes privacy by performing audio transcription locally with Whisper models and generating summaries with locally-hosted AI, eliminating the need to send sensitive meeting content to external servers. ...
    Downloads: 7 This Week
    Last Update:
    See Project
  • 13
    Cap

    Cap

    Open source Loom alternative. Beautiful, shareable screen recordings

    Cap is an open source screen recording software designed to provide a lightweight, powerful, and user-friendly alternative to existing tools. It enables users to effortlessly capture their screens, edit recordings, and share content securely within seconds. Cap supports cross-platform functionality, making it compatible with both macOS and Windows operating systems. The software offers high-quality video capture with synchronized audio, ensuring professional-grade recordings are suitable for tutorials, presentations, and educational content. ...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 14
    Clementine

    Clementine

    Modern music player and library organizer

    Clementine is a multi-platform music player and library organizer inspired by Amarok 1.4. It has a fast and easy-to-use interface, and allows you to search and play music from your local library, internet radio, songs you’ve uploaded to cloud storage, and more! Use this as an open source alternative to itunes or spotify.
    Downloads: 57 This Week
    Last Update:
    See Project
  • 15
    VibeVoice

    VibeVoice

    Open-source multi-speaker long-form text-to-speech model

    ...The model integrates a Qwen2.5-based large language model with a diffusion head to produce realistic acoustic details and capture conversational context. Training involved curriculum learning with increasing sequence lengths up to 65K tokens, allowing VibeVoice to handle very long dialogues effectively. Safety mechanisms include an audible disclaimer and imperceptible watermarking in all generated audio to mitigate misuse risks.
    Downloads: 31 This Week
    Last Update:
    See Project
  • 16
    Scanopy

    Scanopy

    Clean network diagrams, One-time setup, zero upkeep

    Scanopy is a powerful multi-modal data capture and analysis toolkit that enables users to collect, process, and visualize structured and unstructured information from a variety of sources in a flexible pipeline. It is built to handle complex scanning tasks — such as OCR, document analysis, audio transcription, network data capture, and image extraction — while providing unified APIs and workflows that make managing heterogeneous data sources seamless.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 17
    Phoniebox

    Phoniebox

    A Raspberry Pi jukebox, playing local music, podcasts, web radio

    Phoniebox is a contactless jukebox for the Raspberry Pi, that plays audio files, playlists, podcasts, web streams, and Spotify triggered by RFID cards. All plug and play via USB, no soldering iron needed. It also features GPIO button control support.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    AudioNotes

    AudioNotes

    Extract audio and video content and organize it into a Markdown note

    AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    OpenCorePkg

    OpenCorePkg

    OpenCore bootloader

    OpenCorePkg is an open-source, modular UEFI (Unified Extensible Firmware Interface) bootloader and development framework, primarily designed to enable macOS booting on non-Apple hardware (Hackintosh). It includes Apple-specific UEFI drivers, utilities for macOS installation support, and shared libraries used across Acidanthera projects. Apple disk image loading support. Apple keyboard input aggregation. Apple PE image signature verification. Apple UEFI secure boot supplemental code. Audio...
    Downloads: 181 This Week
    Last Update:
    See Project
  • 20
    ioquake3

    ioquake3

    The ioquake3 community effort to continue supporting/developing id's

    ...It is designed to let players run Quake 3, its expansion Team Arena, and community mods on contemporary systems while also serving as a solid base for new projects. The engine modernizes the original codebase with a CMake build system, an SDL2 backend for cross-platform windowing and input, and OpenAL sound for better audio quality and multi-speaker setups. It adds numerous quality-of-life improvements such as VoIP support, AVI demo capture, improved console completion and history, and optional Ogg Vorbis support. ioquake3 also improves portability and maintainability by supporting x86_64 on Linux, MinGW builds on Windows, and various other operating systems, and even provides web support via Emscripten.
    Downloads: 6 This Week
    Last Update:
    See Project
  • 21
    Live API Web Console

    Live API Web Console

    A react-based starter app for using the Live API over websockets

    Live API Web Console is a React starter that demonstrates how to use Gemini’s Live API over WebSockets to build real-time, multimodal experiences. The app includes modules for streaming audio playback, recording user media from the microphone, webcam, or even screen capture, and it surfaces a unified event log so you can debug the session as it flows. Configuration lives in a simple .env file and the project boots with standard web tooling, letting you experiment quickly with models, system prompts, and tool declarations. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    DistroAV

    DistroAV

    DistroAV (formerly OBS-NDI): NDI integration for OBS Studio

    DistroAV is an open-source integration plugin for OBS Studio that provides Network Device Interface (NDI) support so users can send and receive live audio and video over IP networks directly within OBS, formerly known as the OBS-NDI project before being renamed. By implementing NDI input sources, dedicated output transports, and special filter modes, it allows creativity-oriented workflows such as capturing remote cameras, sharing scenes between machines, or distributing live feeds without capture cards or physical cabling. ...
    Downloads: 80 This Week
    Last Update:
    See Project
  • 23
    MuseGAN

    MuseGAN

    An AI for Music Generation

    ...The system focuses specifically on generating multi-track polyphonic music, meaning that it can simultaneously produce multiple instrument parts such as drums, bass, piano, guitar, and strings. Instead of generating raw audio, the model operates on piano-roll representations of music, which encode notes as time-pitch matrices for each instrument track. This representation allows the neural network to capture rhythmic patterns, harmonic relationships, and structural dependencies across instruments. The architecture is based on convolutional GAN models that learn temporal musical structure and inter-track relationships from training data. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    MediaDevices

    MediaDevices

    Go implementation of the MediaDevices API

    mediadevices is a Go library developed by the Pion WebRTC team that enables real-time access to audio and video devices for building native Go applications involving media streaming and conferencing. It provides a cross-platform, unified API for capturing and manipulating media streams and is often used in combination with Pion WebRTC for peer-to-peer communications. Its support for device enumeration, media constraints, and frame processing makes it a powerful building block for custom...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    clone-voice

    clone-voice

    A sound cloning tool with a web interface, using your voice

    ...It does not require an NVIDIA GPU to run basic tasks, although GPU acceleration can be used when available, making it accessible on modest machines. The tool supports around sixteen languages, including Chinese, English, Japanese, Korean, French, German, Italian, and others, and can capture reference voices directly from a microphone or from uploaded audio.
    Downloads: 23 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB