Showing 191 open source projects for "audio"

  • 1
    SenseVoice

    Multilingual speech recognition and audio understanding model

    SenseVoice is a speech foundation model designed to perform multiple voice understanding tasks from audio input. It provides capabilities such as automatic speech recognition, spoken language identification, speech emotion recognition, and audio event detection within a single system. SenseVoice is trained on more than 400,000 hours of speech data and supports over 50 languages for multilingual recognition tasks. It is built to achieve high transcription accuracy while maintaining efficient inference performance. ...
    Downloads: 11 This Week
  • 2
    YuE

    Open source AI model for generating full songs from lyrics prompts

    ...It focuses on transforming text inputs such as lyrics and genre prompts into complete musical compositions that include both vocal and instrumental tracks. Unlike many shorter audio generators, the model is capable of producing songs that last several minutes while maintaining coherent musical structure and alignment with the provided lyrics. YuE introduces a family of models built on large language model architectures that process music generation as a sequence prediction task. YuE also incorporates techniques such as track-decoupled prediction and progressive conditioning to help manage complex audio signals and maintain consistency throughout long compositions. ...
    Downloads: 3 This Week
  • 3
    VMZ (Video Model Zoo)

    VMZ: Model Zoo for Video Modeling

    The codebase was designed to help researchers and practitioners quickly reproduce FAIR’s results and leverage robust pre-trained backbones for downstream tasks. It also integrates Gradient Blending, an audio-visual modeling method that fuses modalities effectively (available in the Caffe2 implementation). Although VMZ is now archived and no longer actively maintained, it remains a valuable reference for understanding early large-scale video model training, transfer learning, and multimodal integration strategies that influenced modern architectures like SlowFast and X3D.
    Downloads: 0 This Week
  • 4
    Homebridge Raspberry Pi Image

    Homebridge Raspberry Pi Image based on Raspberry Pi OS Lite

    This project provides a free Raspbian-based Raspberry Pi image with Homebridge and Homebridge Config UI X pre-installed. This image also provides a command called hb-config which helps you keep Node.js up-to-date, perform maintenance on your Homebridge server, and install additional optional software such as Pi Hole and deCONZ. The Homebridge service is installed using the method described in the Raspberry Pi Installation Guide on the Homebridge project wiki. The easiest way to flash the...
    Downloads: 7 This Week
  • 5
    LiveBeats
    live_beats is an example Phoenix LiveView application developed by Fly.io to demonstrate real-time features and deployment workflows in modern Elixir web applications. It showcases how developers can build interactive, stateful web experiences without relying heavily on client-side JavaScript frameworks. The project streams live audio updates and visual feedback directly through WebSocket connections managed by LiveView, highlighting the power of Elixir’s concurrent architecture. In addition to real-time interactivity, live_beats demonstrates effective project structuring, deployment strategies, and integration with Fly.io’s cloud hosting platform. It serves as both a learning tool and a reference for best practices in LiveView development, covering topics such as state management, event handling, and distributed scalability. ...
    Downloads: 2 This Week
  • 6
    Rig

    Rust framework for building modular and scalable LLM-powered apps

    ...Rig includes built-in support for agent workflows, allowing systems to perform multi-turn reasoning, tool calling, and retrieval-based tasks within structured pipelines. It also supports capabilities such as text generation, embeddings, transcription, image generation, and audio generation depending on the provider used. Developers can integrate language models into their software with minimal boilerplate while maintaining flexibility for complex AI workflows.
    Downloads: 2 This Week
  • 7
    Distrobox

    Use any Linux distribution inside your terminal

    ...The created container will be tightly integrated with the host, allowing sharing of the user's HOME directory, external storage, external USB devices, graphical apps (X11/Wayland), and audio. Simply put, it's a fancy wrapper around podman, docker, or lilipod to create and start containers highly integrated with the host. The distrobox environment is based on an OCI image. This image is used to create a container that seamlessly integrates with the rest of the operating system by providing access to the user's home directory, the Wayland and X11 sockets, networking, removable devices (like USB sticks), systemd journal, SSH agent, D-Bus, ulimits, /dev and the udev database, etc.
    Downloads: 2 This Week
  • 8
    Acontext

    Context data platform for building observable, self-learning AI agents

    Acontext is a cloud-native context data platform designed to support the development and operation of advanced AI agents. It provides a unified system to store and manage contexts, multimodal messages, artifacts, and task workflows, enabling developers to engineer context effectively for their agent products. The platform observes agent tasks and user feedback in real time, offering robust observability into workflows and helping teams understand how agents perform over time. Acontext also...
    Downloads: 0 This Week
  • 9
    CC2.TV / CC2 - Audio- und TV-Datenbank

    Metadata database application for the audio and TV broadcasts of CC2.TV

    This program provides a metadata database application for the audio and video broadcasts of CC2.TV on GNU/Linux systems. It lets you search, manage, and play the extensive content of the CC2.TV audiocast and videocast. The goal is to make the more than 3,000 audiocast topics and more than 1,000 videocast topics, which focus on computing, technology, and social issues, conveniently accessible.
    Downloads: 0 This Week
  • 10
    eduMEET WebRTC meeting service

    Multiparty web-meetings using mediasoup and WebRTC

    edumeet, multiparty web meetings using mediasoup and WebRTC.
    Downloads: 0 This Week
  • 11
    Dodger-Tools

    BASH tools and Debian packages for Linux users and administrators

    The Dodger-Tools are a constantly growing collection of BASH tools and Debian packages for Linux users and administrators.
    Downloads: 3 This Week
  • 12

    Esegui SB

    Flexible video encoding script supporting multiple formats and codecs.

    Audio Track Extraction: Extract and encode audio tracks independently. Audio normalization: Adjusts audio levels to ensure consistent volume across tracks. The script leverages FFmpeg's built-in libraries and tools for these functions. For detailed instructions and troubleshooting, refer to the provided documentation.
    Downloads: 1 This Week
  • 13
    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    ...It can also analyze document text to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common text files. For supported audio and video files, AI File Sorter can read embedded metadata (such as ID3, Vorbis, and MP4 tags) to suggest normalized names like year_artist_album_title.ext. AI analysis runs read-only, and all suggestions must be reviewed before being applied. AI File Sorter can run fully offline using local models like Mistral or LLaMA, so files and metadata stay on your device unless you configure a remote endpoint.
    Downloads: 218 This Week
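    The year_artist_album_title.ext naming pattern mentioned above can be sketched in a few lines of Python. This is only an illustration, not AI File Sorter's actual code: the `slug` and `suggest_name` helpers, the tag dictionary keys, and the choice of hyphens inside a field are all assumptions made for the example.

```python
import re
from pathlib import PurePath

# Field order taken from the pattern year_artist_album_title.ext.
FIELDS = ("year", "artist", "album", "title")

def slug(value) -> str:
    """Lowercase a tag value and join its ASCII letter/digit runs with hyphens."""
    return re.sub(r"[^a-z0-9]+", "-", str(value).lower()).strip("-")

def suggest_name(tags: dict, original: str) -> str:
    """Return year_artist_album_title.ext, skipping tags that are missing or empty."""
    parts = [slug(tags.get(key) or "") for key in FIELDS]
    parts = [p for p in parts if p]
    if not parts:
        return original  # nothing usable in the tags, keep the current name
    # Assumption: the original extension is kept, lowercased.
    return "_".join(parts) + PurePath(original).suffix.lower()

print(suggest_name(
    {"year": 2020, "artist": "Example Artist", "album": "First Album", "title": "Opening Song"},
    "Track 01.FLAC",
))
# prints 2020_example-artist_first-album_opening-song.flac
```

    The function only returns a suggestion and never touches the file, which mirrors the review-before-apply behaviour the description mentions. Note that this simple slug drops non-ASCII characters, so a real tool would need a gentler normalization.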
  • 14
    Snowmix

    Video mixer for mixing live and recorded video and audio feeds

    New version 0.5.2.1 released December 29th, 2025. Snowmix is a Swiss army knife tool for mixing live and recorded video and audio feeds. It supports 2D and 3D clipping, scaling and transparent overlay of video, PNG graphics and text. It supports animation of video, images and text through native commands changing scale, placement, transparency and rotation. Animation and actions can also be controlled through native scripting and an embedded Tcl and/or Python interpreter.
    Downloads: 8 This Week
  • 15

    Podget

    A simple podcast aggregator.

    Podget is a simple podcast aggregator optimized for running as a scheduled background job (e.g., cron), with support for categories & folders, importing servers from OPML lists & iTunes PCAST files, exporting an OPML file, automatic playlist creation and cleanup. Updated in version 0.8 on June 5, 2016 to include support for ATOM feeds in addition to RSS feeds. As of Nov 9, 2015 the CVS hosted by SourceForge will no longer be updated. The last version to be uploaded into the CVS was...
    Downloads: 1 This Week
  • 16
    Bluetooth Internet Radio

    Auto-setup Bluetooth internet radio on Ubuntu / Raspberry Pi / Armbian

    Fully automated setup of Bluetooth, an internet radio player (PyRadio), and a local music file player (mplayer) on a headless Raspberry Pi, connecting to a stereo system's Bluetooth receiver (bash script, chmod +x it to run). To install automatically, copy => paste => run the commands below in a terminal program (using the 'Terminal' app in the system menu, or over remote SSH), while logged in AS THE USER THAT WILL RUN THE APP (user must have sudo privileges): wget --no-cache -O...
    Downloads: 0 This Week
  • 17
    NebiOS Alternative Mirror (Official)

    An Ubuntu Linux-based OS aimed at end users.

    NebiOS is a secure, privacy-focused, Ubuntu-based universal OS for daily life, creativity, gaming & more. Official mirror - use if main site has download issues. Main site: https://nebios.org ⚠️ DON'T DOWNLOAD FROM UNKNOWN SOURCES! 🛡️ Privacy: NebiOS has no telemetry, no background data services, no tracking daemons, and cannot send any personal data without user action. Full policy: nebisoftware.com/legal/privacy.html === 🚀 NebiOS X out now! ⚠️ Note on Virtual Machines:...
    Downloads: 17 This Week
  • 18
    DVA-Profession

    Mass video digitization workflow management for archiving

    Professional digital video archiving system solution, developed and used by the Austrian national audio/video archive, designed to handle vast amounts of video content from ingest to long-term storage, including analysis, transcoding and metadata.
    Downloads: 0 This Week
  • 19
    shelfelf

    Web app for iTunes, MPD, Squeezebox, XMMS2, etc.

    shelfelf is a web frontend for sending commands to your music player (iTunes, MPD, Squeezebox, XMMS2, ...). It works with all tested browsers (except for Microsoft's Internet Explorer, Amiga's AWeb and Lynx), but the interface is designed for tablets (with a browser running in fullscreen mode) like Apple's iPad. It requires Linux (or Mac OS X) with an Apache web server, a MySQL database and the ability to send commands to your stereo equipment.
    Downloads: 0 This Week
  • 20
    microenc

    Batch audio encoding script for Linux/BSD

    microenc is a small Bash shell script for Linux/BSD that encodes directories of audio files to other formats, using FFmpeg as the encoder.
    Downloads: 1 This Week
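    To make the idea of batch-encoding a directory through FFmpeg concrete, here is a rough Python sketch. It is not microenc itself (which is a Bash script), and the `build_commands` helper, the list of input suffixes, and the Opus defaults are assumptions chosen for illustration. It only builds the ffmpeg argument lists and does not run them.

```python
from pathlib import Path

# Assumed set of input formats for this sketch; adjust as needed.
AUDIO_SUFFIXES = {".wav", ".flac", ".mp3", ".ogg", ".m4a"}

def build_commands(src_dir, dst_dir, ext="opus", codec="libopus", bitrate="128k"):
    """Return one ffmpeg argument list per audio file found directly in src_dir."""
    src_dir, dst_dir = Path(src_dir), Path(dst_dir)
    commands = []
    for src in sorted(src_dir.iterdir()):
        # Skip subdirectories and anything that is not a known audio format.
        if not src.is_file() or src.suffix.lower() not in AUDIO_SUFFIXES:
            continue
        dst = dst_dir / f"{src.stem}.{ext}"
        commands.append([
            "ffmpeg", "-n",        # -n: never overwrite an existing output file
            "-i", str(src),
            "-vn",                 # drop any video stream such as embedded cover art
            "-c:a", codec, "-b:a", bitrate,
            str(dst),
        ])
    return commands
```

    Each returned list can be passed to `subprocess.run(cmd, check=True)` once the output directory exists. Building the commands separately from running them makes a dry run easy to inspect first.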
  • 21
    mbp-fedora

    Fedora ISO with Apple T2 patches built-in

    ...This project provides an Ansible-based automation toolset for configuring Fedora to work optimally on MacBooks, handling quirks related to Apple’s proprietary hardware such as keyboard, touchpad, Wi-Fi, audio, and secure boot features. It aims to make Fedora a first-class experience on Apple hardware without requiring deep Linux knowledge or manual driver patching, and it’s actively updated with fixes for new kernel versions.
    Downloads: 2 This Week
  • 22
    SPTK is a suite of speech signal processing tools for UNIX environments, e.g., LPC analysis, PARCOR analysis, LSP analysis, PARCOR synthesis filter, LSP synthesis filter, vector quantization techniques, and other extended versions of them.
    Downloads: 13 This Week
  • 23
    Batch script for audio encoding. Supports MP3, AAC, Vorbis, Opus, FLAC, AC3, DTS and WAV (decoding) output and is intended to be used on directories with audio files. Filtering through MPlayer and automatic tagging with MediaInfo is also supported. audenc is no longer developed. Please use: https://sourceforge.net/projects/microenc/
    Downloads: 0 This Week
  • 24
    SVoice (Speech Voice Separation)

    We provide a PyTorch implementation of the paper Voice Separation

    ...The repository includes all necessary scripts for training, dataset preparation, distributed training, evaluation, and audio separation.
    Downloads: 2 This Week
  • 25
    Apodio Gnu/linux multimedia distribution

    Apodio - GNU/Linux OS for Artists & More

    APODIO is a GNU/Linux operating system containing audio, text-friendly, graphic and video tools. It can be used as a liveCD/USB or be installed on a partition of your hard disk. http://apodio.org
    Downloads: 16 This Week