Showing 201 open source projects for "audio gui"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • Secure File Transfer for Windows with Cerberus by Redwood Icon
    Secure File Transfer for Windows with Cerberus by Redwood

    Protect and share files over FTP/S, SFTP, HTTPS and SCP with the #1 rated Windows file transfer server.

    Cerberus supports unlimited users and connections on a single IP, with built-in encryption, 2FA, and a browser-based web client — all deployable in under 15 minutes with a 25-day free trial.
    Try for Free
  • 1
    Ultimate Vocal Remover (UVR5)

    Ultimate Vocal Remover (UVR5)

    GUI for a Vocal Remover that uses Deep Neural Networks

    This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).
    Downloads: 26,109 This Week
    Last Update:
    See Project
  • 2
    LosslessCut

    LosslessCut

    The swiss army knife of lossless video/audio editing

    LosslessCut aims to be the ultimate cross platform FFmpeg GUI for extremely fast and lossless operations on video, audio, subtitle and other related media files. The main feature is lossless trimming and cutting of video and audio files, which is great for saving space by rough-cutting your large video files taken from a video camera, GoPro, drone, etc. It lets you quickly extract the good parts from your videos and discard many gigabytes of data without doing a slow re-encode and thereby losing quality. ...
    Downloads: 298 This Week
    Last Update:
    See Project
  • 3
    Czkawka

    Czkawka

    Multi functional app to find duplicates, empty folders, similar images

    Czkawka (Polish for “hiccup”) is a lightning‑fast, multi‑purpose file cleaning tool written in Rust. It helps users declutter storage by finding duplicate files, similar images or audio, empty folders, and unusually large files through CPU‑efficient multithreading. Available with both GUI (GTK‑based) and CLI versions for flexible usage.
    Downloads: 237 This Week
    Last Update:
    See Project
  • 4
    bfxr

    bfxr

    Flash + AIR sound effects generator. Based on Sfxr.

    ...Its purpose is to enable users, especially game developers and sound designers, to quickly generate retro, 8-bit/“chiptune” style sound effects (“bleeps”, “booms”, “zaps”, etc.) without deep knowledge of audio signal processing. It offers an interactive GUI through which you can tweak many parameters (oscillators, envelopes, filters, etc.) to sculpt custom sound effects; you can preview in real time, export, and iterate. The project includes libraries, HTML templates, and both ActionScript and JavaScript code. It has been well-received (over a thousand stars), but as of 2025, it has been superseded by a newer version called bfxr2, which is a JavaScript reworking of the original.
    Downloads: 14 This Week
    Last Update:
    See Project
  • Ship Agents Faster Icon
    Ship Agents Faster

    Transform your applications and workflows into powerful agentic systems at global scale.

    Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.
    Get Started Free
  • 5
    FeelUOwn

    FeelUOwn

    Trying to be a robust, user-friendly and hackable music player

    FeelUOwn is a user-friendly, and hackable music player.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 6
    Audiblez

    Audiblez

    Generate audiobooks from e-books

    ...Audiblez can run entirely from the command line via a PyPI package or through a simple cross-platform GUI built on wxPython, giving both advanced users and non-technical users an accessible workflow.
    Downloads: 43 This Week
    Last Update:
    See Project
  • 7
    Bili23 Downloader

    Bili23 Downloader

    Cross platform GUI tool for downloading videos from Bilibili sites

    Bili23-Downloader is an open source desktop application designed for downloading video content from the Bilibili platform. It provides a graphical interface that allows users to download various types of media including user-uploaded videos, series episodes, movies, and other hosted content. It focuses on ease of use with a zero-configuration setup, making it accessible to both beginners and experienced users. It supports high performance downloads through multi-threading and includes resume...
    Downloads: 36 This Week
    Last Update:
    See Project
  • 8
    ebook2audiobook

    ebook2audiobook

    Generate audiobooks from e-books, voice cloning & 1107+ languages

    ebook2audiobook is a tool to convert legally obtained eBooks (non-DRM) into fully narrated audiobooks, complete with chapters and metadata. It automates the pipeline: it reads the eBook file, splits it into appropriate segments (chapters, paragraphs), uses text-to-speech (TTS) models to synthesize audio, optionally applies voice cloning, and outputs a final audiobook — ideal for people who prefer listening over reading, or for accessibility purposes. The tool supports a wide array of...
    Downloads: 29 This Week
    Last Update:
    See Project
  • 9
    TTS WebUI

    TTS WebUI

    A single Gradio + React WebUI with extensions for ACE-Step

    TTS-WebUI is a unified Gradio + React web interface that brings together a large ecosystem of text-to-speech, voice conversion, and audio generation models under a single UI. It supports a wide range of models such as Bark, MusicGen, Tortoise, RVC, StyleTTS2, ParlerTTS, CosyVoice, XTTSv2, Stable Audio, SeamlessM4T, and many others, exposing them as interchangeable backends for speech and music synthesis. The project provides an installer that sets up Conda, Python environments, and all necessary dependencies, so users can focus on experimenting with voices instead of managing tooling. ...
    Downloads: 4 This Week
    Last Update:
    See Project
  • $300 Free Credits for Your Google Cloud Projects Icon
    $300 Free Credits for Your Google Cloud Projects

    Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

    Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.
    Start Free Trial
  • 10
    ComfyUI

    ComfyUI

    The most powerful and modular diffusion model GUI, api and backend

    The most powerful and modular diffusion model is GUI and backend. This UI will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart-based interface. We are a team dedicated to iterating and improving ComfyUI, supporting the ComfyUI ecosystem with tools like node manager, node registry, cli, automated testing, and public documentation. Open source AI models will win in the long run against closed models and we are only at the beginning. Our core mission...
    Downloads: 189 This Week
    Last Update:
    See Project
  • 11
    Lumos Engine

    Lumos Engine

    Cross-Platform C++ 2D/3D game engine

    Cross-platform 2D and 3D Game Engine written in C++ that supports both OpenGL and Vulkan. Support for Windows, Linux, macOS. Support for OpenGL/Vulkan. D audio using OpenAL. Rendering 3D models with PBR shading. Debug gui using ImGui 3D collision detection - cuboid/sphere/pyramid. 2D collision detection - Box2D. Basic lua scripting support.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 12
    ffmpeg-commander

    ffmpeg-commander

    A web-based GUI for quickly generating common FFmpeg command-line

    ffmpeg-commander is a web-based graphical interface that simplifies the creation of FFmpeg commands for common video and audio encoding tasks. It provides a user-friendly environment where users can configure encoding options without needing to memorize complex command-line syntax. Built with modern web technologies, it generates FFmpeg commands dynamically based on user input. The tool focuses on common workflows such as format conversion, compression, and codec selection. It can optionally...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    ytDownloader

    ytDownloader

    Desktop App for downloading Videos and Audios from hundreds of sites

    ytDownloader is a modern desktop application designed to download videos and extract audio from hundreds of online platforms through a clean graphical user interface. Built as a cross-platform tool for Windows, macOS, and Linux, it leverages tools like yt-dlp and FFmpeg under the hood while abstracting their complexity into an intuitive user experience. The application supports downloading from major platforms such as YouTube, Facebook, TikTok, Instagram, Twitch, and Twitter, offering users...
    Downloads: 15 This Week
    Last Update:
    See Project
  • 14
    pyVideoTrans

    pyVideoTrans

    Translate the video from one language to another and embed dubbing

    pyVideoTrans is an ambitious open-source multimedia processing project that assembles speech recognition, subtitle generation, AI translation, voice synthesis, and video assembly into a unified pipeline for converting videos from one language to another with embedded dubbing and captions. At its core it runs speech-to-text models to transcribe audio tracks, translates the resulting text into a target language using local or cloud-based translation engines, synthesizes new speech to match the translated subtitles, and then merges that speech back into the video, creating a fully localized media file. The tool supports both command-line and GUI modes, making it accessible to developers and creatives needing batch or automated processing.
    Downloads: 28 This Week
    Last Update:
    See Project
  • 15
    annyang!

    annyang!

    Speech recognition for your site

    annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16
    mpv

    mpv

    Command line video player

    mpv is a free (as in freedom) media player for the command line. It supports a wide variety of media file formats, audio and video codecs, and subtitle types. Powerful scripting capabilities can make the player do almost anything. There is a large selection of user scripts on the wiki. While mpv strives for minimalism and provides no real GUI, it has a small controller on top of the video for basic control. mpv has an OpenGL, Vulkan, and D3D11 based video output that is capable of many features loved by videophiles, such as video scaling with popular high quality algorithms, color management, frame timing, interpolation, HDR, and more. mpv can leverage most hardware decoding APIs on all platforms. ...
    Downloads: 42 This Week
    Last Update:
    See Project
  • 17
    DeepSqueak

    DeepSqueak

    DeepSqueak Using Machine Vision to Accelerate Bioacoustics Research

    Using Machine Vision to Accelerate Bioacoustics Research.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    StreamSpeech

    StreamSpeech

    StreamSpeech is a seamless model for offline speech recognition

    StreamSpeech is an “all-in-one” speech model designed to perform offline and simultaneous speech recognition, speech translation, and speech synthesis within a single unified architecture. Developed as part of an ACL 2024 paper, it targets streaming and low-latency scenarios where intermediate results and final translations or synthetic speech must be produced continuously as audio is being received. The model supports eight tasks: offline ASR, speech-to-text translation, speech-to-speech...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    yt-dlp-gui Linux

    yt-dlp-gui Linux

    Graphical interface for yt-dlp, a tool for downloading YouTube videos.

    yt-dlp-gui Linux in YAD . Graphical interface for yt-dlp, a tool for downloading YouTube videos. This script provides a simple graphical interface to download and convert videos or audio from YouTube using yt-dlp. Additionally, it checks if yt-dlp is installed and, if not, downloads and installs it automatically. It supports multiple languages and output formats.
    Downloads: 13 This Week
    Last Update:
    See Project
  • 20
    Videomass

    Videomass

    Videomass is a free, open source and cross-platform GUI for FFmpeg

    Videomass is a free, open-source graphical interface for FFmpeg designed to make advanced video and audio processing accessible to both beginners and experienced users. Built in Python using wxPython, it provides a cross-platform environment for managing encoding, conversion, and editing tasks through a visual interface. The software supports multitasking operations, allowing users to process multiple media files simultaneously. It offers extensive configuration options while also providing...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    LaiNES

    LaiNES

    Compact cycle-accurate NES emulator

    LaiNES is a compact, cycle-accurate Nintendo Entertainment System emulator written in C++ that prioritizes precision and minimalism in its implementation. Its design focuses on accurately simulating the NES hardware at the clock-cycle level, ensuring that timing-sensitive behaviors and edge cases are faithfully reproduced. Despite its relatively small codebase, it supports a wide range of cartridge mappers, enabling compatibility with a large portion of NES games. The emulator includes a...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    QjackCtl

    QjackCtl

    JACK Audio Connection Kit Qt GUI Interface

    JACK Audio Connection Kit - Qt GUI Interface: A simple Qt application to control the JACK server daemon. Written in C++ around the Qt framework for X11, most exclusively using Qt Designer. Provides a simple GUI dialog for setting several JACK server p
    Leader badge
    Downloads: 97 This Week
    Last Update:
    See Project
  • 23
    Homebridge Raspberry Pi Image

    Homebridge Raspberry Pi Image

    Homebridge Raspberry Pi Image based on Raspberry Pi OS Lite

    This project provides a free Raspbian-based Raspberry Pi image with Homebridge and Homebridge Config UI X pre-installed. This image also provides a command called hb-config which helps you keep Node.js up-to-date, perform maintenance on your Homebridge server, and install additional optional software such as Pi Hole and deCONZ. The Homebridge service is installed using the method described in the Raspberry Pi Installation Guide on the Homebridge project wiki. The easiest way to flash the...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 24
    NeoDLP

    NeoDLP

    Cross-platform Video/Audio Downloader Desktop App

    Cross-platform Video/Audio Downloader Desktop App with Modern UI and Browser Integration based on YT-DLP
    Downloads: 33 This Week
    Last Update:
    See Project
  • 25
    Qsynth

    Qsynth

    A fluidsynth Qt GUI Interface

    Qsynth is a fluidsynth GUI front-end application, written in C++ around the Qt framework, using Qt Designer. Eventually it may evolve into a softsynth management application allowing the user to control and manage a variety of command line softsynths.
    Leader badge
    Downloads: 110 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
Auth0 Logo