audio gui free download

Showing 416 open source projects for "audio gui"

View related business solutions

Linux Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Stop vibe-debugging.
Plug Claude into your app's actual errors.

AppSignal's MCP server hands Claude, Cursor, or Zed your real errors, traces, and the deploy that shipped them. AI writes the fix; you review the diff.

Free 30 days.
1

Ultimate Vocal Remover (UVR5)

GUI for a Vocal Remover that uses Deep Neural Networks

This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).

1 Review

Downloads: 26,109 This Week

Last Update: 2025-01-20
See Project
2

LosslessCut

The swiss army knife of lossless video/audio editing

LosslessCut aims to be the ultimate cross platform FFmpeg GUI for extremely fast and lossless operations on video, audio, subtitle and other related media files. The main feature is lossless trimming and cutting of video and audio files, which is great for saving space by rough-cutting your large video files taken from a video camera, GoPro, drone, etc. It lets you quickly extract the good parts from your videos and discard many gigabytes of data without doing a slow re-encode and thereby losing quality. ...

7 Reviews

Downloads: 298 This Week

Last Update: 2026-06-04
See Project
3

Czkawka

Multi functional app to find duplicates, empty folders, similar images

Czkawka (Polish for “hiccup”) is a lightning‑fast, multi‑purpose file cleaning tool written in Rust. It helps users declutter storage by finding duplicate files, similar images or audio, empty folders, and unusually large files through CPU‑efficient multithreading. Available with both GUI (GTK‑based) and CLI versions for flexible usage.

Downloads: 237 This Week

Last Update: 2026-02-21
See Project
4

bfxr

Flash + AIR sound effects generator. Based on Sfxr.

...Its purpose is to enable users, especially game developers and sound designers, to quickly generate retro, 8-bit/“chiptune” style sound effects (“bleeps”, “booms”, “zaps”, etc.) without deep knowledge of audio signal processing. It offers an interactive GUI through which you can tweak many parameters (oscillators, envelopes, filters, etc.) to sculpt custom sound effects; you can preview in real time, export, and iterate. The project includes libraries, HTML templates, and both ActionScript and JavaScript code. It has been well-received (over a thousand stars), but as of 2025, it has been superseded by a newer version called bfxr2, which is a JavaScript reworking of the original.

Downloads: 14 This Week

Last Update: 1 day ago
See Project
Build Agents and Models on One Platform
Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.

Try It Free
5

FeelUOwn

Trying to be a robust, user-friendly and hackable music player

FeelUOwn is a user-friendly, and hackable music player.

Downloads: 2 This Week

Last Update: 5 days ago
See Project
6

Audiblez

Generate audiobooks from e-books

...Audiblez can run entirely from the command line via a PyPI package or through a simple cross-platform GUI built on wxPython, giving both advanced users and non-technical users an accessible workflow.

Downloads: 43 This Week

Last Update: 2025-11-30
See Project
7

Bili23 Downloader

Cross platform GUI tool for downloading videos from Bilibili sites

Bili23-Downloader is an open source desktop application designed for downloading video content from the Bilibili platform. It provides a graphical interface that allows users to download various types of media including user-uploaded videos, series episodes, movies, and other hosted content. It focuses on ease of use with a zero-configuration setup, making it accessible to both beginners and experienced users. It supports high performance downloads through multi-threading and includes resume...

Downloads: 36 This Week

Last Update: 2026-06-01
See Project
8

ebook2audiobook

Generate audiobooks from e-books, voice cloning & 1107+ languages

ebook2audiobook is a tool to convert legally obtained eBooks (non-DRM) into fully narrated audiobooks, complete with chapters and metadata. It automates the pipeline: it reads the eBook file, splits it into appropriate segments (chapters, paragraphs), uses text-to-speech (TTS) models to synthesize audio, optionally applies voice cloning, and outputs a final audiobook — ideal for people who prefer listening over reading, or for accessibility purposes. The tool supports a wide array of...

Downloads: 29 This Week

Last Update: 3 days ago
See Project
9

TTS WebUI

A single Gradio + React WebUI with extensions for ACE-Step

TTS-WebUI is a unified Gradio + React web interface that brings together a large ecosystem of text-to-speech, voice conversion, and audio generation models under a single UI. It supports a wide range of models such as Bark, MusicGen, Tortoise, RVC, StyleTTS2, ParlerTTS, CosyVoice, XTTSv2, Stable Audio, SeamlessM4T, and many others, exposing them as interchangeable backends for speech and music synthesis. The project provides an installer that sets up Conda, Python environments, and all necessary dependencies, so users can focus on experimenting with voices instead of managing tooling. ...

Downloads: 4 This Week

Last Update: 2026-05-14
See Project
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
10

ComfyUI

The most powerful and modular diffusion model GUI, api and backend

The most powerful and modular diffusion model is GUI and backend. This UI will let you design and execute advanced stable diffusion pipelines using a graph/nodes/flowchart-based interface. We are a team dedicated to iterating and improving ComfyUI, supporting the ComfyUI ecosystem with tools like node manager, node registry, cli, automated testing, and public documentation. Open source AI models will win in the long run against closed models and we are only at the beginning. Our core mission...

Downloads: 189 This Week

Last Update: 3 days ago
See Project
11

Lumos Engine

Cross-Platform C++ 2D/3D game engine

Cross-platform 2D and 3D Game Engine written in C++ that supports both OpenGL and Vulkan. Support for Windows, Linux, macOS. Support for OpenGL/Vulkan. D audio using OpenAL. Rendering 3D models with PBR shading. Debug gui using ImGui 3D collision detection - cuboid/sphere/pyramid. 2D collision detection - Box2D. Basic lua scripting support.

Downloads: 2 This Week

Last Update: 2024-09-14
See Project
12

Real-Time Voice Cloning

Clone a voice in 5 seconds to generate arbitrary speech in real-time

Real-Time Voice Cloning is an influential deep-learning repository that demonstrates how to clone a voice from just a few seconds of audio and then generate arbitrary speech in that voice in near real time. It implements the SV2TTS pipeline (“Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis”) in three stages: a speaker encoder, a synthesizer, and a vocoder. In the first stage, short audio clips are converted into a fixed-dimensional speaker embedding that...

Downloads: 0 This Week

Last Update: 2026-03-09
See Project
13

ffmpeg-commander

A web-based GUI for quickly generating common FFmpeg command-line

ffmpeg-commander is a web-based graphical interface that simplifies the creation of FFmpeg commands for common video and audio encoding tasks. It provides a user-friendly environment where users can configure encoding options without needing to memorize complex command-line syntax. Built with modern web technologies, it generates FFmpeg commands dynamically based on user input. The tool focuses on common workflows such as format conversion, compression, and codec selection. It can optionally...

Downloads: 0 This Week

Last Update: 2026-04-28
See Project
14

ytDownloader

Desktop App for downloading Videos and Audios from hundreds of sites

ytDownloader is a modern desktop application designed to download videos and extract audio from hundreds of online platforms through a clean graphical user interface. Built as a cross-platform tool for Windows, macOS, and Linux, it leverages tools like yt-dlp and FFmpeg under the hood while abstracting their complexity into an intuitive user experience. The application supports downloading from major platforms such as YouTube, Facebook, TikTok, Instagram, Twitch, and Twitter, offering users...

Downloads: 15 This Week

Last Update: 2026-06-11
See Project
15

voxshare_gui

*VoxShare* is a simple Python-based push-to-talk multicast voice chat

VoxShare is a simple Python-based push-to-talk multicast voice chat application with a sleek modern GUI built using CustomTkinter. Provided as python source code or compiled standalone windows application (no need to install anything).

Downloads: 8 This Week

Last Update: 2025-07-01
See Project
16

dab sdr gui

DAB+ SDR receiver for scanning, playback, and signal analysis

DAB+ SDR is a software-defined radio application for receiving and exploring DAB+ digital radio. It includes band scanning to find available stations, live playback, an audio equalizer for sound adjustment, and a spectrum analyzer for real-time signal visualization. It is designed to make DAB+ listening and signal monitoring simple, practical, and flexible. Select your device in SDR settings menu.

Downloads: 16 This Week

Last Update: 2026-05-21
See Project
17

pyVideoTrans

Translate the video from one language to another and embed dubbing

pyVideoTrans is an ambitious open-source multimedia processing project that assembles speech recognition, subtitle generation, AI translation, voice synthesis, and video assembly into a unified pipeline for converting videos from one language to another with embedded dubbing and captions. At its core it runs speech-to-text models to transcribe audio tracks, translates the resulting text into a target language using local or cloud-based translation engines, synthesizes new speech to match the translated subtitles, and then merges that speech back into the video, creating a fully localized media file. The tool supports both command-line and GUI modes, making it accessible to developers and creatives needing batch or automated processing.

Downloads: 28 This Week

Last Update: 4 days ago
See Project
18

annyang!

Speech recognition for your site

annyang is a tiny javascript library that lets your visitors control your site with voice commands. annyang supports multiple languages, has no dependencies, weighs just 2kb and is free to use. annyang understands commands with named variables, splats, and optional words. Use named variables for one word arguments in your command. Use splats to capture multi-word text at the end of your command (greedy). Use optional words or phrases to define a part of the command as optional. annyang plays...

Downloads: 1 This Week

Last Update: 2026-03-11
See Project
19

mpv

Command line video player

mpv is a free (as in freedom) media player for the command line. It supports a wide variety of media file formats, audio and video codecs, and subtitle types. Powerful scripting capabilities can make the player do almost anything. There is a large selection of user scripts on the wiki. While mpv strives for minimalism and provides no real GUI, it has a small controller on top of the video for basic control. mpv has an OpenGL, Vulkan, and D3D11 based video output that is capable of many features loved by videophiles, such as video scaling with popular high quality algorithms, color management, frame timing, interpolation, HDR, and more. mpv can leverage most hardware decoding APIs on all platforms. ...

Downloads: 42 This Week

Last Update: 2025-12-22
See Project
20

DeepSqueak

DeepSqueak Using Machine Vision to Accelerate Bioacoustics Research

Using Machine Vision to Accelerate Bioacoustics Research.

Downloads: 0 This Week

Last Update: 2024-10-11
See Project
21

StreamSpeech

StreamSpeech is a seamless model for offline speech recognition

StreamSpeech is an “all-in-one” speech model designed to perform offline and simultaneous speech recognition, speech translation, and speech synthesis within a single unified architecture. Developed as part of an ACL 2024 paper, it targets streaming and low-latency scenarios where intermediate results and final translations or synthetic speech must be produced continuously as audio is being received. The model supports eight tasks: offline ASR, speech-to-text translation, speech-to-speech...

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
22

yt-dlp-gui Linux

Graphical interface for yt-dlp, a tool for downloading YouTube videos.

yt-dlp-gui Linux in YAD . Graphical interface for yt-dlp, a tool for downloading YouTube videos. This script provides a simple graphical interface to download and convert videos or audio from YouTube using yt-dlp. Additionally, it checks if yt-dlp is installed and, if not, downloads and installs it automatically. It supports multiple languages and output formats.

Downloads: 13 This Week

Last Update: 2025-03-31
See Project
23

QMP3Gain

Multiplatform GUI of MP3Gain which adjusts volume of MP3 files

User interface front end supporting famous MP3Gain engine which analyzes and losslessly adjusts MP3 files to a specified target volume. A complete re-creation (made in QT5) of the original MP3Gain GUI, however it can be run on more platforms.

7 Reviews

Downloads: 227 This Week

Last Update: 2026-02-21
See Project
24

FFmpeg Batch AV Converter

Free all in one audio/video ffmpeg batch encoder

FFmpeg Batch AV Converter is a free universal audio and video encoder for Windows and Linux (via Wine), that allows to use the full potential of ffmpeg command line with a few mouse clicks in a convenient GUI with drag and drop, progress information. Some fancy wizards make things easy for non-experts. Thanks to its multi-file encoding feature, it may be the fastest a/v batch encoder available, since it maximizes system resources usage by launching as many simultaneous processes up to user cpu thread count. ...

32 Reviews

Downloads: 1,275 This Week

Last Update: 2026-05-27
See Project
25

Videomass

Videomass is a free, open source and cross-platform GUI for FFmpeg

Videomass is a free, open-source graphical interface for FFmpeg designed to make advanced video and audio processing accessible to both beginners and experienced users. Built in Python using wxPython, it provides a cross-platform environment for managing encoding, conversion, and editing tasks through a visual interface. The software supports multitasking operations, allowing users to process multiple media files simultaneously. It offers extensive configuration options while also providing...

Downloads: 0 This Week

Last Update: 2026-04-24
See Project