Search Results for "audio gui interface"

Sort By:

1311 projects for "audio gui interface" with 1 filter applied:

BSD Clear Filters & Widen Search

Go from Code to Production URL in Seconds
Cloud Run deploys apps in any language instantly. Scales to zero. Pay only when code runs.

Skip the Kubernetes configs. Cloud Run handles HTTPS, scaling, and infrastructure automatically. Two million requests free per month.

Try it free
Fully Managed MySQL, PostgreSQL, and SQL Server
Automatic backups, patching, replication, and failover. Focus on your app, not your database.

Cloud SQL handles your database ops end to end, so you can focus on your app.

Try Free
1

TTS WebUI

A single Gradio + React WebUI with extensions for ACE-Step

TTS-WebUI is a unified Gradio + React web interface that brings together a large ecosystem of text-to-speech, voice conversion, and audio generation models under a single UI. It supports a wide range of models such as Bark, MusicGen, Tortoise, RVC, StyleTTS2, ParlerTTS, CosyVoice, XTTSv2, Stable Audio, SeamlessM4T, and many others, exposing them as interchangeable backends for speech and music synthesis.

Downloads: 2 This Week

Last Update: 2025-11-28
See Project
2

Bili23 Downloader

Cross platform GUI tool for downloading videos from Bilibili sites

...It can parse different types of links such as standard video pages, short links, and collection or activity pages to automatically retrieve downloadable media. It also allows users to choose video resolution, audio quality, and encoding format based on the available sources. Additional features include downloading subtitles, comments, metadata, and artwork associated with videos.

Downloads: 2 This Week

Last Update: 2026-03-10
See Project
3

Another Redis Desktop Manager

A faster, better and more stable Redis desktop manager

AnotherRedisDesktopManager is a cross-platform GUI client for Redis that simplifies connecting, browsing, and manipulating data. It supports standalone, Sentinel, and Cluster modes, plus SSH tunneling and ACL credentials for secure access in varied environments. The UI provides tree and table views of keys with inline editors for strings, hashes, lists, sets, sorted sets, and streams, including TTL management and batch operations. Built-in monitoring lets you watch stats, slow logs, and...

Downloads: 60 This Week

Last Update: 2025-09-08
See Project
4

bfxr

Flash + AIR sound effects generator. Based on Sfxr.

...Its purpose is to enable users, especially game developers and sound designers, to quickly generate retro, 8-bit/“chiptune” style sound effects (“bleeps”, “booms”, “zaps”, etc.) without deep knowledge of audio signal processing. It offers an interactive GUI through which you can tweak many parameters (oscillators, envelopes, filters, etc.) to sculpt custom sound effects; you can preview in real time, export, and iterate. The project includes libraries, HTML templates, and both ActionScript and JavaScript code. It has been well-received (over a thousand stars), but as of 2025, it has been superseded by a newer version called bfxr2, which is a JavaScript reworking of the original.

Downloads: 15 This Week

Last Update: 1 day ago
See Project
Full-stack observability with actually useful AI | Grafana Cloud
Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.

Create free account
5

Riffusion App

Stable diffusion for real-time music generation (web app)

Riffusion App Hobby is an open-source interactive web application that enables real-time music generation using stable diffusion models adapted for audio synthesis. Unlike traditional music generation tools, it treats audio as spectrogram images and applies diffusion techniques to generate continuous sound transitions, allowing users to create evolving musical loops and compositions. The application is built with modern web technologies including Next.js, React, and three.js, providing a responsive and visually engaging interface for experimentation. ...

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
6

MusicPlayer2

Audio player that can play common audio formats

MusicPlayer2 is a simple music-player application (or prototype) implemented in — presumably — a web or desktop environment, intended to give users a clean, functional interface for managing and playing audio files. The project likely implements basic playlist management, playback controls (play, pause, skip), and possibly UI features to browse or organize music. Because many smaller music-player projects aim for simplicity, MusicPlayer2 may focus on providing a lightweight, minimal-dependency audio player compared to larger, heavy multimedia suites. ...

Downloads: 10 This Week

Last Update: 2025-12-27
See Project
7

WanGP

AI video generator optimized for low VRAM and older GPUs use

Wan2GP is an open source AI video generation toolkit designed to make modern generative models accessible on consumer-grade hardware with limited GPU memory. It acts as a unified interface for running multiple video, image, and audio generation models, including Wan-based models as well as other systems like Hunyuan Video, Flux, and Qwen. A key focus of the project is reducing VRAM requirements, enabling some workflows to run on as little as 6 GB while still supporting older Nvidia and certain AMD GPUs. Wan2GP provides a full web-based interface that simplifies interaction with complex generative pipelines, making it easier to configure prompts, models, and rendering settings. ...

Downloads: 28 This Week

Last Update: 1 day ago
See Project
8

OuteTTS

Interface for OuteTTS models

OuteTTS is an interface library for running OuteTTS text-to-speech models across a range of backends, making it easier to deploy the same model on different hardware and runtimes. It provides a high-level Interface API that wraps model configuration, speaker handling, and audio generation so you can focus on integrating speech into your application rather than wiring up low-level engines.

Downloads: 0 This Week

Last Update: 2025-11-28
See Project
9

SFML

Simple and Fast Multimedia Library

SFML provides a simple interface to the various components of your PC, to ease the development of games and multimedia applications. It is composed of five modules: system, window, graphics, audio and network. Discover their features more in detail in the tutorials and the API documentation. With SFML, your application can compile and run out of the box on the most common operating systems: Windows, Linux, macOS and soon Android & iOS.

Downloads: 122 This Week

Last Update: 2025-09-18
See Project
Gemini 3 and 200+ AI Models on One Platform
Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

Build generative AI apps with Vertex AI. Switch between models without switching platforms.

Start Free
10

OpenAI.fm

Code for openai.fm, a demo for the OpenAI Speech API

OpenAI.fm is an official interactive demo application built to showcase the OpenAI Speech API and its advanced text-to-speech capabilities, providing developers and creators with a hands-on web interface to convert text into high-quality, customizable audio using state-of-the-art TTS models. Developed using Next.js and the OpenAI Speech API, this demo illustrates how the latest neural voice models can produce natural, expressive speech with adjustable styles and voices, highlighting features like emotional range, tone, and real-time playback. ...

Downloads: 18 This Week

Last Update: 2026-01-28
See Project
11

Kaset

The missing YouTube Music macOS app

Kaset is a social audio platform framework that allows users to host, share, and interact with audio content in community-oriented spaces, combining elements of podcasting, voice rooms, and feedback-driven discovery. It provides an interface where creators can upload episodes, host live or scheduled voice sessions, and cultivate listener communities through comments, reactions, and follow systems.

Downloads: 4 This Week

Last Update: 6 days ago
See Project
12

EeveeSpotify

A tweak to enhance Spotify experience

EeveeSpotifyReborn is an unofficial modification for the Spotify mobile application that alters client-side behavior to unlock premium-like features without requiring a paid subscription. It operates by injecting changes into the Spotify app, making it interpret the user account as having premium access and enabling functionalities that are normally restricted. The project was developed through reverse engineering techniques, including analyzing application behavior and intercepting requests...

Downloads: 45 This Week

Last Update: 2026-03-23
See Project
13

Speakr

Speakr is a personal, self-hosted web application

Speakr is an open-source, real-time text-to-speech (TTS) web application that allows users to convert written text into natural-sounding speech in just a few clicks. It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. ...

Downloads: 0 This Week

Last Update: 2026-03-18
See Project
14

SoniTranslate

Synchronized Translation for Videos

SoniTranslate is a video translation and dubbing system that produces synchronized target-language audio tracks for existing video content. It provides a web UI built with Gradio, allowing users to upload a video, choose source and target languages, and then run a pipeline that handles transcription, translation and re-synthesis of speech. Under the hood, it uses advanced speech and diarization models to separate speakers, align audio with timecodes and respect subtitle timing, which lets...

Downloads: 36 This Week

Last Update: 2025-11-28
See Project
15

MAI-UI

Real-World Centric Foundation GUI Agents

MAI-UI is a cutting-edge open-source project that implements a family of foundation GUI (Graphical User Interface) agent models capable of interpreting natural language and performing real-world GUI navigation and control tasks across mobile and desktop environments. Developed by Tongyi-MAI (Alibaba’s research initiative), the MAI-UI models are multimodal agents trained to understand user instructions and corresponding screenshots, grounding those instructions to on-screen elements and generating sequences of GUI actions such as taps, swipes, text input, and system commands. ...

Downloads: 4 This Week

Last Update: 2026-03-20
See Project
16

Clash Party

Another Mihomo GUI

Clash Party is a cross-platform graphical client built around the Mihomo (Clash Meta) core, designed to manage and route network traffic through configurable proxy protocols in a user-friendly desktop environment. It provides an advanced interface for configuring proxy rules, managing subscriptions, and optimizing network routing behavior across operating systems such as Windows, macOS, and Linux. The application integrates multiple proxy protocols, allowing users to route traffic through...

Downloads: 36 This Week

Last Update: 5 days ago
See Project
17

OpenCorePkg

OpenCore bootloader

...Apple UEFI secure boot supplemental code. Audio management with screen reading support. Basic ACPI and SMBIOS manipulation.

Downloads: 190 This Week

Last Update: 2026-03-20
See Project
18

Scriberr

Self-hosted AI audio transcription

...The application includes a polished user interface that simplifies the management of recordings, transcripts, and annotations, making it suitable for both casual users and professionals handling large volumes of audio. Beyond transcription, Scriberr also integrates features such as summarization, tagging, and interaction with language models, allowing users to extract insights from conversations or meetings efficiently.

Downloads: 8 This Week

Last Update: 2026-03-19
See Project
19

SerenityOS

The Serenity Operating System

SerenityOS is an open source Unix-like operating system project with its own custom kernel, graphical user interface, system libraries, and userland tools. It combines a nostalgic “90s UI aesthetic” with modern system capabilities: a preemptive, multi-threaded kernel, own browsers, network stack, file systems, IPC, security features, and a suite of graphical / developer applications. The project is both a hobbyist OS and a polished engineering sandbox.

Downloads: 31 This Week

Last Update: 2026-03-19
See Project
20

OmniTools

Self-hosted collection of powerful web-based tools for everyday tasks

...The tool catalog spans both technical and non-technical needs, including image, video, audio, PDF, text, date/time, math, and data format utilities like JSON/CSV/XML helpers. It’s also packaged for straightforward self-hosting, with a lightweight Docker image and simple run commands, so it can be deployed quickly on a homelab or internal network.

Downloads: 7 This Week

Last Update: 2026-01-27
See Project
21

Whisper-WebUI

A Web UI for easy subtitle using whisper model

Whisper WebUI is an open-source browser-based interface that simplifies the use of Whisper speech recognition models by providing an intuitive graphical environment for transcription, translation, and subtitle generation. Built with Gradio, it allows users to upload audio or video files, process them locally, and generate accurate text outputs without relying on command-line tools.

Downloads: 2 This Week

Last Update: 2026-03-18
See Project
22

EasyVoice

Open source text-to-speech tool, supports extra-long text

easyVoice is an open-source text-to-speech platform aimed at turning long-form text and novels into high-quality audio, with a strong focus on usability and scalability. It provides a web interface where users can paste or upload large texts and generate speech and subtitles in a single workflow, even for works exceeding 100,000 characters. The system supports multi-role voice acting, letting users assign different neural voices to different characters or narrative roles and configure parameters such as rate, pitch, and volume per role. ...

Downloads: 0 This Week

Last Update: 2026-01-26
See Project
23

HIDAPI library

A Simple cross-platform library for communicating with HID devices

HIDAPI is a multi-platform library that allows an application to interface with USB and Bluetooth HID-Class devices on Windows, Linux, FreeBSD, and macOS. HIDAPI can be either built as a shared library (.so, .dll or .dylib) or can be embedded directly into a target application by adding a single source file (per platform) and a single header.

Downloads: 10 This Week

Last Update: 2025-05-18
See Project
24

wxPython Project Phoenix

wxPython's Project Phoenix. A new implementation of wxPython

...This new implementation is focused on improving speed, maintainability and extensibility. Just like "Classic" wxPython, Phoenix wraps the wxWidgets C++ toolkit and provides access to the user interface portions of the wxWidgets API, enabling Python applications to have a native GUI on Windows, Macs or Unix systems, with a native look and feel and requiring very little (if any) platform-specific code.

Downloads: 4 This Week

Last Update: 2026-02-08
See Project
25

Page Agent

JavaScript in-page GUI agent. Control web interfaces

Page Agent is an open-source in-page AI agent framework that allows developers to control and interact with web interfaces using natural language directly within the browser. Unlike traditional browser automation tools, it operates entirely through in-page JavaScript, eliminating the need for browser extensions, headless browsers, or external automation environments. The system enables users to manipulate the DOM through text-based commands, allowing complex workflows such as form filling,...

Downloads: 4 This Week

Last Update: 3 days ago
See Project