audio quality free download

Showing 466 open source projects for "audio quality"

1

MLX-Audio

A text-to-speech, speech-to-text and speech-to-speech library

MLX-Audio is a speech library built on Apple’s MLX framework and optimized for Apple Silicon machines (M-series Macs). It focuses on text-to-speech and speech-to-speech workflows, with APIs and a command-line interface that make it easy to generate high-quality audio from text. Because it uses MLX and targets Apple Silicon, inference is fast and can take advantage of hardware acceleration and quantization for efficient on-device performance.

Downloads: 11 This Week

Last Update: 4 days ago
See Project
2

Ultimate Vocal Remover (UVR5)

GUI for a Vocal Remover that uses Deep Neural Networks

This application uses state-of-the-art source separation models to remove vocals from audio files. UVR's core developers trained all of the models provided in this package (except for the Demucs v3 and v4 4-stem models).

1 Review

Downloads: 2,181 This Week

Last Update: 2025-01-20
See Project
3

SysDVR

Stream Switch games to your PC via USB or network

This is a sysmodule that captures the running game's output and sends it to a PC over a USB or network connection. It is cross-platform and can stream to Windows, Mac and Linux, via USB or Wi-Fi. Video quality is fixed at 720p @ 30fps with H.264 compression; this is a hardware limit. Audio quality is fixed at uncompressed 16-bit PCM @ 48kHz stereo. Latency is very low with an optimal setup, and most games are playable.
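As a rough illustration of what the fixed audio format implies for bandwidth, the sketch below computes the raw data rate of uncompressed PCM from the figures quoted above (16-bit, 48 kHz, stereo). The function name `pcm_bitrate_bps` is hypothetical and not part of SysDVR; this is plain arithmetic, not a description of how the project transmits audio.

```python
def pcm_bitrate_bps(bit_depth: int, sample_rate_hz: int, channels: int) -> int:
    """Raw bit rate of uncompressed PCM audio, in bits per second."""
    return bit_depth * sample_rate_hz * channels


# Figures taken from the description above: 16-bit PCM, 48 kHz, stereo.
bitrate = pcm_bitrate_bps(bit_depth=16, sample_rate_hz=48_000, channels=2)

print(bitrate)             # 1536000 bits per second (about 1.5 Mbit/s)
print(bitrate // 8)        # 192000 bytes per second
```

At roughly 1.5 Mbit/s the audio stream is small next to the video, so the video stream is the more likely bandwidth constraint on a given connection.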

Downloads: 103 This Week

Last Update: 2026-02-10
See Project
4

LosslessCut

The Swiss Army knife of lossless video/audio editing

LosslessCut aims to be the ultimate cross-platform FFmpeg GUI for extremely fast and lossless operations on video, audio, subtitle and other related media files. The main feature is lossless trimming and cutting of video and audio files, which is great for saving space by rough-cutting large video files taken from a video camera, GoPro, drone, etc. It lets you quickly extract the good parts from your videos and discard many gigabytes of data without doing a slow re-encode and thereby losing quality. ...

7 Reviews

Downloads: 630 This Week

Last Update: 2026-06-04
See Project
5

Praat

Doing Phonetics By Computer

Praat is a speech analysis application and source repository for doing phonetics by computer. It was created by Paul Boersma and David Weenink at the University of Amsterdam. The software lets researchers, linguists, clinicians, teachers, and students analyze, synthesize, manipulate, and annotate speech recordings. It supports acoustic inspection through waveforms, spectrograms, pitch tracks, formants, intensity, and related phonetic measurements. Praat also includes scripting tools for...

Downloads: 10 This Week

Last Update: 6 days ago
See Project
6

Namida

A Beautiful and Feature-rich Music & Video Player

Audio features commonly include gapless playback and adjustable options aimed at uninterrupted, consistent sound. Its UI aims to be lightweight and visually coherent, keeping controls accessible without cluttering the screen. Namida’s architecture favors reliability and predictable performance across large libraries, minimizing lag as collections grow.

Downloads: 570 This Week

Last Update: 2026-04-18
See Project
7

VoxCPM2

Tokenizer-Free TTS for Multilingual Speech Generation

VoxCPM2 is an advanced open-source text-to-speech system that redefines speech synthesis by eliminating traditional tokenization and instead generating continuous speech representations through a diffusion-based autoregressive architecture. Built on top of the MiniCPM model family, it enables highly natural, expressive, and context-aware speech generation that adapts tone, emotion, and pacing directly from input text. The system is trained on massive multilingual datasets, enabling support...

Downloads: 23 This Week

Last Update: 2026-04-28
See Project
8

NovaSR

A lightning fast audio upsampler

NovaSR is an extremely lightweight and high-performance audio upsampling model that transforms low-quality 16 kHz audio into clearer, high-fidelity 48 kHz audio with remarkable speed and efficiency. At only about 50 KB in size, the model is orders of magnitude smaller than typical audio super-resolution networks, yet it achieves high quality and realtime performance thanks to its compact architecture and efficient convolutional design. ...

Downloads: 0 This Week

Last Update: 2026-02-26
See Project
9

YouTube Playlist Downloader

A tool to download whole playlists, channels or single videos

...The tool allows users to input a playlist URL and automatically retrieve all associated videos, handling the sequence and download process in a structured way. It supports multiple output formats and quality settings, enabling users to choose between audio or video downloads depending on their needs. The application is built with usability in mind, often providing a graphical interface that abstracts away the complexity of command-line tools. It also manages file naming and organization, ensuring that downloaded content is stored in a clear and consistent structure. ...

Downloads: 94 This Week

Last Update: 2026-03-18
See Project
10

LatentSync

Taming Stable Diffusion for Lip Sync

LatentSync is an open-source framework from ByteDance that produces high-quality lip-synchronization for video by using an audio-conditioned latent diffusion model, bypassing traditional intermediate motion representations. In effect, given a source video (with masked or reference frames) and an audio track, LatentSync directly generates frames whose lip motions and expressions align with the audio, producing convincing talking-head or animated lip-sync output. ...

Downloads: 4 This Week

Last Update: 2025-12-02
See Project
11

YesPlayMusic

Play Music for Windows / macOS / Linux

High-quality third-party NetEase Cloud Music player that supports Windows, macOS, and Linux. Overseas users can play music directly (logging in to a NetEase Cloud account is required). Supports UnblockNeteaseMusic, which automatically replaces grayed-out song links with various audio sources (not supported in the web version). "Various audio sources" refers to the audio sources that are enabled by default.

Downloads: 18 This Week

Last Update: 2025-10-09
See Project
12

yt-dlp-gui

A cross-platform GUI wrapper for yt-dlp written in PySide6

...The project supports preset definitions and global arguments through a config file, so users can customize their most common download workflows—like audio extraction, quality ranking, or embedding thumbnails—without retyping arguments each time. Downloads can be initiated from a portable app bundle or run manually with Python, making it flexible across platforms including Windows and Linux.

Downloads: 400 This Week

Last Update: 2026-01-20
See Project
13

ffmpeg-normalize

Audio Normalization for Python/ffmpeg

...It can process multiple files in batch mode, making it suitable for large media libraries or production workflows. ffmpeg-normalize also preserves metadata and supports a wide range of input and output formats. Its design emphasizes accuracy and compliance with broadcasting standards. Overall, it provides a reliable solution for achieving consistent audio quality in multimedia content.

Downloads: 24 This Week

Last Update: 3 days ago
See Project
14

HunyuanVideo-Foley

Multimodal Diffusion with Representation Alignment

HunyuanVideo-Foley is a multimodal diffusion model from Tencent Hunyuan for high-fidelity Foley (sound effects) audio generation synchronized to video scenes. It is designed to generate audio that matches both visual content and textual semantic cues, for use in video production, film, advertising, games, etc. The model architecture aligns audio, video, and text representations to produce realistic synchronized soundtracks. Produces high-quality 48 kHz audio output suitable for professional use. ...

Downloads: 1 This Week

Last Update: 2025-09-28
See Project
15

Pedalboard

A Python library for audio

pedalboard is a Python library for working with audio: reading, writing, rendering, adding effects, and more. It supports the most popular audio file formats and a number of common audio effects out of the box and also allows the use of VST3® and Audio Unit formats for loading third-party software instruments and effects. pedalboard was built by Spotify’s Audio Intelligence Lab to enable using studio-quality audio effects from within Python and TensorFlow. ...

Downloads: 8 This Week

Last Update: 5 days ago
See Project
16

Anime Player

Video player for improving quality of hand-drawn images

A video player that enhances the quality of hand-drawn images using Anime4K's high-performance scaling algorithm. It is written in Python using the PySimpleGUI graphical user interface library, the mpv media player, and the Anime4K scaling algorithm. Anime Player plays video and audio files and can open files, URLs, and folders; set image scaling parameters for the Anime4K algorithm; create an mpv config for watching videos with Anime4K on Android; and show help and information about tuning the algorithm. ...

Downloads: 10 This Week

Last Update: 2026-06-27
See Project
17

Aural

An audio file player for macOS, inspired by Winamp

Aural is a desktop audio player designed to provide high-quality music playback with a focus on usability and advanced audio control. It supports a wide range of audio formats and emphasizes gapless playback and precise audio handling. The application includes features for playlist management, metadata browsing, and customizable playback controls. It is designed to handle large music libraries efficiently, offering filtering and sorting capabilities. Aural also integrates features such as equalization and audio effects to enhance the listening experience. ...

Downloads: 2 This Week

Last Update: 2026-04-24
See Project
18

Bili23 Downloader

Cross-platform GUI tool for downloading videos from Bilibili sites

...It can parse different types of links such as standard video pages, short links, and collection or activity pages to automatically retrieve downloadable media. It also allows users to choose video resolution, audio quality, and encoding format based on the available sources. Additional features include downloading subtitles, comments, metadata, and artwork associated with videos.

Downloads: 16 This Week

Last Update: 2026-06-28
See Project
19

SFBAudioEngine

A powerhouse of audio functionality for macOS, iOS, and tvOS

SFBAudioEngine is an advanced audio engine designed for macOS and iOS, focusing on high-quality playback, precise audio control, and support for a wide range of audio formats. Built for modern Apple platforms, it provides developers with a robust tool for integrating sophisticated audio functionalities into their applications. It emphasizes extensibility, performance, and clean API design.

Downloads: 0 This Week

Last Update: 2026-06-08
See Project
20

Lidarr

Looks and smells like Sonarr but made for music

Lidarr is an open-source music collection manager tailored to automate the tracking, downloading, and organizing of music tracks and albums from Usenet, BitTorrent, or other sources. It continuously monitors RSS feeds for new releases from your favorite artists, automatically retrieves them, sorts files into your library, and ensures consistent naming and tagging so your collection stays tidy and accessible. The tool also supports quality upgrades: if a better version of a track becomes...

Downloads: 2 This Week

Last Update: 2026-01-16
See Project
21

MOSS-TTS-Nano

MOSS-TTS-Nano is an open-source multilingual tiny speech generation

MOSS-TTS-Nano is a lightweight text-to-speech model designed for real-time voice generation in resource-constrained environments. It is part of the broader MOSS-TTS family and focuses on delivering high-quality speech synthesis with a compact architecture. The model operates efficiently on CPU-only systems, enabling deployment without specialized hardware. It supports multilingual voice cloning and produces high-fidelity audio with low latency. The system uses an autoregressive audio tokenization pipeline to generate natural-sounding speech. ...

Downloads: 2 This Week

Last Update: 2026-06-02
See Project
22

Miso TTS

Miso TTS is an 8-billion-parameter, highly emotive text-to-speech model

Miso TTS is an advanced 8-billion-parameter text-to-speech model developed by Miso Labs for generating highly expressive and natural-sounding conversational speech. Built on an RVQ Transformer architecture inspired by Sesame CSM, it combines a powerful Llama-based backbone with an autoregressive audio decoder to produce high-quality audio from text. The model supports both standard speech synthesis and voice-conditioned generation using optional audio prompts for voice cloning. Miso TTS generates Mimi audio codes and can leverage conversation history to create more contextually aware and realistic dialogue. Designed for local deployment, it offers watermarking by default to help promote responsible use of generated audio. ...

Downloads: 1 This Week

Last Update: 2026-06-09
See Project
23

IndexTTS2

Industrial-level controllable zero-shot text-to-speech system

IndexTTS is a modern, zero-shot text-to-speech (TTS) system engineered to deliver high-quality, natural-sounding speech synthesis with few requirements and strong voice-cloning capabilities. It builds on state-of-the-art models such as XTTS and other modern neural TTS backbones, improving them with a conformer-based speech conditional encoder and upgrading the decoder to a high-quality vocoder (BigVGAN2), leading to clearer and more natural audio output.

Downloads: 13 This Week

Last Update: 2025-11-27
See Project
24

VibeVoice ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

VibeVoice ComfyUI is a comprehensive wrapper that integrates Microsoft’s VibeVoice text-to-speech models directly into ComfyUI workflows. It exposes VibeVoice as a set of custom nodes so you can build single-speaker and multi-speaker voice generation pipelines visually, combining TTS with other audio or generative components. The integration supports high-quality single-speaker synthesis as well as scripted multi-speaker conversations, with optional voice cloning from audio samples for each speaker. It includes advanced control over generation parameters like attention backend, diffusion steps, sampling temperature, guidance scale, and quantization settings, allowing users to tune the trade-offs between quality, VRAM usage, and speed. ...

Downloads: 7 This Week

Last Update: 2025-11-28
See Project
25

LiveAvatar

Streaming Real-time Audio-Driven Avatar Generation

LiveAvatar is an open-source research and implementation project that provides a unified framework for real-time, streaming, interactive avatar video generation driven by audio and other control signals. It implements techniques from state-of-the-art diffusion-based avatar modeling to support infinite-length continuous video generation with low latency, enabling interactive AI avatars that maintain continuity and realism over extended sessions. The project co-designs algorithms and system optimizations, such as block-wise autoregressive processing and fast sampling strategies, to deliver real-time frame rates (e.g., ~45 FPS on appropriate GPU clusters) while handling non-stop generation without quality degradation. ...

Downloads: 0 This Week

Last Update: 2026-06-18
See Project