dvd-audio free download

4198 projects for "dvd-audio" with 1 filter applied:

BSD Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Error to trace to log to deploy. One click. No SSH.
Catch the cause before the pager goes off.

AppSignal links every error to the trace, the trace to the log, the log to the deploy that shipped it.

Free 30 days.
1

Kimi-Audio

Audio foundation model excelling in audio understanding

Kimi-Audio is an ambitious open-source audio foundation model designed to unify a wide array of audio processing tasks — from speech recognition and audio understanding to generative conversation and sound event classification — within a single cohesive architecture. Instead of fragmenting work across specialized models, Kimi-Audio handles automatic speech recognition (ASR), audio question answering, automatic audio captioning, speech emotion recognition, and audio-to-text chat in one system, enabling developers to build rich, multimodal audio applications without stitching together disparate components. ...

Downloads: 0 This Week

Last Update: 2026-01-27
See Project
2

Fun Audio Chat

Large Audio Language Model built for natural interactions

Fun Audio Chat is an interactive voice-first conversational AI platform designed to let users engage in natural spoken dialogue with large language models in real time, turning speech into context-aware responses while maintaining a smooth back-and-forth experience. It combines speech recognition, audio processing, and AI generation so users can speak simply and receive spoken replies, enabling applications such as virtual assistants, voice bots, and hands-free chat interfaces. ...

Downloads: 0 This Week

Last Update: 2026-02-27
See Project
3

Fish Audio Python SDK

The official Python library for the Fish Audio API

Fish Audio Python is the official Python SDK for working with the Fish Audio platform. It gives developers a programmatic way to access Fish Audio features such as text-to-speech generation, audio playback, saving output files, and API-based voice workflows. The package is designed for Python applications that need speech generation without manually handling raw HTTP requests.

Downloads: 2 This Week

Last Update: 2026-06-08
See Project
4

DVDStyler

A cross-platform DVD authoring application

DVDStyler is a cross-platform free DVD authoring application that makes possible for video enthusiasts to create professional-looking DVDs. DVDStyler provides over 20 DVD menu templates, allowing you to create your own menu designs and photo slideshows. After you select your DVD label name, video quality, video format, aspect ratio, and audio format, you can select a template to add video materials to.

167 Reviews

Downloads: 4,981 This Week

Last Update: 2024-08-06
See Project
MongoDB Atlas runs apps anywhere
Deploy in 115+ regions with the modern database for every enterprise.

MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.

Start Free
5

NeuralNote

Audio Plugin for Audio to MIDI transcription using deep learning

NeuralNote is an open-source audio software tool designed to convert recorded audio into MIDI data using modern machine learning techniques. The software functions as an audio plugin that can be used inside digital audio workstations as well as a standalone application for music production and analysis. Its main purpose is to perform audio-to-MIDI transcription, allowing musicians to record a performance and automatically transform it into editable MIDI notes. ...

Downloads: 85 This Week

Last Update: 2026-03-12
See Project
6

LTX-2.3

Official Python inference and LoRA trainer package

LTX-2.3 is an open-source multimodal artificial intelligence foundation model developed by Lightricks for generating synchronized video and audio from prompts or other inputs. Unlike most earlier video generation systems that only produced silent clips, LTX-2 combines video and audio generation in a unified architecture capable of producing coherent audiovisual scenes. The model uses a diffusion-transformer-based architecture designed to generate high-fidelity visual frames while simultaneously producing corresponding audio elements such as speech, music, ambient sound, or effects. ...

Downloads: 124 This Week

Last Update: 2026-05-28
See Project
7

Shairport Sync

AirPlay audio player

...In this way, synchronized multi-room audio is possible for players that support it, such as iTunes and the macOS Music app. Shairport Sync runs on Linux, FreeBSD and OpenBSD. It does not support AirPlay video or photo streaming. Shairport Sync offers full audio synchronization, a feature of AirPlay that previous implementations do not provide. Full audio synchronization means that audio is played on the output device at exactly the time specified by the audio source.

Downloads: 5 This Week

Last Update: 2026-04-27
See Project
8

SFBAudioEngine

A powerhouse of audio functionality for macOS, iOS, and tvOS

SFBAudioEngine is an advanced audio engine designed for macOS and iOS, focusing on high-quality playback, precise audio control, and support for a wide range of audio formats. Built for modern Apple platforms, it provides developers with a robust tool for integrating sophisticated audio functionalities into their applications. It emphasizes extensibility, performance, and clean API design.

Downloads: 3 This Week

Last Update: 2026-06-08
See Project
9

audio_video_streaming

Compilation of authoritative information on audio and video streaming

audio_video_streaming is a comprehensive curated repository that aggregates hundreds of resources related to audio and video streaming technologies, including articles, research papers, protocols, and practical projects. It serves as a learning hub for developers interested in multimedia systems, covering topics such as encoding, decoding, transmission protocols, and real-time communication frameworks. The repository includes example implementations like multi-user video chat systems, WebRTC demos, and cross-platform media players to provide hands-on learning opportunities. ...

Downloads: 0 This Week

Last Update: 2026-04-24
See Project
Stop Cyber Threats with VM-Series Next-Gen Firewall on Azure
Native application identity and user-based security for your Azure cloud

Gain integrated visibility across all traffic in a single pass. Deploy Palo Alto Networks VM-Series to determine application identity and content while automating security policy updates via rich APIs.

Get a free trial
10

MusicPlayer2

Audio player that can play common audio formats

MusicPlayer2 is a simple music-player application (or prototype) implemented in — presumably — a web or desktop environment, intended to give users a clean, functional interface for managing and playing audio files. The project likely implements basic playlist management, playback controls (play, pause, skip), and possibly UI features to browse or organize music. Because many smaller music-player projects aim for simplicity, MusicPlayer2 may focus on providing a lightweight, minimal-dependency audio player compared to larger, heavy multimedia suites. ...

Downloads: 8 This Week

Last Update: 2025-12-27
See Project
11

HeartMuLa

A Family of Open Sourced Music Foundation Models

...The project also includes HeartCodec, a music codec optimized for high reconstruction fidelity, enabling efficient tokenization and reconstruction workflows that are critical for training and generation pipelines. For text extraction from audio, it provides HeartTranscriptor, a Whisper-based model tuned specifically for lyrics transcription, which helps bridge generated or recorded audio back into structured text. It also introduces HeartCLAP, which aligns audio and text into a shared embedding space.

Downloads: 29 This Week

Last Update: 2026-04-10
See Project
12

pyAudioAnalysis

Python Audio Analysis Library: Feature Extraction, Classification

pyAudioAnalysis is an open-source Python library designed for audio signal analysis, machine learning, and music information retrieval tasks. The project provides a collection of tools that allow developers to extract meaningful features from audio files and use those features for classification, segmentation, and analysis. The library supports multiple audio processing workflows, including feature extraction from raw audio signals, training of machine learning models, and automatic audio segmentation. ...

Downloads: 2 This Week

Last Update: 2026-03-10
See Project
13

EeveeSpotify

A tweak to enhance Spotify experience

EeveeSpotifyReborn is an unofficial modification for the Spotify mobile application that alters client-side behavior to unlock premium-like features without requiring a paid subscription. It operates by injecting changes into the Spotify app, making it interpret the user account as having premium access and enabling functionalities that are normally restricted. The project was developed through reverse engineering techniques, including analyzing application behavior and intercepting requests...

Downloads: 144 This Week

Last Update: 2026-03-23
See Project
14

HLS.js

HLS.js is a JavaScript library that plays HLS in browsers

HLS.js is a JavaScript library that implements an HTTP Live Streaming client. It relies on HTML5 video and MediaSource Extensions for playback. It works by transmuxing MPEG-2 Transport Stream and AAC/MP3 streams into ISO BMFF (MP4) fragments. Transmuxing is performed asynchronously using a Web Worker when available in the browser. HLS.js also supports HLS + fmp4, as announced during WWDC2016. HLS.js works directly on top of a standard HTML<video> element. HLS.js is written in ECMAScript6...

Downloads: 15 This Week

Last Update: 2026-04-13
See Project
15

VoxCPM2

Tokenizer-Free TTS for Multilingual Speech Generation

...The system is trained on massive multilingual datasets, enabling support for dozens of languages and dialects while maintaining high fidelity and realism in generated audio. VoxCPM stands out for its ability to perform voice cloning with minimal input, capturing not only the speaker’s timbre but also nuanced features such as rhythm, accent, and emotional delivery. It also introduces voice design capabilities, allowing users to generate entirely new voices from natural language descriptions without requiring reference audio.

Downloads: 21 This Week

Last Update: 2026-04-28
See Project
16

ffmpeg-normalize

Audio Normalization for Python/ffmpeg

ffmpeg-normalize is a command-line utility designed to normalize audio levels in media files using FFmpeg, ensuring consistent volume across multiple tracks. It supports both EBU R128 loudness normalization and peak normalization methods, allowing users to choose the appropriate standard for their needs. The tool analyzes audio streams and applies adjustments to achieve target loudness levels without introducing distortion.

Downloads: 8 This Week

Last Update: 2026-05-21
See Project
17

OpenAI.fm

Code for openai.fm, a demo for the OpenAI Speech API

OpenAI.fm is an official interactive demo application built to showcase the OpenAI Speech API and its advanced text-to-speech capabilities, providing developers and creators with a hands-on web interface to convert text into high-quality, customizable audio using state-of-the-art TTS models. Developed using Next.js and the OpenAI Speech API, this demo illustrates how the latest neural voice models can produce natural, expressive speech with adjustable styles and voices, highlighting features like emotional range, tone, and real-time playback. Users can experiment with different input text and voice options directly in their browser, gaining a sense of how high-fidelity AI audio can be integrated into applications ranging from podcasts and narration to accessibility tools and interactive agents. ...

Downloads: 15 This Week

Last Update: 2026-01-28
See Project
18

Dia2

TTS model capable of streaming conversational audio in realtime

Dia2 is a streaming dialogue text-to-speech model created by Nari Labs for generating conversational audio in real time. It is designed to begin producing speech before receiving the entire input text, which makes it useful for interactive voice applications. The model supports audio conditioning, allowing generated speech to follow a reference voice or conversational style more naturally. Dia2 provides 1B and 2B model checkpoints along with inference code for research and experimentation. ...

Downloads: 2 This Week

Last Update: 2026-06-08
See Project
19

HunyuanVideo-Avatar

Tencent Hunyuan Multimodal diffusion transformer (MM-DiT) model

HunyuanVideo-Avatar is a multimodal diffusion transformer (MM-DiT) model by Tencent Hunyuan for animating static avatar images into dynamic, emotion-controllable, and multi-character dialogue videos, conditioned on audio. It addresses challenges of motion realism, identity consistency, and emotional alignment. Innovations include a character image injection module, an Audio Emotion Module for transferring emotion cues, and a Face-Aware Audio Adapter to isolate audio effects on faces, enabling multiple characters to be animated in a scene. Character image injection module for better consistency between training and inference conditioning. ...

Downloads: 2 This Week

Last Update: 2025-12-16
See Project
20

AudioNotes

Extract audio and video content and organize it into a Markdown note

AudioNotes is an application (or proof-of-concept) that likely combines audio recording or playback with note-taking or annotation functionality — enabling users to record voice or audio and attach textual or timestamped notes, making it ideal for lectures, interviews, meetings, or personal memos. Such a tool offers a more expressive and flexible way to capture and revisit information: instead of just typed notes or raw audio, users get both audio context and structured notes. ...

Downloads: 0 This Week

Last Update: 2025-12-04
See Project
21

NovaSR

A lightning fast audio upsampler

NovaSR is an extremely lightweight and high-performance audio upsampling model that transforms low-quality 16 kHz audio into clearer, high-fidelity 48 kHz audio with remarkable speed and efficiency. At only about 50 KB in size, the model is orders of magnitude smaller than typical audio super-resolution networks, yet it achieves high quality and realtime performance thanks to its compact architecture and efficient convolutional design.

Downloads: 0 This Week

Last Update: 2026-02-26
See Project
22

Bili23 Downloader

Cross platform GUI tool for downloading videos from Bilibili sites

...It can parse different types of links such as standard video pages, short links, and collection or activity pages to automatically retrieve downloadable media. It also allows users to choose video resolution, audio quality, and encoding format based on the available sources. Additional features include downloading subtitles, comments, metadata, and artwork associated with videos.

Downloads: 34 This Week

Last Update: 2026-06-01
See Project
23

MOSS-TTS-Nano

MOSS-TTS-Nano is an open-source multilingual tiny speech generation

...The model operates efficiently on CPU-only systems, enabling deployment without specialized hardware. It supports multilingual voice cloning and produces high-fidelity audio with low latency. The system uses an autoregressive audio tokenization pipeline to generate natural-sounding speech. It is suitable for local applications, web services, and embedded systems. Overall, it brings advanced speech synthesis capabilities to lightweight and accessible environments.

Downloads: 9 This Week

Last Update: 2026-06-02
See Project
24

Speakr

Speakr is a personal, self-hosted web application

...It provides a clean, user-friendly interface where users can input text, choose a voice style or language, and immediately hear the output, making it ideal for accessibility, content creation, and learning applications. Behind the scenes, Speakr leverages modern TTS engines and streaming audio technologies to deliver smooth and responsive speech generation without noticeable delay. The project is built with extensibility in mind, enabling developers to add custom voices, integrate additional languages, and tailor the backend for different hardware or cloud environments. It also supports saving generated audio as downloadable files so users can reuse the speech outputs in other projects, presentations, or media content.

Downloads: 4 This Week

Last Update: 2026-06-10
See Project
25

SFML

Simple and Fast Multimedia Library

SFML provides a simple interface to the various components of your PC, to ease the development of games and multimedia applications. It is composed of five modules: system, window, graphics, audio and network. Discover their features more in detail in the tutorials and the API documentation. With SFML, your application can compile and run out of the box on the most common operating systems: Windows, Linux, macOS and soon Android & iOS. Pre-compiled SDKs for your favorite OS are available on the download page. SFML has official bindings for the C and .Net languages. ...

Downloads: 72 This Week

Last Update: 2026-04-16
See Project