dvd-audio free download

Showing 18 open source projects for "dvd-audio"

View related business solutions

Agentic AI Linux Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Ship Agents Faster
Transform your applications and workflows into powerful agentic systems at global scale.

Gemini Enterprise Agent Platform lets you rapidly build, scale, govern and optimize production-ready agents grounded in your organization's data. The platform enables developers to build custom or pre-built agents for virtually any use case. New customers get $300 in free credits.

Get Started Free
1

MOSS-TTS-Nano

MOSS-TTS-Nano is an open-source multilingual tiny speech generation

...The model operates efficiently on CPU-only systems, enabling deployment without specialized hardware. It supports multilingual voice cloning and produces high-fidelity audio with low latency. The system uses an autoregressive audio tokenization pipeline to generate natural-sounding speech. It is suitable for local applications, web services, and embedded systems. Overall, it brings advanced speech synthesis capabilities to lightweight and accessible environments.

Downloads: 9 This Week

Last Update: 2026-06-02
See Project
2

Claude Code Video Vision

Give Claude the ability to watch and understand videos

...It supports multiple backends for audio processing, including local and cloud-based options, enabling flexible deployment depending on privacy or performance requirements.

Downloads: 3 This Week

Last Update: 2026-05-18
See Project
3

video-use

Edit videos with Claude Code

...Designed to work with Claude Code, it automates the entire editing process—from cutting clips to rendering the final output—without requiring manual timelines or complex software interfaces. The system intelligently analyzes audio transcripts and visual cues to make precise, context-aware editing decisions. It supports a wide range of content types, including interviews, tutorials, montages, and talking-head videos. By combining structured text representations with on-demand visual previews, it minimizes processing overhead while maintaining high-quality results. ...

Downloads: 18 This Week

Last Update: 2026-05-15
See Project
4

notebooklm-py

Unofficial Python API and agentic skill for Google NotebookLM

...The project covers notebook management, source ingestion, conversational querying, research workflows, and sharing controls, while also enabling the generation of a wide range of study and media artifacts. These outputs include audio overviews, videos, slide decks, infographics, quizzes, flashcards, reports, data tables, and mind maps, with configurable formats and export options.

Downloads: 9 This Week

Last Update: 2 days ago
See Project
AI-powered service management for IT and enterprise teams
Enterprise-grade ITSM, for every business

Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity. Maximize operational efficiency with refreshingly simple, AI-powered Freshservice.

Try it Free
5

OpenAI Python

The official Python library for the OpenAI API

The OpenAI Python library provides convenient access to the OpenAI REST API from any Python 3.7+ application. The library includes type definitions for all request params and response fields, and offers both synchronous and asynchronous clients powered by httpx.

Downloads: 8 This Week

Last Update: 2 days ago
See Project
6

E2B

Secure open source cloud runtime for AI apps & AI agents

E2B's Code Interpreter SDK allows you to add code-interpreting capabilities to your AI apps. E2B Sandbox is a secure sandboxed cloud environment made for AI agents and AI apps. Sandboxes allow AI agents and apps to have long-running cloud secure environments. In these environments, large language models can use the same tools as humans do.

Downloads: 9 This Week

Last Update: 10 hours ago
See Project
7

Vision Agents

Open Vision Agents by Stream. Build voice and vision agents quickly

...Vision Agents is model-agnostic, so developers can connect providers such as OpenAI, Gemini, Claude, Hugging Face, YOLO, Roboflow, and others. Its main value is giving developers a flexible foundation for multimodal agents that operate on live audio and video instead of only static prompts.

Downloads: 3 This Week

Last Update: 2026-06-11
See Project
8

infinite-canvas

Infinite Canvas Workbench for AI creation integrates AI generation

...Users can work across multiple canvases, drag and scale nodes, connect ideas visually, use a minimap, undo changes, and import or export work. The project supports OpenAI-compatible API connections for text-to-image, image-to-image, reference editing, text chat, audio generation, and video generation. It also includes a canvas assistant that can discuss selected nodes, use upstream context, generate new outputs, and place results back onto the canvas. The project is still in active development and is better suited for personal or local deployment than stable public multi-user production use.

Downloads: 0 This Week

Last Update: 3 days ago
See Project
9

Violin

Open-source Video Translation Skill

Violin is an open-source video translation and dubbing tool that turns existing videos into localized versions with translated voice-over and optional subtitles. It transcribes the original speech, translates the text, generates natural-sounding speech in the target language, and remuxes the new audio back into the video. The project is designed to keep the generated speech aligned with the original timing so the final result feels closer to a real dubbed video. It can be used from the command line, through a FastAPI web app, or as a Claude Code skill. Violin supports multilingual workflows and is useful for creators, educators, localization teams, and developers building automated video translation pipelines. ...

Downloads: 0 This Week

Last Update: 2026-05-19
See Project
Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
10

OpenAI Realtime Agents

This is a simple demonstration of more advanced, agentic patterns

This repository demonstrates how to build low-latency, streaming “voice + chat” agents using OpenAI’s Realtime API combined with the OpenAI Agents SDK. The demo shows patterns for connecting a realtime voice stream (audio in/out) with agents that can use tools, maintain state, and orchestrate multi-agent workflows. The SDK offers abstractions such as agent orchestration, event handling, handoffs, state management, and guardrails, tailored to support realtime, conversational systems. The demo includes a Next.js frontend for browser interaction and likely a backend component to orchestrate realtime sessions and agent logic. ...

Downloads: 0 This Week

Last Update: 2026-01-07
See Project
11

BotSharp

AI Multi-Agent Framework in .NET

...It opens up as much learning power as possible for your own robots and precisely control every step of the AI processing pipeline. BotSharp is an open source machine learning framework for AI Bot platform builder. This project involves natural language understanding, computer vision and audio processing technologies, and aims to promote the development and application of intelligent robot assistants in information systems. Out-of-the-box machine learning algorithms allow ordinary programmers to develop artificial intelligence applications faster and easier. It's written in C# running on .Net Core that is full cross-platform framework. ...

Downloads: 0 This Week

Last Update: 2025-10-17
See Project
12

NodeTool

Visual AI Workflow Builder

NodeTool is an open‑source, visual AI workflow builder that lets you connect nodes for text, images, audio, video, data, and automation—then run them locally or on the cloud. Build multi‑step agents, RAG systems, and creative media pipelines without coding, inspect execution in real time, and deploy anywhere: home server, private VPC, RunPod, or Cloud Run. With a local‑first design, NodeTool keeps models and data under your control while still supporting providers like OpenAI, Anthropic, Replicate, and HuggingFace. ...

Downloads: 3 This Week

Last Update: 2026-01-20
See Project
13

ILA - teachable voice assistant

ILA is a fully customizable and teachable voice assistant for Java

ILA stands for (kind of) intelligent, learning assistant and is a speech recognition system aka voice assistant very similar to Siri, Google Now and Cortana. ILA is fully customizable and you can teach her/him/it new things by yourself like executing system commands, opening web pages, programs and apps or just some basic conversation :-) ILA runs on Java und thus is compatible to Windows, Mac and Linux. It is designed to integrate with your home enviroment and for example build up your own,...

4 Reviews

Downloads: 0 This Week

Last Update: 2018-07-23
See Project
14

openSMILE

SMILE = Speech & Music Interpretation by Large Space Extraction openSMILE is a fast, real-time (audio) feature extraction utility for automatic speech, music and paralinguistic recognition research developed originally at TUM in the scope of the EU-project SEMAINE, now maintained and supported by audEERING.

Downloads: 0 This Week

Last Update: 2014-11-27
See Project
15

openEAR

openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classfiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.

4 Reviews

Downloads: 10 This Week

Last Update: 2015-08-06
See Project
16

bluejam

BlueJam is a Java-based algorithmic music composer that uses evolutionary techniques and heuristics. Originally intended to evolve solos on the blues scale. BlueJam interfaces with Pure-Data to give real-time output.

Downloads: 1 This Week

Last Update: 2013-03-22
See Project
17

Music Agent

Music agent is a software agent designed to help people discover new creative commons licensed music according to their personal taste.

Downloads: 0 This Week

Last Update: 2014-05-09
See Project
18

Musical Multiagent System

This project is an implementation of a computational framework that addresses general-interest low-level problems such as real-time synchronization, sound communication and spatial agent mobility.

Downloads: 0 This Week

Last Update: 2013-04-22
See Project