audio processing free download

Showing 13 open source projects for "audio processing"

View related business solutions

Software Development Python Clear Filters & Widen Search

$300 Free Credits for Your Google Cloud Projects
Start building on Google Cloud with $300 in free credits. No commitment, no credit card required until you're ready to scale.

Launch your next project with $300 in free Google Cloud credits—no strings attached. Test, build, and deploy without risk. Use your credits across the entire Google Cloud platform to find what works best for your needs. After your credits are used, continue with always-free tier services. Only pay when you're ready to scale. Sign up in minutes and start exploring.

Start Free Trial
Save Up to 91% on Cloud Compute With Spot VMs
Automatic sustained-use discounts. One free VM per month. No negotiation needed.

Run batch jobs at 60-91% off with Spot VMs. Long-running workloads get automatic discounts with sustained use.

Try Free
1

txtai

Build AI-powered semantic search applications

txtai executes machine-learning workflows to transform data and build AI-powered semantic search applications. Traditional search systems use keywords to find data. Semantic search applications have an understanding of natural language and identify results that have the same meaning, not necessarily the same keywords. Backed by state-of-the-art machine learning models, data is transformed into vector representations for search (also known as embeddings). Innovation is happening at a rapid...

Downloads: 4 This Week

Last Update: 2026-07-01
See Project
2

AudioCraft

Audiocraft is a library for audio processing and generation

...It also contains training code and recipes, so researchers can fine-tune on custom data or explore new objectives without building infrastructure from scratch. Example notebooks, CLI tools, and audio utilities help with prompt design, conditioning on reference audio, and post-processing to produce ready-to-share outputs.

Downloads: 8 This Week

Last Update: 2025-10-13
See Project
3

Jina

Build cross-modal and multimodal applications on the cloud

Jina is a framework that empowers anyone to build cross-modal and multi-modal applications on the cloud. It uplifts a PoC into a production-ready service. Jina handles the infrastructure complexity, making advanced solution engineering and cloud-native technologies accessible to every developer. Build applications that deliver fresh insights from multiple data types such as text, image, audio, video, 3D mesh, PDF with Jina AI’s DocArray. Polyglot gateway that supports gRPC, Websockets, HTTP,...

Downloads: 0 This Week

Last Update: 2024-11-12
See Project
4

Pybris

B language compiler written in Python targeting RISVM

Pybris is a compiler written in Python using Pyparsing for the B Programming Language. The compiler emits a variant of Bitmario RISVM assembly. The practical goal of the project is to provide a way to develop digital signal processing (DSP) effects for the Competent Audio library that is a friendlier alternative to writing RISVM assembly by hand. Pybris is written for Python 2.7, but has also been tested to run with Python 3.8.10.

Downloads: 0 This Week

Last Update: 2024-07-08
See Project
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
5

Ultimate Media Downloader

An Open source media downloader for downloading videos and audios

...Built with Python and powered by industry-standard extraction engines, it delivers enterprise-level capabilities with consumer-friendly simplicity. Whether you're downloading a single YouTube video, extracting audio from Spotify playlists, archiving TikTok content, or batch-processing entire music libraries, UMD handles it all with elegance and efficiency. IT CONSISTS OF : 1. Unified Interface: One command, 1000+ platforms. No tool shopping, no mental model switching. 2. Production-Ready, Zero Friction Installation: Most users go from hearing about the tool to downloading content in under 5 minutes. 3. ...

Downloads: 0 This Week

Last Update: 2026-06-09
See Project
6

OmniPull

Just pull anything

OmniPull is a powerful, cross-platform download manager built with Python and PySide6. It provides a modern, intuitive interface for managing downloads with advanced features like multi-threading, queue management, and media extraction.

Downloads: 4 This Week

Last Update: 2026-06-11
See Project
7

SVoice (Speech Voice Separation)

We provide a PyTorch implementation of the paper Voice Separation

...The repository includes all necessary scripts for training, dataset preparation, distributed training, evaluation, and audio separation.

Downloads: 1 This Week

Last Update: 6 days ago
See Project
8

aeneas

Automagically synchronize audio and text (aka forced alignment)

aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.

Downloads: 3 This Week

Last Update: 2024-06-21
See Project
9

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and...

Downloads: 3 This Week

Last Update: 2019-08-21
See Project
Train ML Models With SQL You Already Know
BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.

Try Free
10

InproTK

An Incremental Spoken Dialogue Processing Toolkit

InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/

Downloads: 0 This Week

Last Update: 2015-06-16
See Project
11

NovaX Home Application System

NovaX is a set of programs that is being devloped for small company's and beginners in the fields of HTML and programming. Coded in Python and C++, this is also a good replacement for MS Office. NOTE: This requires Python to be on your PC. ( Python.org )

Downloads: 0 This Week

Last Update: 2014-04-28
See Project
12

Panda

A modular audio programming language, designed to write applications quickly. Its main goal is real time audio processing, but it should be used for any kind of development.

Downloads: 0 This Week

Last Update: 2013-02-27
See Project
13

XML-VIS

An XML-based musical score processing framework written in Python, which outputs CSound score files.

Downloads: 0 This Week

Last Update: 2013-02-22
See Project