Join/Login
Business Software
Open Source Software
For Vendors
Blog
About
More

For Vendors Help Create Join Login

Business Software

Open Source Software

SourceForge Podcast

Resources

Articles
Case Studies
Blog

Menu

Help
Create
Join
Login

Home
Open Source Software
Search Results

Search Results for "audio processing" - Page 4

x

Sort By:

Relevance

Clear All Filters

OS

BSD 200
Linux 198
Windows 171
More...
Mac 159
ChromeOS 146
Desktop Operating Systems 13
Mobile Operating Systems 3
Server Operating Systems 2
Game Consoles 1

Category

Multimedia 138
Artificial Intelligence 54
Software Development 30
Scientific/Engineering 20
System 14
Games 6
Business 5
Communications 5
Text Editors 5
Internet 3
Database 1
Desktop Environment 1
Education 1
Security 1
Social sciences 1

License

OSI-Approved Open Source 177
Other License 3
Creative Commons Attribution License 2
Public Domain 1

Translations

English 60
German 12
French 8
Italian 4
More...
Russian 4
Dutch 3
Japanese 3
Portuguese 3
Spanish 3
Brazilian Portuguese 2
Catalan 2
Chinese (Simplified) 2
Estonian 2
Polish 2
Turkish 2
Arabic 1
Croatian 1
Czech 1
Danish 1
Finnish 1
Galician 1
Greek 1
Hebrew 1
Hungarian 1
Romanian 1
Slovak 1
Swedish 1
Telugu 1
Ukrainian 1

Programming Language

Status

Production/Stable 34
Beta 30
Pre-Alpha 19
Alpha 18
More...
Planning 10
Inactive 5
Mature 4

200 projects for "audio processing" with 1 filter applied:

BSD Clear Filters & Widen Search

Our Free Plans just got better! | Auth0
With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.

Try free now
Enterprise-grade ITSM, for every business
Give your IT, operations, and business teams the ability to deliver exceptional services—without the complexity.

Freshservice is an intuitive, AI-powered platform that helps IT, operations, and business teams deliver exceptional service without the usual complexity. Automate repetitive tasks, resolve issues faster, and provide seamless support across the organization. From managing incidents and assets to driving smarter decisions, Freshservice makes it easy to stay efficient and scale with confidence.

Try it Free
1

wasmboy

Game Boy / Game Boy Color Emulator Library

wasmboy is a Game Boy and Game Boy Color emulator built using WebAssembly and JavaScript, designed to run efficiently in both browsers and Node environments. It leverages modern web technologies such as HTML5 canvas and the Web Audio API to deliver graphics and sound directly within a web interface. The project emphasizes portability and integration, allowing it to be embedded into other applications as a reusable dependency. It supports a wide range of emulator features including save...

Downloads: 0 This Week

Last Update: 2026-04-07
See Project
2

Piano transcription

Task of transcribing piano recordings into MIDI files

Piano transcription is an open-source high-resolution piano transcription system by ByteDance that converts raw audio recordings of piano performance into symbolic MIDI files — detecting note onsets, offsets, pitch, velocity, and even pedal usage. The system is implemented in Python (PyTorch) and is capable of accurate transcription of polyphonic piano recordings, even with complex passages and pedal techniques, making it suitable for classical piano music. By using this transcription tool,...

Downloads: 2 This Week

Last Update: 2025-12-02
See Project
3

AhoTTS - TTS for Basque and Spanish

Text-to-Speech for Basque and Spanish

Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its acoustic engine is based on hts_engine and it uses a high quality vocoder called AhoCoder. Developed by Aholab Signal Processing Laboratory: https://aholab.ehu.es/aholab/ http://aholab.ehu.es/ahocoder/

1 Review

Downloads: 0 This Week

Last Update: 2022-05-03
See Project
4

AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

AutoSub is a Python-based tool designed to automatically generate subtitles for video or audio content using speech recognition technology. It processes media files by extracting audio, transcribing spoken content, and generating subtitle files in standard formats. The tool supports multiple languages and can integrate with translation systems to produce subtitles in different languages. It is designed for automation, allowing batch processing of multiple media files. ...

Downloads: 6 This Week

Last Update: 2026-04-28
See Project
Go From AI Idea to AI App Fast
One platform to build, fine-tune, and deploy ML models. No MLOps team required.

Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.

Try Free
5

Music Source Separation

Separate audio recordings into individual sources

Music Source Separation is a PyTorch-based open-source implementation for the task of separating a music (or audio) recording into its constituent sources — for example isolating vocals, instruments, bass, accompaniment, or background from a mixed track. It aims to give users the ability to take any existing song and decompose it into separate stems (vocals, accompaniment, etc.), or to train custom separation models on their own datasets (e.g. for speech enhancement, instrument isolation, or...

Downloads: 5 This Week

Last Update: 2025-12-02
See Project
6

SVoice (Speech Voice Separation)

We provide a PyTorch implementation of the paper Voice Separation

...The repository includes all necessary scripts for training, dataset preparation, distributed training, evaluation, and audio separation.

Downloads: 0 This Week

Last Update: 14 hours ago
See Project
7

hora

Efficient approximate nearest neighbor search algorithm collections

hora is an open-source high-performance vector similarity search library designed for large-scale machine learning and information retrieval systems. The project focuses on approximate nearest neighbor search, a fundamental technique used in modern AI applications such as recommendation systems, image search, and semantic search engines. Hora implements multiple efficient indexing algorithms that allow systems to rapidly search through high-dimensional vectors produced by machine learning...

Downloads: 0 This Week

Last Update: 2026-03-11
See Project
8

AAXtoMP3

Convert Audible's .aax filetype to MP3, FLAC, M4A, or OPUS

...AAXtoMP3 supports batch processing, enabling users to convert multiple files in a single workflow. Its minimal setup and script-based usage make it suitable for automation and integration into personal media pipelines. Overall, it provides a practical solution for managing audiobook libraries in open formats.

Downloads: 3 This Week

Last Update: 2026-04-24
See Project
9

Delphi ASIO & VST Packages

With these packages for Delphi the user can easily create VST plugins or ASIO applications within minutes. The included algorithms for filters and dynamics help to built effects without much knowledge of digital signal processing.

4 Reviews

Downloads: 6 This Week

Last Update: 2021-09-30
See Project
Build Securely on AWS with Proven Frameworks
Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.

Download Now
10

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with ground truth labeling. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). ...

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
11

CHOW Phaser

Phaser effect based loosely on the Schulte Compact Phasing 'A'

ChowPhaser is an open-source audio plugin that emulates the classic Schulte Compact Phasing 'A' effect. It offers a unique phasing effect with nonlinear feedback and modulation capabilities, suitable for various audio processing applications.

Downloads: 0 This Week

Last Update: 2025-05-08
See Project
12

Denoiser

Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

...The implementation includes data augmentation techniques applied to the raw waveforms (e.g. noise mixing, reverberation) to improve model robustness and generalization to diverse noise types. The project supports both offline denoising (batch inference) and live audio processing (e.g. via loopback audio interfaces), making it practical for real-time use in calls or recording. The codebase includes training and evaluation scripts, configuration management via Hydra, and pretrained models on standard noise datasets.

Downloads: 2 This Week

Last Update: 2025-10-07
See Project
13

Live Transcribe Speech Engine

Live Transcribe is an Android application

...Its design prioritizes latency and robustness in noisy, far-field environments, enabling continuous transcription with low delay on mobile hardware. The engine manages audio front-end processing—such as noise suppression and voice activity detection—before feeding audio into compact, accurate acoustic and language models. Partial hypotheses stream as words are recognized, then stabilize with minimal jitter as confidence increases, which is crucial for usability. The code emphasizes efficient use of CPU and neural accelerators to balance battery life with responsiveness. ...

Downloads: 0 This Week

Last Update: 2025-10-10
See Project
14

hplayer

A multi-screen player using Qt + FFmpeg

...It focuses on providing a minimal yet functional implementation of video playback, including decoding, rendering, and synchronization. The project is structured as a learning resource, helping developers understand the fundamentals of multimedia pipelines. It supports common audio and video formats and includes playback controls for managing media streams. The architecture emphasizes performance and simplicity, using native libraries to achieve efficient playback. It also demonstrates integration between UI layers and low-level media processing components. Overall, it serves as a practical reference for building custom media players.

Downloads: 0 This Week

Last Update: 2026-04-24
See Project
15

Perl Audio Converter

Linux Audio Converter / Tagger / CD Ripper

...It can also extract audio from the following video extensions: RM, RV, ASF, DivX, MPG, MKV, MPEG, AVI, MOV, OGM, OGV, QT, VCD, SVCD, M4V, NSV, NUV, PSP, SMK, VOB, FLV, WEBM and WMV. Parallel Processing, a CD ripping function with CDDB support, batch conversion, tag preservation for most supported formats, independent tag reading & writing, service menus for KDE Dolphin/Konqueror, Gnome Nautilus script, and action scripts for Nemo/Thunar are also provided.

4 Reviews

Downloads: 24 This Week

Last Update: 2021-02-09
See Project
16

mda VST plug-ins

Source code for "mda" audio processing plug-ins in VST format. Available for many years as closed-source freeware from mda-vst.com

3 Reviews

Downloads: 180 This Week

Last Update: 2021-01-22
See Project
17

quick-media

media(audio/image/qrcode/markdown/html/svg/png) support

quick-media is a lightweight multimedia processing toolkit designed to simplify common video and audio operations through streamlined command execution. It provides a wrapper around FFmpeg functionality, enabling users to perform tasks such as transcoding, clipping, and format conversion with simplified commands. The tool emphasizes ease of use while still allowing access to advanced encoding parameters when needed.

Downloads: 0 This Week

Last Update: 2026-05-01
See Project
18

Sysex Osc Generator

A Sysex OSC hex string generator for the X32/X-Air/Wing digital mixers

The Sysex OSC Generator provides a means of selecting a desired OSC command for the Behringer X32, X-Air or Wing digital mixer and generating the Sysex OSC hex string. This can be added to any midi device that supports sysex sending of commands. Available for the PC, Mac, linux (32 and 64bit) and Raspberry Pi platforms. Feedback of suggestions and bug reports that would improve the app would be appreciated.

Downloads: 7 This Week

Last Update: 2020-12-17
See Project
19

Xabe.FFmpeg

.NET Standard wrapper for FFmpeg. It allows to process media

Xabe.FFmpeg is a .NET library that provides a high-level wrapper for FFmpeg, allowing developers to perform multimedia operations using a clean and intuitive API. It simplifies complex command-line interactions by offering structured methods for tasks such as conversion, concatenation, and streaming. The library supports both synchronous and asynchronous execution, making it suitable for scalable applications. It includes utilities for retrieving media information through FFprobe, enabling...

Downloads: 0 This Week

Last Update: 2026-04-27
See Project
20

LiVES

LiVES is a Video Editing System. It is designed to be simple to use, y

LiVES mixes realtime video performance and non-linear editing in one professional quality application. It is designed to be simple to use, yet powerful. It is small in size, yet it has many advanced features. Using LiVES, you can start editing and making video right away, without having to worry about formats, frame sizes, or framerates. It is a very flexible tool which is used by both professional VJ's and video editors - mix and switch clips from the keyboard, use dozens of realtime...

15 Reviews

Downloads: 8 This Week

Last Update: 2020-11-08
See Project
21

Jacktube

Jacktube is an audio/MIDI processing program using LADSPA plugins.

Downloads: 0 This Week

Last Update: 2020-10-12
See Project
22

ffmpeg.js

Port of FFmpeg with Emscripten

ffmpeg.js is a JavaScript port of the FFmpeg multimedia framework compiled with Emscripten, enabling video and audio processing directly within browsers or Node.js environments. It provides prebuilt modules optimized for web use, balancing performance and file size while supporting common encoding and decoding tasks. By running entirely in JavaScript through asm.js, it allows developers to manipulate media files without requiring native binaries or server-side processing. ...

Downloads: 0 This Week

Last Update: 2026-04-26
See Project
23

videoshow

Simple node.js utility to create video slideshows from images

videoshow is a Node.js utility designed to create video slideshows from a sequence of images using FFmpeg as its processing engine. It allows developers to programmatically generate videos by combining images with optional audio tracks, subtitles, and visual transitions. The tool supports customization of parameters such as frame rate, resolution, bitrate, and codecs, enabling flexible output configurations. It includes both a programmatic API and a command-line interface, making it adaptable for different workflows. videoshow processes media efficiently and is used in production environments to generate large volumes of videos automatically. ...

Downloads: 0 This Week

Last Update: 2026-05-02
See Project
24

MediaToolkit

A .NET library to convert and process all your video & audio files

MediaToolkit is a .NET library designed to simplify multimedia processing tasks by providing an easy-to-use interface over FFmpeg functionality. It allows developers to perform operations such as video conversion, thumbnail generation, and metadata extraction without dealing with raw command-line syntax. The library supports common media workflows, making it suitable for backend services and desktop applications. It provides structured APIs for configuring encoding parameters and handling...

Downloads: 0 This Week

Last Update: 2026-04-27
See Project
25

Ecasound

Ecasound is a software package designed for multitrack audio processing. It can be used for simple tasks like audio playback, recording and format conversions, as well as for multitrack effect processing, mixing, recording and signal recycling. Ecasound

2 Reviews

Downloads: 0 This Week

Last Update: 2022-10-02
See Project

Previous
1
2
3
You're on page 4
5
6
7
8
Next

Related Searches

vst

speech synthesis

mp3 to amr converter

mda vst 64 bit

midi to osc converter

video editor

sapi 5 voices

wasapi

asio audio bridge

asio audio

Related Categories

Multimedia

Artificial Intelligence

Software Development

Scientific/Engineering

System

SourceForge

Create a Project
Open Source Software
Business Software
Top Downloaded Projects

Company

About
Team
SourceForge Headquarters
1320 Columbia Street Suite 310
San Diego, CA 92101
+1 (858) 422-6466

Resources

Support
Site Documentation
Site Status
SourceForge Reviews

© 2026 Slashdot Media. All Rights Reserved.

Terms Privacy Opt Out Advertise