Search Results for "audio processing" - Page 4

Showing 98 open source projects for "audio processing"

  • 1
    Denoiser

    Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)

    ...The implementation includes data augmentation techniques applied to the raw waveforms (e.g. noise mixing, reverberation) to improve model robustness and generalization to diverse noise types. The project supports both offline denoising (batch inference) and live audio processing (e.g. via loopback audio interfaces), making it practical for real-time use in calls or recording. The codebase includes training and evaluation scripts, configuration management via Hydra, and pretrained models on standard noise datasets.
    Downloads: 2 This Week
  • 2
    Savify

    Download Spotify songs to mp3 with full metadata and cover art

    Savify is a command-line tool designed to download and archive music from Spotify by leveraging YouTube as the audio source while preserving Spotify metadata. It allows users to input playlists, albums, or individual tracks and automatically retrieves matching audio files with proper tagging. The tool integrates FFmpeg and yt-dlp to handle downloading, conversion, and formatting into common audio formats such as MP3. It enriches files with metadata including artist, album, cover art, and...
    Downloads: 6 This Week
  • 3
    LiVES

    LiVES is a Video Editing System.

    LiVES mixes realtime video performance and non-linear editing in one professional-quality application. It is designed to be simple to use, yet powerful. It is small in size, yet it has many advanced features. Using LiVES, you can start editing and making video right away, without having to worry about formats, frame sizes, or framerates. It is a very flexible tool which is used by both professional VJs and video editors - mix and switch clips from the keyboard, use dozens of realtime...
    Downloads: 8 This Week
  • 4
    Monkey-DL

    Bulk download your favourite anime episodes from your favourite anime

    Monkey-DL is a command-line media downloader designed to retrieve video and audio content from online platforms with flexibility and automation. It integrates with tools like FFmpeg to handle post-processing tasks such as merging streams, converting formats, and optimizing output quality. The tool supports downloading single media files or entire playlists, enabling efficient batch operations. It includes options for selecting resolution, format, and output structure, giving users fine control over downloads. Monkey-DL is built for simplicity, providing straightforward commands while still supporting advanced configurations. ...
    Downloads: 0 This Week
  • 5
    MystiQ

    Qt5/C++ FFmpeg Media Converter

    MystiQ is a cross-platform multimedia converter built with Qt and FFmpeg, designed to provide a modern graphical interface for video and audio processing tasks. It allows users to perform operations such as transcoding, trimming, and format conversion without needing to use command-line tools. The application supports a wide range of codecs and formats, enabling compatibility across devices and platforms. It includes batch processing capabilities, allowing multiple files to be converted simultaneously. ...
    Downloads: 2 This Week
  • 6

    FastoCloud PRO

    IPTV/NVR/CCTV/Video cloud https://fastocloud.com

    IPTV/Video cloud. Features: cross-platform (Linux, MacOSX, FreeBSD, Raspbian/Armbian); GPU/CPU encode/decode/post-processing; stream statistics; CCTV; adaptive HLS streams; load balancing; temporary URLs; HLS push; EPG scanning; subtitles-to-text conversion; ad insertion; logo overlay; video effects; relays; timeshifts; catchups; playlists; restream/transcode from online streaming services like YouTube, Twitch ...
    Downloads: 0 This Week
  • 7
    uncaptcha

    Defeating Google's audio reCaptcha with 85% accuracy

    ...It employs signal processing techniques such as segmenting audio clips into individual components before transcription, which improves accuracy in noisy or complex audio conditions. The project was developed as part of academic research to highlight potential weaknesses in CAPTCHA systems and includes disclaimers emphasizing responsible use. While it achieved high success rates at the time of publication, later updates to reCAPTCHA have reduced its effectiveness.
    Downloads: 0 This Week
  • 8
    aeneas

    Automagically synchronize audio and text (aka forced alignment)

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
    Downloads: 0 This Week
  • 9

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and...
    Downloads: 0 This Week
  • 10
    The MusicKit & SndKit is an object-oriented software system for building music, sound, signal processing & MIDI applications. The distribution is a comprehensive package that includes on-line documentation, code examples, utilities, applications & scores.
    Downloads: 0 This Week
  • 11
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
    Downloads: 0 This Week
  • 12
    CorEngine
    CorEngine is a work-in-progress, OpenGL-powered 3D game engine designed to help independent game developers with quick prototyping and game/virtual environment creation. The engine supports a standard set of features, such as skeletal animation, post-processing, Lua/C programming, physics powered by Bullet Physics, GUI, and 2D/3D audio.
    Downloads: 0 This Week
  • 13
    InproTK

    An Incremental Spoken Dialogue Processing Toolkit

    InproTK is an Incremental Spoken Dialogue Processing Toolkit, that is, a toolkit to help you build dialogue systems that listen and talk incrementally, allowing for advanced interactional behaviour. Please see our Wiki for more information: http://sourceforge.net/p/inprotk/wiki/
    Downloads: 0 This Week
  • 14
    The Qt Audio Processor is all-in-one audio file processing software that covers ripping, converting, tagging, and burning to, from, and between every available audio codec.
    Downloads: 0 This Week
  • 15
    The purpose of HSVT is the collection and setup of OSS audio tools for the processing of speech and vocals. The end result will be something between jack-rack and ardour, with partial emulation or co-operation with hardware-rack voice processors.
    Downloads: 0 This Week
  • 16
    Software for live sampling and audio processing. Algorithmic composition and improvised audio manipulation in real time. The audio engine uses Csound, and the composition logic is built with Python.
    Downloads: 0 This Week
  • 17
    NovaX is a set of programs that is being developed for small companies and beginners in the fields of HTML and programming. Coded in Python and C++, this is also a good replacement for MS Office. NOTE: This requires Python to be on your PC. ( Python.org )
    Downloads: 0 This Week
  • 18
    A modular audio programming language designed for writing applications quickly. Its main goal is real-time audio processing, but it can be used for any kind of development.
    Downloads: 0 This Week
  • 19
    An XML-based musical score processing framework written in Python, which outputs Csound score files.
    Downloads: 0 This Week
  • 20
    Sonar 2D is a live signal processing application where an audio signal is used to navigate between 'zones' in a virtual 2d space. Each zone represents a given processing module, which when activated will be applied to the sound.
    Downloads: 0 This Week
  • 21
    Real-time sound processing software. It can be used both for general audio processing/editing and as a fuzzbox. Based on a very flexible plug-in system, it is written in Python and currently uses PortAudio for sound input/output.
    Downloads: 0 This Week
  • 22
    JackFX is a Python module for MIDI control and realtime audio effects processing built using the Jack Audio Connection Kit. Effects are stackable, and can be chained in any configuration with only a few lines of Python code.
    Downloads: 0 This Week
  • 23
    pulsorclip

    Download videos from almost any website

    ...The application focuses on a controlled workflow instead of instant downloads. Users first provide a media URL, then select format, quality, and container before processing the file. It includes both a web interface built with Next.js and a Telegram bot that offers the same guided experience through chat. Both share a common backend in a monorepo structure. The system is fully self-hosted and designed to run in a single Docker deployment including both web and bot services. Users can download videos and audio in formats such as MP4, WebM, MKV, MP3, and M4A, with server-side processing and progress tracking.
    Downloads: 0 This Week