Search Results for "audio processing" - Page 6

Showing 374 open source projects for "audio processing"

  • 1
    AudioEnhancerMAX

    Open-source AI audio processing suite: 100% local, free, no limits.

    AudioEnhancerMAX is the open-source AI-powered audio media center for podcasters, creators, and professionals. It runs 100% locally on your hardware — no cloud, no subscriptions, no limits. Features: 16+ intelligent audio filters (AI noise removal, filler word detection, breath reduction, studio sound, audio super-resolution), Gemma 4 AI dynamic parameter tuning, distributed edge computing across Android smartphones, real-time system monitoring for Apple Silicon. Built with FastAPI,...
    Downloads: 5 This Week
  • 2
    Bootleg Sound Processor

    Software for processing audio files.

    Software for processing audio files. The files "Batch Processor.py" and "Duplicate remover.py" are meant to be used with the output of Bootleg Text Slicer (https://github.com/Northstrix/bootleg-text-slicer) placed into the "Unprocessed" folder, while "Single file processor.py" can be used with standalone files from arbitrary locations. GitHub repository: https://github.com/Northstrix/bootleg-sound-processor Made using Google AI Studio (https://aistudio.google.com/) and Perplexity (https://www.perplexity.ai/)
    Downloads: 0 This Week
  • 3

    ApraPipes

    A pipeline framework for developing video and image processing apps

    ApraPipes is a C++ multimedia processing framework designed for building high-performance video/audio processing pipelines with GPU acceleration. It provides a modular, declarative architecture for creating complex media processing workflows that span camera capture, encoding/decoding, computer vision, AI operations, and output to files, streams, or displays.
    Downloads: 0 This Week
  • 4
    WildMidi is a MIDI processing library and MIDI player that uses the GUS patch set.
    Downloads: 5 This Week
  • 5

    runabc

    Runabc is a user interface supporting abc music notation software

    Runabc is a graphical user interface to the abcMIDI, abc2svg, and abcm2ps packages, which normally run in a command window. It also contains numerous tools for editing, processing, and analyzing abc and MIDI files. Runabc has been included in the SourceForge abc music project and is now becoming a separate project of its own.
    Downloads: 1 This Week
  • 6
    Internet DJ Console

    A feature packed DJ console and internet radio client for Linux users

    Conceived as an internet radio Shoutcast/Icecast client and DJ console, IDJC has two main media players, a background track player, effects buttons, a crossfader, WebM, AAC, Ogg, and MP3 streaming, stream automation timers, aux input, and voice and VoIP integration. Supported media file formats include mp3, ogg, flac, wma, wav, m4a, m3u, xspf, pls, and cue sheets. It also offers IRC track and station announcements and uses the JACK Audio Connection Kit to provide a flexible audio chain. This list of features is by no...
    Downloads: 4 This Week
  • 7
    MATLAB Deep Learning Model Hub

    Discover pretrained models for deep learning in MATLAB

    Discover pre-trained models for deep learning in MATLAB. Pretrained image classification networks have already learned to extract powerful and informative features from natural images. Use them as a starting point to learn a new task using transfer learning. Inputs are RGB images, the output is the predicted label and score.
    Downloads: 1 This Week
  • 8
    Internet Friendly Media Encoder

    Simple, Flexible & Powerful H.265/HEVC & H266/VVC video encoder!

    ...Internet Friendly Media Encoder supports adding subtitles and attachments, and can function as a remuxing tool with a Copy Stream option. It allows you to combine multiple video, audio, subtitle, and attachment streams into a single file, remove unwanted streams, or incorporate subtitles from another video without extracting. The software also supports AviSynth for advanced video processing. The video and audio encoders are implemented in a plug-in style, enabling users to add their own compiled, optimized CPU architectures for increased speed. ...
    Downloads: 24 This Week
  • 9

    Esegui SB

    Flexible video encoding script supporting multiple formats and codecs.

    Video Encoding and Processing Script. This Bash script streamlines video encoding and processing by leveraging FFmpeg. It supports a wide range of formats and codecs through FFmpeg, including MKV, MP4, x264, x265, AV1, and VP9. Key features include flexible video and audio encoding (uses FFmpeg's libraries for various codecs and settings) and subtitle handling (include or exclude subtitles as needed).
    Downloads: 0 This Week
  • 10
    VCClient

    Software that uses AI to perform real-time voice conversion

    VCClient is a real-time voice conversion system that uses machine learning models to transform a speaker’s voice into another voice with minimal latency. It is designed for live applications such as streaming, gaming, and virtual communication, where immediate feedback is essential. The system supports multiple voice conversion models, including RVC and other neural network-based approaches, allowing users to switch between different voices or customize their output. It provides both a...
    Downloads: 18 This Week
  • 11
    CSM (Conversational Speech Model)

    A Conversational Speech Generation Model

    The CSM (Conversational Speech Model) is a speech generation model developed by Sesame AI that creates RVQ audio codes from text and audio inputs. It uses a Llama backbone and a smaller audio decoder to produce audio codes for realistic speech synthesis. The model has been fine-tuned for interactive voice demos and is hosted on platforms like Hugging Face for testing. CSM offers a flexible setup and is compatible with CUDA-enabled GPUs for efficient execution.
    Downloads: 1 This Week
  • 12
    Voxal voice changer

    Transform your voice in real time with Voxal Voice Changer

    Voxal Voice Changer is a program that allows you to modify your voice by applying various effects (e.g. pitch change, echo, etc.) in real-time. Effects can be added in any sequence and in any combination, allowing you to distort your voice beyond recognition. Take your audio to the next level! Our powerful Voice Changer software lets you morph your voice in real-time with stunning AI-powered quality. Whether you're looking to have fun, protect your privacy, or create engaging content,...
    Downloads: 5 This Week
  • 13

    speech intonator

    The purpose of the project is to develop audio processing algorithms

    The initial version of the main branch of the project has been completed. The main name of the project is "Java audio mixer Summaha". The second name of the project is "Sound Arithmometer". Its main purpose is producing musical sound remixes from a set of samples. The name "Summaha" rhymes well with 'Yamaha' and creates motivation and inspiration to achieve a sound quality comparable to that of a well-known brand. Detailed documentation is in the 'read' signature files. Anyone who is...
    Downloads: 0 This Week
  • 14
    Snowmix

    Video mixer for mixing live and recorded video and audio feeds

    New version 0.5.2.2 Released May 15th 2026. Snowmix is a Swiss army knife tool for mixing live and recorded video and audio feeds. It supports 2D and 3D clipping, scaling and transparent overlay of video, png graphics and text. It supports animation of video, images and texts through native commands changing scale, placement, transparency and rotation. Animation and actions can also be controlled through native scripting and an embedded Tcl and/or Python interpreter. Snowmix is designed...
    Downloads: 32 This Week
  • 15
    Drumstick Libraries

    MIDI libraries for Qt/C++

    Drumstick is a tool to play music. This is a set of C++ MIDI libraries using Qt5 objects, idioms and style. It contains a C++ wrapper around the ALSA library sequencer interface; ALSA sequencer provides software support for MIDI technology on Linux. A complementary library provides classes for SMF (Standard MIDI files: .MID/.KAR), and Cakewalk (.WRK) file formats processing. A multiplatform realtime MIDI I/O library is also provided.
    Downloads: 6 This Week
  • 16
    Yang YouTube Downloader

    Downloads best-quality audio and video from YouTube

    This YouTube downloader lets you get the best streams without re-encoding, preserving the best quality. While VP9 is 35% more efficient than MP4 for videos, some videos have 40-60% smaller file sizes in VP9 format! It will automatically select the best-quality video based on file sizes. It can even combine MP4 video with Opus audio in an MKV file, although not all players will support it. I haven't seen any other downloader that can produce an MKV file with the best video and audio...
    Downloads: 10 This Week
  • 17

    mediaPlaylists

    Creating and maintaining media player playlists for audio and video

    Tools for creating and maintaining media player playlists for audio and video libraries, including support for environments that use Twonky or similar DLNA‑based servers. These command‑line utilities focus on predictable behavior, transparent processing and compatibility with common playlist formats such as M3U and WPL. The design allows additional formats to be added easily without modifying the core workflow.
    Downloads: 0 This Week
  • 18
    MLT Multimedia Framework
    A multimedia authoring and processing framework and a video playout server for television broadcasting.
    Downloads: 16 This Week
  • 19
    AI File Sorter

    Local AI file organization with categorization and rename suggestions

    AI File Sorter is a cross-platform desktop application that uses AI (local LLMs run on your computer) to organize files and suggest meaningful file names based on real content, not just filenames or extensions. The app can analyze images locally and propose descriptive rename suggestions (for example, IMG_2048.jpg → clouds_over_lake.jpg). It can also analyze document text to improve categorization and renaming. Supported formats include PDF, DOCX, XLSX, PPTX, ODT, ODS, ODP, and common...
    Downloads: 378 This Week
  • 20
    Data Crow

    The ultimate cataloguer

    Data Crow provides standard cataloguing modules for movies & video (DivX, Xvid, DVD, Blu-ray, etc.), books (and eBooks), images, board games, comic books, games & software, and music (mp3 and other music files). Besides these modules, which you can change to fit your requirements, you can create new modules (want to catalogue your stamps, equipment, or anything else?). The GUI is skinnable. Reporting (using JasperReports and their community edition JasperSoft Developer Studio), loan...
    Downloads: 378 This Week
  • 21
    DiffRhythm

    Di♪♪Rhythm: Blazingly Fast & Simple End-to-End Song Generation

    DiffRhythm is an open-source, diffusion-based model designed to generate full-length songs. Focused on music creation, it combines advanced AI techniques to produce coherent and creative audio compositions. The model utilizes a latent diffusion architecture, making it capable of producing high-quality, long-form music. It can be accessed on Huggingface, where users can interact with a demo or download the model for further use. DiffRhythm offers tools for both training and inference, and its...
    Downloads: 0 This Week
  • 22
    OmniPull

    Just pull anything

    OmniPull is a powerful, cross-platform download manager built with Python and PySide6. It provides a modern, intuitive interface for managing downloads with advanced features like multi-threading, queue management, and media extraction.
    Downloads: 4 This Week
  • 23
    vocal-separate

    An extremely simple tool for separating vocals and background music

    ...Users can drag and drop an audio or video file onto the interface to begin separation, choosing between two, four, or five stems, which allows isolating specific components like vocals, bass, drums, or piano depending on the chosen model. After processing, the tool outputs separate WAV files for each extracted stem, making it easy to export and use in audio editing or remix software.
    Downloads: 7 This Week
  • 24

    Virtualdub Batch Video DeShake v26.0204

    Batch to compress [and deshake] all videos [or images] in folder

    Installation: Execute "DeShakInst.BAT" VirtualDub2 44282; AviSynth+ 3.7.5 updated to C:\DVD DESHAK.BAT updated to C:\UT and added to PATH Usage: DESHAK task[s] [parameters] Tasks: tp1: deshake pass1 LOG generation for 2nd pass tp2: deshake pass2 and compress video and audio to MP3 tcomp: compress (no deshake) twav: extract WAV and/or uses external WAV audio Parameters (more in help): vEXT: video extension (ie: vmov), default: vAVI qN: h264...
    Downloads: 6 This Week
  • 25
    Kisekae UltraKiss

    Kisekae UltraKiss is a full featured integrated development environment

    UltraKiss is a computer program that implements the Kisekae Set system, KiSS, a Japanese graphics system originally developed to facilitate costume changes on virtual dolls. UltraKiss was developed to help artists build their KiSS sets. It is a full featured viewer for all KiSS dolls, games, and visual applications. It is also a complete graphical development environment for creating KiSS applications. It fully implements the FKiSS event driven programming language up to and including...
    Downloads: 13 This Week