Showing 42 open source projects for "extraction"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 1
    Librosa

    Librosa

    Python library for audio and music analysis

    Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 2
    Recorder

    Recorder

    HTML5 js recording mp3 wav ogg webm amr format

    ​Supports microphone recording and real-time processing in most of the implemented getUserMediamobile and PC browsers, mainly including Chrome, Firefox, Safari, iOS 14.3+, Android WebView, Tencent Android X5 kernel (QQ, WeChat, Mini Program WebView) , uni-app (App, H5), and most Android phones updated after 2021 have their own browsers; do not support: UC-based kernel (typical Alipay), most of the old domestic mobile phones that have not been updated have their own browsers and any other...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    Super Audio CD Decoder
    ...Decoder is capable of playing back Super Audio CD ISO images, DSDIFF, DSF and DSD WavPack files. DSD(DoP) and PCM output modes. Separate DSD Processor/DSD Converter plugins for track extraction into DSD/DST encoded files.
    Leader badge
    Downloads: 4,891 This Week
    Last Update:
    See Project
  • 4
    LosslessExtract

    LosslessExtract

    Lossless audio extraction tool for Bluray, DVD-Audio, SACD, MKV

    Lossless Extract for macOS and Windows is a tool for purists who demand perfect audio preservation. Designed for precision and simplicity, it effortlessly extracts high-resolution audio from Blu-ray, SACD, MKV or DVD=Audio sources. It handles Dolby TrueHD (with Atmos) and DTS-HD Master Audio preserving atmos object based meta data. Many tools decode immersive audio into PCM, which permanently destroys spatial metadata. Lossless Extract preserves the original audio stream so the immersive mix...
    Leader badge
    Downloads: 48 This Week
    Last Update:
    See Project
  • Forever Free Full-Stack Observability | Grafana Cloud Icon
    Forever Free Full-Stack Observability | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 5
    FFmpeg Batch AV Converter

    FFmpeg Batch AV Converter

    Free all in one audio/video ffmpeg batch encoder

    FFmpeg Batch AV Converter is a free universal audio and video encoder for Windows and Linux (via Wine), that allows to use the full potential of ffmpeg command line with a few mouse clicks in a convenient GUI with drag and drop, progress information. Some fancy wizards make things easy for non-experts. Thanks to its multi-file encoding feature, it may be the fastest a/v batch encoder available, since it maximizes system resources usage by launching as many simultaneous processes up to...
    Leader badge
    Downloads: 2,879 This Week
    Last Update:
    See Project
  • 6
    SpectrumViewer for Windows

    SpectrumViewer for Windows

    Continuous display of audio spectrum from PC input or wave file

    ...Command line support for generating sine, square, triangle and sawtooth wave files. Uses biquad IIR digital filters for frequency detection, plus digital lowpass filters for output extraction. Highly configurable. 60+ FPS with suitable hardware. Manual or automatic gain control.
    Downloads: 12 This Week
    Last Update:
    See Project
  • 7
    AudiooPy

    AudiooPy

    Audio manager in Python Object-Oriented Programming

    ... - A scientifically validated method for automatically detecting sound segments in speech. - Manipulation of raw audio data. - Audio mixing capabilities. - Automated computation of statistical descriptors for audio data. - Channel extraction. - Channel mixing. AudiooPy is entirely self-contained and does not rely on any external libraries. <https://img.shields.io/pypi/v/AudiooPy>
    Downloads: 1 This Week
    Last Update:
    See Project
  • 8
    ccccd

    ccccd

    Channel Code Copy of Compact Discs

    ...The resulting image file, in turn, is demodulated and decoded in software, i.e. within a virtual CD player. The digital audio channel code on LaserDiscs and CD Video discs can be handled as well. Note: The image extraction functionality requires a proprietary hardware interface with a modified disc player. Associated open hardware project "ccccd-HW": http://sourceforge.net/projects/ccccd-hw/ Note: Special restrictions apply. See disclaimers below and within the distribution. Copyright (C) 2012-2025 by Klaus Michael Indlekofer. All rights reserved. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    winLAME

    winLAME

    winLAME is an easy to use encoder for many audio formats, e.g. MP3.

    winLAME is an easy to use encoder for many audio formats, including MP3, Opus, Ogg Vorbis and more. winLAME lets you read in audio tracks from CDs or encode audio files from your hard drive. The intuitive wizard-style user interface makes it easy to set up encoding settings. Multicore CPUs are used to parallelize encoding tasks. Official GitHub project webpage: https://github.com/vividos/winLAME
    Downloads: 48 This Week
    Last Update:
    See Project
  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • 10

    audioFlux

    A library for audio and music analysis, feature extraction.

    audioflux is a deep learning tool library for audio and music analysis, feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. It can be provided to deep learning networks for training, and is used to study various tasks in the audio field such as Classification, Separation, Music Information Retrieval(MIR) and ASR etc.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    VAD

    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a recorded dataset in varied real-world settings (e.g. bus stop, construction site, park, room) with ground truth labeling. Acoustic feature extraction (multi-resolution cochleagram, MRCG). Post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, postprocessing). Acoustic feature extraction (multi-resolution cochleagram, MRCG). Provided real-world dataset with manual annotations.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Spatial Media

    Spatial Media

    Specifications and tools for 360º video and spatial audio

    spatial-media provides tools for working with spherical video and spatial audio metadata so players and platforms can correctly render immersive media. The utilities inject, inspect, and extract metadata in common container formats (MP4/WebM) to signal 360° projection type, stereoscopy mode, and spatial audio layout. Creators use it to prepare 360/VR180 assets for upload so services know whether a video is monoscopic, top-bottom stereo, or side-by-side, and whether ambisonic audio is...
    Downloads: 50 This Week
    Last Update:
    See Project
  • 13
    Myo Mapper

    Myo Mapper

    An open-source software to map Myo data into OSC and MIDI messages.

    MyoMapper, an open-source software to rescale and centre and map Myo data into OSC and MIDI messages.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 14
    subs2srs

    subs2srs

    Convert movies and TV shows to flashcards

    subs2srs allows you to create import files for Anki or other Spaced Repetition Systems (SRS) based on your favorite foreign language movies and TV shows to aid in the language learning process. See http://subs2srs.sourceforge.net/ for more information.
    Leader badge
    Downloads: 32 This Week
    Last Update:
    See Project
  • 15

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and python libraries for automatic speech recognition. The Millennium ASR implements a weighted finite state transducer (WFST) decoder, training and adaptation methods. These toolkits are meant for facilitating research and development of automatic distant speech recognition.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Python Audio Tools are a collection of audio handling programs. These include programs for CD extraction, track conversion from one audio format to another, track renaming and retagging, track identification, CD burning from tracks, and more.
    Leader badge
    Downloads: 28 This Week
    Last Update:
    See Project
  • 17
    jAudio
    jAudio is a project implementing a number of common feature extraction algorithms for analyzing audio in java. Output is either in Weka's ARFF format or the ACE format. Can be used either as a GUI application, command-line application, or embedded. The primary repository is on github (dmcennis) which pushes to this projects github repository.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 18
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    Accelerated Feature Extraction Tool

    A fast GPU accelerated feature extraction software for speech analysis

    A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is a specially designed to process very large audio data sets. It uses GPU acceleration if compatible GPU available (CUDA as weel as OpenCL, NVIDIA, AMD, and Intel GPUs are supported). CPU SSE intrinsic instruction set is used in cases where no compatible GPU present.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Simple general-purpose metadata extraction API with support for popular multimedia metadata formats such as EXIF and ID3.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    Speedy MIDI
    Speedy MIDI is an editor designed for choirs and singers to quickly generate MIDI files for rehearsal.
    Downloads: 10 This Week
    Last Update:
    See Project
  • 22
    Pumilio
    Pumilio is a web-based sound analysis and archive system for almost any kind of sound file with tools to see the spectrogram of the sound, select regions for further analysis and insertion in a database, filtering, and many other manipulations.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Real-time audio feature extraction and similarity matching in C++. PD and Max/MSP support via FLEXT.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    Sonic Visualiser is an application for inspecting and analysing the contents of music audio files. It combines powerful waveform and spectral visualisation tools with automated feature extraction plugins and annotation capabilities.
    Downloads: 7 This Week
    Last Update:
    See Project
  • 25
    Game audio/data format extraction/conversion/injection tool/media player. Supports over 43 Xbox1/Xbox360/PC formats.
    Downloads: 12 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB