Showing 33 open source projects for "extraction"

  • 1
    Librosa

    Python library for audio and music analysis

    Librosa is a powerful Python library for analyzing and processing audio and music signals. Built on top of NumPy, SciPy, and matplotlib, it provides a wide range of tools for feature extraction, time-series manipulation, audio display, and music information retrieval. Whether you're building machine learning models for audio classification or visualizing spectrograms, Librosa is a go-to library for researchers and developers working in audio signal processing.
    Downloads: 9 This Week
  • 2
    FFmpeg Batch AV Converter

    Free all-in-one audio/video FFmpeg batch encoder

    FFmpeg Batch AV Converter is a free universal audio and video encoder for Windows and Linux (via Wine) that exposes the full potential of the ffmpeg command line with a few mouse clicks, in a convenient GUI with drag and drop and progress information. Wizards make things easy for non-experts. Thanks to its multi-file encoding feature, it may be the fastest a/v batch encoder available, since it maximizes system resource usage by launching as many simultaneous processes up to...
    Downloads: 2,846 This Week
  • 3
    AudiooPy

    Audio manager in Python Object-Oriented Programming

    ... A scientifically validated method for automatically detecting sound segments in speech; manipulation of raw audio data; audio mixing capabilities; automated computation of statistical descriptors for audio data; channel extraction; channel mixing. AudiooPy is entirely self-contained and does not rely on any external libraries.
    Downloads: 1 This Week
  • 4
    ccccd

    Channel Code Copy of Compact Discs

    ...The resulting image file, in turn, is demodulated and decoded in software, i.e. within a virtual CD player. The digital audio channel code on LaserDiscs and CD Video discs can be handled as well. Note: The image extraction functionality requires a proprietary hardware interface with a modified disc player. Associated open hardware project "ccccd-HW": http://sourceforge.net/projects/ccccd-hw/ Note: Special restrictions apply. See disclaimers below and within the distribution. Copyright (C) 2012-2025 by Klaus Michael Indlekofer. All rights reserved. ...
    Downloads: 0 This Week
  • 5

    audioFlux

    A library for audio and music analysis and feature extraction.

    audioFlux is a deep learning tool library for audio and music analysis and feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. Its output can be fed to deep learning networks for training, and it is used to study tasks in the audio field such as classification, separation, music information retrieval (MIR), and ASR.
    Downloads: 7 This Week
  • 6
    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a dataset recorded in varied real-world settings (e.g. bus stop, construction site, park, room) with manually annotated ground-truth labels. It includes acoustic feature extraction (multi-resolution cochleagram, MRCG) and post-processing modules (e.g. smoothing, thresholds). The toolkit supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, and post-processing).
    Downloads: 1 This Week
  • 7
    Spatial Media

    Specifications and tools for 360° video and spatial audio

    spatial-media provides tools for working with spherical video and spatial audio metadata so players and platforms can correctly render immersive media. The utilities inject, inspect, and extract metadata in common container formats (MP4/WebM) to signal 360° projection type, stereoscopy mode, and spatial audio layout. Creators use it to prepare 360/VR180 assets for upload so services know whether a video is monoscopic, top-bottom stereo, or side-by-side, and whether ambisonic audio is...
    Downloads: 26 This Week
  • 8
    Myo Mapper

    Open-source software that maps Myo data into OSC and MIDI messages.

    MyoMapper is open-source software that rescales, centres, and maps Myo data into OSC and MIDI messages.
    Downloads: 0 This Week
  • 9

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation, and echo cancellation. The Millennium ASR provides C++ and Python libraries for automatic speech recognition; it implements a weighted finite state transducer (WFST) decoder along with training and adaptation methods. These toolkits are meant to facilitate research and development of automatic distant speech recognition.
    Downloads: 0 This Week
  • 10
    Python Audio Tools are a collection of audio handling programs. These include programs for CD extraction, track conversion from one audio format to another, track renaming and retagging, track identification, CD burning from tracks, and more.
    Downloads: 17 This Week
  • 11
    jAudio
    jAudio is a project implementing a number of common feature extraction algorithms for analyzing audio in Java. Output is either in Weka's ARFF format or the ACE format. It can be used as a GUI application, as a command-line application, or embedded. The primary repository is on GitHub (dmcennis), which pushes to this project's GitHub repository.
    Downloads: 3 This Week
  • 12
    Yet Another Audio Feature Extractor is a toolbox for audio analysis. It is easy to use and efficient at extracting a large number of audio features simultaneously. It supports WAV and MP3 files and can be embedded in C++, Python, or MATLAB applications.
    Downloads: 0 This Week
  • 13

    Accelerated Feature Extraction Tool

    Fast GPU-accelerated feature extraction software for speech analysis

    A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is specially designed to process very large audio data sets. It uses GPU acceleration if a compatible GPU is available (CUDA as well as OpenCL; NVIDIA, AMD, and Intel GPUs are supported). The CPU SSE intrinsic instruction set is used when no compatible GPU is present.
    Downloads: 0 This Week
  • 14
    Simple general-purpose metadata extraction API with support for popular multimedia metadata formats such as EXIF and ID3.
    Downloads: 1 This Week
  • 15
    Speedy MIDI
    Speedy MIDI is an editor designed for choirs and singers to quickly generate MIDI files for rehearsal.
    Downloads: 15 This Week
  • 16
    Pumilio
    Pumilio is a web-based sound analysis and archive system for almost any kind of sound file, with tools to view the spectrogram of a sound, select regions for further analysis and insertion into a database, apply filtering, and perform many other manipulations.
    Downloads: 6 This Week
  • 17
    Real-time audio feature extraction and similarity matching in C++. PD and Max/MSP support via FLEXT.
    Downloads: 0 This Week
  • 18
    Sonic Visualiser is an application for inspecting and analysing the contents of music audio files. It combines powerful waveform and spectral visualisation tools with automated feature extraction plugins and annotation capabilities.
    Downloads: 13 This Week
  • 19
    FX Player is a Web-based streaming server with a Flash iTunes-like interface. It shares your MP3 library and allows access to your tracks through the Internet. Coded in Java, FX Player runs on most platforms, including Mac OS X, Windows, Linux and Unix.
    Downloads: 23 This Week
  • 20
    CAMEL - A Framework for Audio Analysis
    CAMEL (Content-based Audio and Music Extraction Library) is an easy-to-use C++ framework developed for content-based audio and music analysis. The framework provides a set of tools for easy Segmentation, Feature Extraction, Domain Extraction, etc.
    Downloads: 0 This Week
  • 21
    MooSick Music Player
    Free mplayer-based Music Player with a built-in Song Database, Websearch, Streaming and Download Tools and several Audio Effects including Speed Control, Pitch Change, Dynamics Processing, "Live"-, Headphone- and Karaoke Filters and Distortion ...
    Downloads: 0 This Week
  • 22
    This is a fast C implementation of Arturo Camacho's SWIPE' pitch extraction algorithm. See the project homepage for more about the advantages of the SWIPE' algorithm. swipe-1.0.tar.gz contains the current source, which should compile quite neatly.
    Downloads: 0 This Week
  • 23
    libxtract is a simple, portable, lightweight library of audio feature extraction functions.
    Downloads: 0 This Week
  • 24
    MedosW is an open source media wrapper built on top of GStreamer. It provides playback of MP3 and Ogg files and extraction of their metadata. Future versions will support multiple enhancements.
    Downloads: 0 This Week
  • 25
    jumbles (Java Unified Metadata Basic Library for Extracting and Storing) is a library that enables the extraction and storing of multimedia metadata. Currently wraps "jaudiotagger" (MP3 ID3 tags) and "metadata extractor" (EXIF, et al.).
    Downloads: 0 This Week