Showing 134 open source projects for "extraction"

  • 1
    winLAME

    winLAME is an easy to use encoder for many audio formats, e.g. MP3.

    winLAME is an easy to use encoder for many audio formats, including MP3, Opus, Ogg Vorbis and more. winLAME lets you read in audio tracks from CDs or encode audio files from your hard drive. The intuitive wizard-style user interface makes it easy to set up encoding settings. Multicore CPUs are used to parallelize encoding tasks. Official GitHub project webpage: https://github.com/vividos/winLAME
    Downloads: 46 This Week
  • 2
    VideoSubFinder
    The main purpose of this program is to extract hardcoded subtitles (hardsub) from video. It provides two main features: 1) Autodetection of video frames containing hardcoded text (hardsub), saving information about their timing positions. 2) Generation of text images cleared of background, which lets OCR programs (like FineReader, Subtitle Edit, Google Drive) generate complete subtitles with the original text and timing. For working of this program on...
    Downloads: 534 This Week
  • 3

    audioFlux

    A library for audio and music analysis, feature extraction.

    audioFlux is a deep learning tool library for audio and music analysis and feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. The extracted features can be fed to deep learning networks for training, and the library is used to study tasks in the audio field such as classification, separation, music information retrieval (MIR), and ASR.
    Downloads: 0 This Week
  • 4
    GXSM

    Scanning Probe Microscopy Controller and Data Visualization Software

    GXSM -- Gnome X Scanning Microscopy: A multi-channel image and vector-probe data acquisition and visualization system designed for SPM techniques (STM,AFM..), but also SPA-LEED/LEED/LEEM data analysis. A plug-in interface allows any user add-on data-processing and special hardware and instrument support. Latest: NC-AFM and related explorative methods as SQDM can be configured. High-Speed external PAC-PLL hardware option with digital DSP link. Based on several hardware options it supports...
    Downloads: 0 This Week
  • 5
    VSGAN

    VapourSynth Single Image Super-Resolution Generative Adversarial

    ...The Network will be applied in quadrants of the image to reduce up-front VRAM usage. You can use any RGB video input, including float32 (e.g., RGBS) inputs. Using VapourSynth you can pass a Video directly to VSGAN, without any frame extraction needed. Any edit you make in the VapourSynth script with or without VSGAN can be re-used for any other video. VSGAN is released under the MIT License, ensuring it will stay free, with the ability to be used commercially.
    Downloads: 1 This Week
  • 6
    AutoSub

    A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

    AutoSub is a Python-based tool designed to automatically generate subtitles for video or audio content using speech recognition technology. It processes media files by extracting audio, transcribing spoken content, and generating subtitle files in standard formats. The tool supports multiple languages and can integrate with translation systems to produce subtitles in different languages. It is designed for automation, allowing batch processing of multiple media files. AutoSub leverages...
    Downloads: 4 This Week
  • 7
    flutter_ffmpeg

    Not maintained anymore. Superseded by FFmpegKit

    flutter-ffmpeg is a cross-platform Flutter plugin that enables developers to execute FFmpeg commands within mobile applications. It provides prebuilt FFmpeg binaries and a unified API for Android and iOS, simplifying multimedia processing in Flutter projects. The plugin supports a wide range of operations such as transcoding, filtering, and streaming. It also includes access to FFprobe for retrieving media metadata. flutter-ffmpeg is designed to abstract platform-specific complexities,...
    Downloads: 3 This Week
  • 8
    VAD

    Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

    This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a dataset recorded in varied real-world settings (e.g. bus stop, construction site, park, room) with manually annotated ground-truth labels. The toolkit includes acoustic feature extraction (multi-resolution cochleagram, MRCG) and post-processing modules (e.g. smoothing, thresholds), with both MATLAB and Python/TensorFlow components for feature extraction, classification, and post-processing.
    Downloads: 0 This Week
  • 9
    myjaphoo
    MyJaPhoO (My Java Photo Organizer) manages local photo and video collections.
    Downloads: 0 This Week
  • 10
    MediaToolkit

    A .NET library to convert and process all your video & audio files

    MediaToolkit is a .NET library designed to simplify multimedia processing tasks by providing an easy-to-use interface over FFmpeg functionality. It allows developers to perform operations such as video conversion, thumbnail generation, and metadata extraction without dealing with raw command-line syntax. The library supports common media workflows, making it suitable for backend services and desktop applications. It provides structured APIs for configuring encoding parameters and handling input and output files. MediaToolkit also integrates FFprobe for retrieving detailed media information. ...
    Downloads: 6 This Week
  • 11
    Image Super-Resolution (ISR)

    Super-scale your images and run experiments with Residual Dense

    The goal of this project is to upscale and improve the quality of low-resolution images. This project contains Keras implementations of different Residual Dense Networks for Single Image Super-Resolution (ISR) as well as scripts to train these networks using content and adversarial loss components. Docker scripts and Google Colab notebooks are available to carry out training and prediction. We also provide scripts to facilitate training on the cloud with AWS and Nvidia-docker with only a few...
    Downloads: 1 This Week
  • 12
    qlImageSize

    QuickLook and Spotlight plugins to display the dimensions of images

    qlImageSize is a QuickLook plugin for macOS that displays image dimensions and file size in the QuickLook preview panel. It provides an efficient way to inspect image metadata without opening additional applications.
    Downloads: 1 This Week
  • 13
    Simd

    High performance image processing library in C++

    The Simd Library is a free open source image processing library, designed for C and C++ programmers. It provides many useful high performance algorithms for image processing, such as pixel format conversion, image scaling and filtration, extraction of statistical information from images, motion detection, object detection (HAAR and LBP classifier cascades) and classification, and neural networks. The algorithms are optimized using different SIMD CPU extensions. In particular, the library supports the following CPU extensions: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX-512 for x86/x64, VMX (Altivec) and VSX (Power7) for PowerPC, NEON for ARM. ...
    Downloads: 34 This Week
  • 14
    Spatial Media

    Specifications and tools for 360° video and spatial audio

    spatial-media provides tools for working with spherical video and spatial audio metadata so players and platforms can correctly render immersive media. The utilities inject, inspect, and extract metadata in common container formats (MP4/WebM) to signal 360° projection type, stereoscopy mode, and spatial audio layout. Creators use it to prepare 360/VR180 assets for upload so services know whether a video is monoscopic, top-bottom stereo, or side-by-side, and whether ambisonic audio is...
    Downloads: 66 This Week
  • 15
    DigiExtractor
    DigiExtractor is a tool to allow extraction of video recordings from the DigiCorder series of DVB receivers manufactured by TechniSat.
    Downloads: 3 This Week
  • 16
    Myo Mapper

    An open-source software to map Myo data into OSC and MIDI messages.

    MyoMapper is open-source software that rescales, centres, and maps Myo data into OSC and MIDI messages.
    Downloads: 2 This Week
  • 17
    Skinner

    Special Effects with Skinned Mesh in Unity

    Skinner is a collection of real-time special effects for Unity that use vertices of an animating skinned mesh as emission points. Instead of duplicating mesh data on the CPU, it employs a replacement shader to stream vertex positions into GPU-friendly buffers, conserving memory and CPU cycles. With those GPU-side buffers, Skinner can drive effects like trails, particles, or geometry that react to the underlying skinned animation in sophisticated ways. The approach enables complex, performant...
    Downloads: 0 This Week
  • 18
    subs2srs

    Convert movies and TV shows to flashcards

    subs2srs allows you to create import files for Anki or other Spaced Repetition Systems (SRS) based on your favorite foreign language movies and TV shows to aid in the language learning process. See http://subs2srs.sourceforge.net/ for more information.
    Downloads: 41 This Week
  • 19

    cbrTekStraktor

    an application to automatically extract text from comic books.

    ...The application also lets you manually define text areas in CBR files. It includes a simple graphical editor for further processing of the extracted text. Text extraction is achieved by a combination of statistical and graphical processing operations, based on three major algorithms: binarization of color images (Niblack and other methods), connected components, and K-means clustering. Tesseract is used to perform Optical Character Recognition on the extracted text. ...
    Downloads: 2 This Week
  • 20
    OCR Web based

    OCR web based for Browser Firefox & PC

    ...id=com.ulm.ocr ========= Add-on for Opera: http://bit.ly/1F0E0wP ========= Release 1.0.1: For safety reasons, I disabled the option to import an image from a URL. Finally, note that you can write or draw directly on the canvas and then run character recognition and text extraction on the result.
    Downloads: 0 This Week
  • 21
    3DBrainExplorer is a lightweight OpenGL-based 3D renderer for scientific visualization of volumetric data (NIfTI) featuring real-time image manipulation, AAL brain labels (Tzourio-Mazoyer et al., 2002), SPM5 statistical images and stereo display.
    Downloads: 1 This Week
  • 22

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and Python libraries for automatic speech recognition. It implements a weighted finite state transducer (WFST) decoder along with training and adaptation methods. These toolkits are meant to facilitate research and development of automatic distant speech recognition.
    Downloads: 0 This Week
  • 23
    Python Audio Tools are a collection of audio handling programs. These include programs for CD extraction, track conversion from one audio format to another, track renaming and retagging, track identification, CD burning from tracks, and more.
    Downloads: 14 This Week
  • 24
    MORPHE

    MORphological PHenotype Extraction

    MORphological PHenotype Extraction (MORPHE) is a suite of automated image processing, visualization, and classification algorithms to facilitate the analysis of heritable and clonal red-to-green transitions that occurred during the growth of a colony.
    Downloads: 1 This Week
  • 25
    jAudio
    jAudio is a project implementing a number of common feature extraction algorithms for analyzing audio in Java. Output is in either Weka's ARFF format or the ACE format. It can be used as a GUI application, a command-line application, or embedded. The primary repository is on GitHub (dmcennis), which pushes to this project's GitHub repository.
    Downloads: 2 This Week