Page 3 | extraction free download

Showing 134 open source projects for "extraction"

Filtered by category: Multimedia

1

winLAME

winLAME is an easy-to-use encoder for many audio formats, e.g. MP3.

winLAME is an easy-to-use encoder for many audio formats, including MP3, Opus, Ogg Vorbis and more. winLAME lets you read in audio tracks from CDs or encode audio files from your hard drive. The intuitive wizard-style user interface makes it easy to configure encoding settings. Multicore CPUs are used to parallelize encoding tasks. Official GitHub project webpage: https://github.com/vividos/winLAME

8 Reviews

Downloads: 46 This Week

Last Update: 2023-05-14
See Project
2

VideoSubFinder

The main purpose of this program is to extract hardcoded subtitles (hardsub) from video. It provides two main features: 1) Autodetection of video frames containing hardcoded text (hardsub), saving their timing positions. 2) Generation of text images cleared of background, which lets OCR programs (such as FineReader, Subtitle Edit, Google Drive) generate complete subtitles with the original text and timing. For working of this program on...

18 Reviews

Downloads: 534 This Week

Last Update: 2023-05-01
See Project
3

audioFlux

A library for audio and music analysis and feature extraction.

audioFlux is a deep learning tool library for audio and music analysis and feature extraction. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. The extracted features can be fed to deep learning networks for training, and the library is used to study various tasks in the audio field such as classification, separation, music information retrieval (MIR), and ASR.

Downloads: 0 This Week

Last Update: 2023-03-22
See Project
4

GXSM

Scanning Probe Microscopy Controller and Data Visualization Software

GXSM (Gnome X Scanning Microscopy) is a multi-channel image and vector-probe data acquisition and visualization system designed for SPM techniques (STM, AFM, ...), and also for SPA-LEED/LEED/LEEM data analysis. A plug-in interface allows user add-ons for data processing and for special hardware and instrument support. Latest: NC-AFM and related explorative methods such as SQDM can be configured. High-speed external PAC-PLL hardware option with digital DSP link. Based on several hardware options it supports...

2 Reviews

Downloads: 0 This Week

Last Update: 2023-04-23
See Project
5

VSGAN

VapourSynth Single Image Super-Resolution Generative Adversarial

...The Network will be applied in quadrants of the image to reduce up-front VRAM usage. You can use any RGB video input, including float32 (e.g., RGBS) inputs. Using VapourSynth you can pass a Video directly to VSGAN, without any frame extraction needed. Any edit you make in the VapourSynth script with or without VSGAN can be re-used for any other video. VSGAN is released under the MIT License, ensuring it will stay free, with the ability to be used commercially.

Downloads: 1 This Week

Last Update: 2023-03-29
See Project
6

AutoSub

A CLI script to generate subtitle files (SRT/VTT/TXT) for any video

AutoSub is a Python-based tool designed to automatically generate subtitles for video or audio content using speech recognition technology. It processes media files by extracting audio, transcribing spoken content, and generating subtitle files in standard formats. The tool supports multiple languages and can integrate with translation systems to produce subtitles in different languages. It is designed for automation, allowing batch processing of multiple media files. AutoSub leverages...

Downloads: 4 This Week

Last Update: 4 days ago
See Project
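
The AutoSub entry above mentions generating subtitle files in standard formats such as SRT. As a rough illustration of what that output step involves, here is a minimal sketch that renders timed text segments in the SRT layout (a running index, a `HH:MM:SS,mmm --> HH:MM:SS,mmm` line, then the text). The function names `format_timestamp` and `to_srt` and the `(start, end, text)` tuple shape are assumptions made for this example; this is not AutoSub's own code.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments):
    """Render (start_seconds, end_seconds, text) tuples as SRT text.

    The tuple layout is a hypothetical input format for this sketch.
    """
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text}\n"
        )
    # A blank line separates consecutive subtitle blocks.
    return "\n".join(blocks)


if __name__ == "__main__":
    print(to_srt([(0.0, 1.25, "Hello"), (1.25, 2.0, "World")]))
```

A speech recognizer would supply the segment times and text; only the formatting step is shown here.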
7

flutter_ffmpeg

Not maintained anymore. Superseded by FFmpegKit

flutter-ffmpeg is a cross-platform Flutter plugin that enables developers to execute FFmpeg commands within mobile applications. It provides prebuilt FFmpeg binaries and a unified API for Android and iOS, simplifying multimedia processing in Flutter projects. The plugin supports a wide range of operations such as transcoding, filtering, and streaming. It also includes access to FFprobe for retrieving media metadata. flutter-ffmpeg is designed to abstract platform-specific complexities,...

Downloads: 3 This Week

Last Update: 5 days ago
See Project
8

VAD

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM

This repository is a voice activity detection (VAD) toolkit that implements multiple models (DNN, bDNN, LSTM, ACAM) for detecting speech versus non-speech in audio. It also provides a dataset recorded in varied real-world settings (e.g. bus stop, construction site, park, room) with manually annotated ground-truth labels. The toolkit includes acoustic feature extraction (multi-resolution cochleagram, MRCG) and post-processing modules (e.g. smoothing, thresholds). It supports both MATLAB and Python/TensorFlow components (for feature extraction, classification, and post-processing).

Downloads: 0 This Week

Last Update: 2025-10-02
See Project
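
The VAD entry above lists post-processing modules such as smoothing and thresholds. One common way such a step works is sketched below: per-frame speech probabilities are smoothed with a moving average and then thresholded into speech segments. This is a generic illustration, not the toolkit's implementation; the function names, the moving-average choice, the window size, and the 0.5 threshold are all assumptions.

```python
def smooth(probs, window=5):
    """Moving-average smoothing with a centered window, clipped at the edges."""
    half = window // 2
    out = []
    for i in range(len(probs)):
        lo = max(0, i - half)
        hi = min(len(probs), i + half + 1)
        out.append(sum(probs[lo:hi]) / (hi - lo))
    return out


def to_segments(probs, threshold=0.5, window=5):
    """Return (start_frame, end_frame) pairs, end exclusive, for frames
    whose smoothed probability is at or above the threshold."""
    flags = [p >= threshold for p in smooth(probs, window)]
    segments = []
    start = None
    for i, is_speech in enumerate(flags):
        if is_speech and start is None:
            start = i  # a speech run begins
        elif not is_speech and start is not None:
            segments.append((start, i))  # the run ended at frame i
            start = None
    if start is not None:
        segments.append((start, len(flags)))  # run extends to the last frame
    return segments


if __name__ == "__main__":
    # A sustained run survives smoothing; an isolated spike does not.
    print(to_segments([0, 0, 1, 1, 1, 1, 0, 0], window=3))
    print(to_segments([0, 0, 1, 0, 0], window=3))
```

Smoothing before thresholding suppresses single-frame false alarms at the cost of slightly blurring segment boundaries.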
9

myjaphoo

MyJaPhoO (My Java Photo Organizer) manages local photo and video collections.

Downloads: 0 This Week

Last Update: 2020-07-31
See Project
10

MediaToolkit

A .NET library to convert and process all your video & audio files

MediaToolkit is a .NET library designed to simplify multimedia processing tasks by providing an easy-to-use interface over FFmpeg functionality. It allows developers to perform operations such as video conversion, thumbnail generation, and metadata extraction without dealing with raw command-line syntax. The library supports common media workflows, making it suitable for backend services and desktop applications. It provides structured APIs for configuring encoding parameters and handling input and output files. MediaToolkit also integrates FFprobe for retrieving detailed media information. ...

Downloads: 6 This Week

Last Update: 5 days ago
See Project
11

Image Super-Resolution (ISR)

Super-scale your images and run experiments with Residual Dense

The goal of this project is to upscale and improve the quality of low-resolution images. This project contains Keras implementations of different Residual Dense Networks for Single Image Super-Resolution (ISR) as well as scripts to train these networks using content and adversarial loss components. Docker scripts and Google Colab notebooks are available to carry out training and prediction. Also, we provide scripts to facilitate training on the cloud with AWS and Nvidia-docker with only a few...

Downloads: 1 This Week

Last Update: 2022-03-31
See Project
12

qlImageSize

QuickLook and Spotlight plugins to display the dimensions of images

qlImageSize is a QuickLook plugin for macOS that displays image dimensions and file size in the QuickLook preview panel. It provides an efficient way to inspect image metadata without opening additional applications.

Downloads: 1 This Week

Last Update: 2025-03-07
See Project
13

Simd

High performance image processing library in C++

The Simd Library is a free open source image processing library, designed for C and C++ programmers. It provides many useful high performance algorithms for image processing such as: pixel format conversion, image scaling and filtration, extraction of statistical information from images, motion detection, object detection (HAAR and LBP classifier cascades) and classification, and neural networks. The algorithms are optimized using different SIMD CPU extensions. In particular, the library supports the following CPU extensions: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX-512 for x86/x64, VMX (AltiVec) and VSX (Power7) for PowerPC, and NEON for ARM. ...

3 Reviews

Downloads: 34 This Week

Last Update: 2019-02-01
See Project
14

Spatial Media

Specifications and tools for 360° video and spatial audio

spatial-media provides tools for working with spherical video and spatial audio metadata so players and platforms can correctly render immersive media. The utilities inject, inspect, and extract metadata in common container formats (MP4/WebM) to signal 360° projection type, stereoscopy mode, and spatial audio layout. Creators use it to prepare 360/VR180 assets for upload so services know whether a video is monoscopic, top-bottom stereo, or side-by-side, and whether ambisonic audio is...

Downloads: 66 This Week

Last Update: 2025-10-10
See Project
15

DigiExtractor

DigiExtractor is a tool to allow extraction of video recordings from the DigiCorder series of DVB receivers manufactured by TechniSat.

Downloads: 3 This Week

Last Update: 2018-06-17
See Project
16

Myo Mapper

An open-source software to map Myo data into OSC and MIDI messages.

Myo Mapper is open-source software that rescales, centres, and maps Myo data into OSC and MIDI messages.

Downloads: 2 This Week

Last Update: 2018-01-28
See Project
17

Skinner

Special Effects with Skinned Mesh in Unity

Skinner is a collection of real-time special effects for Unity that use vertices of an animating skinned mesh as emission points. Instead of duplicating mesh data on the CPU, it employs a replacement shader to stream vertex positions into GPU-friendly buffers, conserving memory and CPU cycles. With those GPU-side buffers, Skinner can drive effects like trails, particles, or geometry that react to the underlying skinned animation in sophisticated ways. The approach enables complex, performant...

Downloads: 0 This Week

Last Update: 2025-10-28
See Project
18

subs2srs

Convert movies and TV shows to flashcards

subs2srs allows you to create import files for Anki or other Spaced Repetition Systems (SRS) based on your favorite foreign language movies and TV shows to aid in the language learning process. See http://subs2srs.sourceforge.net/ for more information.

2 Reviews

Downloads: 41 This Week

Last Update: 2017-10-01
See Project
19

cbrTekStraktor

An application to automatically extract text from comic books.

...The application also lets you manually define text areas in CBR files. It includes a simple graphical editor for further processing the extracted text. The text extraction is achieved by a combination of statistical and graphical processing operations. It is based on three major algorithms: binarization of color images (Niblack and other methods), connected components, and K-means clustering. Tesseract is used to perform Optical Character Recognition on the extracted text. ...

Downloads: 2 This Week

Last Update: 2017-06-14
See Project
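
The cbrTekStraktor entry above names Niblack binarization as one of its algorithms. As a minimal sketch of the general technique (not the application's own code), the example below computes a per-pixel threshold `mean + k * std` over a local window and marks pixels darker than that threshold as text. The function name, the list-of-lists grayscale input, the 3x3 window, and `k = -0.2` are assumptions chosen for illustration.

```python
import math


def niblack_binarize(image, window=3, k=-0.2):
    """Return a 0/1 mask: 1 where a pixel is darker than its local
    Niblack threshold (local mean + k * local standard deviation).

    `image` is a list of rows of grayscale values (hypothetical input format).
    """
    height = len(image)
    width = len(image[0])
    half = window // 2
    mask = []
    for y in range(height):
        row = []
        for x in range(width):
            # Collect the neighbourhood, clipped at the image borders.
            values = [
                image[yy][xx]
                for yy in range(max(0, y - half), min(height, y + half + 1))
                for xx in range(max(0, x - half), min(width, x + half + 1))
            ]
            mean = sum(values) / len(values)
            variance = sum((v - mean) ** 2 for v in values) / len(values)
            threshold = mean + k * math.sqrt(variance)
            row.append(1 if image[y][x] < threshold else 0)
        mask.append(row)
    return mask


if __name__ == "__main__":
    page = [
        [200, 200, 200],
        [200, 0, 200],
        [200, 200, 200],
    ]
    for mask_row in niblack_binarize(page):
        print(mask_row)
```

Because the threshold adapts to each neighbourhood, this kind of method copes with uneven backgrounds better than a single global threshold, which is presumably why it suits scanned comic pages.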
20

OCR Web based

Web-based OCR for the Firefox browser and PC

...id=com.ulm.ocr
Add-on for Opera: http://bit.ly/1F0E0wP
Release 1.0.1. For safety reasons, I disabled the option to import an image from a URL. You can also write or draw directly on the canvas to get character recognition and text extraction.

2 Reviews

Downloads: 0 This Week

Last Update: 2018-09-05
See Project
21

3DBrainExplorer

3DBrainExplorer is a lightweight OpenGL-based 3D renderer for scientific visualization of volumetric data (NIfTI) featuring real-time image manipulation, AAL brain labels (Tzourio-Mazoyer et al., 2002), SPM5 statistical images and stereo display.

Downloads: 1 This Week

Last Update: 2017-04-25
See Project
22

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation algorithms. The Millennium ASR provides C++ and Python libraries for automatic speech recognition. It implements a weighted finite state transducer (WFST) decoder as well as training and adaptation methods. These toolkits are meant to facilitate research and development of automatic distant speech recognition.

Downloads: 0 This Week

Last Update: 2019-08-21
See Project
23

Python Audio Tools

Python Audio Tools are a collection of audio handling programs. These include programs for CD extraction, track conversion from one audio format to another, track renaming and retagging, track identification, CD burning from tracks, and more.

1 Review

Downloads: 14 This Week

Last Update: 2019-04-23
See Project
24

MORPHE

MORphological PHenotype Extraction

MORphological PHenotype Extraction (MORPHE) is a suite of automated image processing, visualization, and classification algorithms to facilitate the analysis of heritable and clonal red-to-green transitions that occurred during the growth of a colony.

Downloads: 1 This Week

Last Update: 2016-10-25
See Project
25

jAudio

jAudio is a project implementing a number of common feature extraction algorithms for analyzing audio in Java. Output is either in Weka's ARFF format or the ACE format. It can be used as a GUI application, as a command-line application, or embedded. The primary repository is on GitHub (dmcennis), which pushes to this project's GitHub repository.

Downloads: 2 This Week

Last Update: 2016-02-26
See Project