extraction free download

Showing 26 open source projects for "extraction"

Filters: Artificial Intelligence, C++

1

Kaldi

kaldi-asr/kaldi is the official location of the Kaldi project

...Kaldi is designed for researchers who need a highly customizable environment to experiment with new algorithms, as well as for practitioners who want robust, production-ready ASR pipelines. It includes extensive tools for data preparation, feature extraction, acoustic and language modeling, decoding, and evaluation. With its modular design, Kaldi allows users to adapt the system to a wide range of languages and domains. As one of the most influential projects in speech recognition, it has become a foundation for much of the modern work in ASR.

Downloads: 3 This Week

Last Update: 3 days ago
2

3FS

A high-performance distributed file system

3FS (Fire-Flyer File System) is a high-performance distributed file system designed for AI training and inference workloads.

Downloads: 1 This Week

Last Update: 2026-03-30
3

PaddleOCR-json

OCR offline image text recognition command line windows program

PaddleOCR-json is an OCR engine based on the PaddleOCR project that provides a command-line interface and tools for extracting text from images and exporting results in structured JSON format. It wraps the PaddleOCR models, which are capable of detecting and recognizing text in a wide variety of languages and layouts, into a self-contained executable that can be run locally without needing a deep learning environment configured manually. This makes it practical for developers or system...

Downloads: 7 This Week

Last Update: 2026-01-15
4

BrowserOS

Agentic browser; privacy-first alternative to ChatGPT Atlas

BrowserOS is an open-source, agentic web browser built on a Chromium base that integrates AI agents directly into the browsing experience. Rather than offering only standard browsing, it places AI at the core: you can connect your own API keys (e.g., OpenAI, Anthropic, Google Gemini) or run local models (e.g., via Ollama) so that your browsing data and automation stay on your machine; privacy and control are emphasized throughout. The interface remains familiar to users of...

Downloads: 16 This Week

Last Update: 2026-04-08
5

Simd Library

C++ image processing and machine learning library with using of SIMD

The Simd Library is a free, open-source image processing and machine learning library designed for C and C++ programmers. It provides many high-performance algorithms for image processing, such as pixel format conversion, image scaling and filtering, extraction of statistical information from images, motion detection, object detection and classification, and neural networks. The algorithms are optimized using various SIMD CPU extensions. In particular, the library supports the following CPU extensions: SSE, AVX, AVX-512, and AMX for x86/x64, and NEON for ARM. The Simd Library has a C API and also contains C++ classes and functions that simplify access to the C API. ...

Downloads: 2 This Week

Last Update: 2026-04-01
6

eos

A lightweight 3D Morphable Face Model library in modern C++

eos is a lightweight 3D Morphable Face Model fitting library that provides basic functionality to use face models, as well as camera and shape fitting functionality. It is written in modern C++11/14. It includes MorphableModel and PcaModel classes to represent 3DMMs, with basic operations like draw_sample(). It supports the Surrey Face Model (SFM), 4D Face Model (4DFM), Basel Face Model (BFM) 2009 and 2017, and the Liverpool-York Head Model (LYHM) out of the box.

Downloads: 3 This Week

Last Update: 2024-12-10
7

SecurePose

Automated Face Blurring, Kinematics Extraction and Leg dystonia Dx

...This validation establishes its effectiveness and usability in clinically recorded gait videos for face blurring and kinematics extraction. Installation: https://www.rishabh-bajpai.com/secureposeinstallation. Tutorial videos: https://www.youtube.com/playlist?list=PLO4_jCYO5Ib23MoBpn-Wpj1_b6DAYlDwk. Please cite the paper: https://arxiv.org/abs/2402.14143

Downloads: 0 This Week

Last Update: 2024-09-13
8

GeoDMA

Geographic feature extraction and data mining

GeoDMA is a plugin for the TerraView software, used for geographical data mining. With a single image, the user can perform segmentation, attribute extraction, normalization and classification.

1 Review

Downloads: 0 This Week

Last Update: 2026-01-20
9

DocWire SDK

Award-winning modern data processing SDK in C++20

DocWire SDK, a C++20 AI-driven data processing tool, has received an award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, supporting efficient text extraction, web data extraction, and document analysis. It offers comprehensive document format support and the ability to extract insights from email boxes, databases, and websites using AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. ...

Downloads: 6 This Week

Last Update: 2026-03-27
10

VideoSubFinder

The main purpose of this program is to extract hardcoded subtitles (hardsub) from video. It provides two main features: 1) Autodetection of frames with hardcoded text (hardsub) in the video, saving their timing positions. 2) Generation of text images cleared of background, which allows OCR programs (like FineReader, Subtitle Edit, Google Drive) to generate complete subtitles with the original text and timing. For working of this program on...

18 Reviews

Downloads: 522 This Week

Last Update: 2023-05-01
11

Aseryla2

Aseryla2 code repositories

This project describes a model of how the semantic human memory represents the information relevant to the objects of the world in text format. It provides a system and a GUI application capable of extracting and managing concepts and relations from English texts. https://aseryla2.sourceforge.io/

Downloads: 0 This Week

Last Update: 2022-05-03
12

aseryla

Aseryla code repositories

This project describes a model of how the semantic human memory represents the information relevant to the objects of the world in text format. It provides a system and a GUI application capable of extracting and managing concepts and relations from English texts. https://aseryla2.sourceforge.io/

Downloads: 0 This Week

Last Update: 2021-10-29
13

SmartOpenCV

OpenCV enhancement library for Android

SmartOpenCV is an enhanced computer vision toolkit built on top of OpenCV that aims to simplify and extend common image processing and computer vision tasks through higher-level abstractions and utilities. It provides pre-built modules and optimized pipelines for tasks such as object detection, image transformation, and feature extraction, reducing the need for low-level implementation. The framework is designed to be more developer-friendly than raw OpenCV by offering cleaner APIs and reusable components that accelerate development. It supports integration with machine learning models, enabling hybrid workflows that combine traditional computer vision techniques with modern AI approaches. ...

Downloads: 0 This Week

Last Update: 2026-03-19
14

Simd

High performance image processing library in C++

The Simd Library is a free, open-source image processing library designed for C and C++ programmers. It provides many high-performance algorithms for image processing, such as pixel format conversion, image scaling and filtering, extraction of statistical information from images, motion detection, object detection (HAAR and LBP classifier cascades) and classification, and neural networks. The algorithms are optimized using various SIMD CPU extensions. In particular, the library supports the following CPU extensions: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX-512 for x86/x64, VMX (Altivec) and VSX (Power7) for PowerPC, and NEON for ARM. ...

3 Reviews

Downloads: 36 This Week

Last Update: 2019-02-01
15

Convolutional Recurrent Neural Network

Convolutional Recurrent Neural Network (CRNN) for image-based sequence

Convolutional Recurrent Neural Network provides an implementation of the Convolutional Recurrent Neural Network (CRNN) architecture, a deep learning model designed for image-based sequence recognition tasks such as optical character recognition and scene text recognition. The architecture combines convolutional neural networks for extracting visual features from images with recurrent neural networks that model sequential dependencies in the extracted features. This hybrid approach allows the...

Downloads: 0 This Week

Last Update: 2026-03-12
16

Distant Speech Recognition

Beamforming and Speech Recognition Toolkit

BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation. The Millennium ASR provides C++ and Python libraries for automatic speech recognition. It implements a weighted finite state transducer (WFST) decoder as well as training and adaptation methods. These toolkits are meant to facilitate research and development of automatic distant speech recognition.

Downloads: 0 This Week

Last Update: 2019-08-21
17

Phrasal

Statistical phrase-based machine translation system

...Distinctive features include: an easy-to-use API for implementing new decoding model features, the ability to translate using phrases that include gaps (Galley et al. 2010), and conditional extraction of phrase tables and lexical reordering models. Developed by The Natural Language Processing Group at Stanford University, a team of faculty, postdocs, programmers and students who work together on algorithms that allow computers to process and understand human languages. Our work ranges from basic research in computational linguistics to key applications in human language technology, and covers areas such as sentence understanding, automatic question answering, machine translation, syntactic parsing and tagging, and sentiment analysis.

Downloads: 0 This Week

Last Update: 2021-01-19
18

CILP++

In-development neural-symbolic system, CILP++

This project extends the original C-IL2P neural-symbolic system as CILP++, for reasoning, knowledge extraction and theory revision from propositional and first-order logics. *** ATTENTION! The CILP++ project has been discontinued on SVN and migrated to GitHub at https://github.com/manoelfranca/cilppp

Downloads: 0 This Week

Last Update: 2017-06-27
19

Accelerated Feature Extraction Tool

A fast GPU accelerated feature extraction software for speech analysis

A fast feature extraction software tool for speech analysis and processing. It incorporates standard MFCC, PLP, and TRAPS features. The tool is specially designed to process very large audio data sets. It uses GPU acceleration if a compatible GPU is available (CUDA as well as OpenCL; NVIDIA, AMD, and Intel GPUs are supported). The CPU SSE intrinsic instruction set is used when no compatible GPU is present.

1 Review

Downloads: 0 This Week

Last Update: 2015-05-25
20

ivantk

Image-based Vascular Analysis Toolkit

The Image-based Vascular Analysis Toolkit is a set of multiplatform C++ libraries for vascular analysis of (3D) medical images, typically CT or MRI. It can be considered as an extension of the Insight Toolkit (ITK) for vascular image analysis, with methods for detection, extraction and modeling of vascular structures.

Downloads: 0 This Week

Last Update: 2013-09-16
21

openSMILE

SMILE = Speech & Music Interpretation by Large Space Extraction. openSMILE is a fast, real-time (audio) feature extraction utility for automatic speech, music and paralinguistic recognition research, developed originally at TUM in the scope of the EU project SEMAINE and now maintained and supported by audEERING.

Downloads: 0 This Week

Last Update: 2014-11-27
22

openEAR

openEAR is the Munich Open-Source Emotion and Affect Recognition Toolkit developed at the Technische Universität München (TUM). It provides efficient (audio) feature extraction algorithms implemented in C++, classifiers, and pre-trained models on well-known emotion databases. It is now maintained and supported by audEERING. Updates will follow soon.

4 Reviews

Downloads: 8 This Week

Last Update: 2015-08-06
23

visual recognition of gestures

This project shares my work on visual recognition of gestures. It shows how I have implemented person detection, color adaptation and tracking of the user's right hand, gesture feature extraction, etc. using a single video camera.

1 Review

Downloads: 0 This Week

Last Update: 2014-06-30
24

Ontolib

A multi-platform information extraction/ontology population library from HTML documents, written in C++

Downloads: 0 This Week

Last Update: 2013-03-27
25

Feature Extraction plugin API

Easy-to-use platform-independent plugin API for the extraction of low-level features from audio data in PCM format, as required in the context of music information retrieval software.

Downloads: 0 This Week

Last Update: 2013-04-17