Showing 105 open source projects for "extraction"

View related business solutions
  • Earn up to 16% annual interest with Nexo. Icon
    Earn up to 16% annual interest with Nexo.

    More flexibility. More control.

    Generate interest, access liquidity without selling, and execute trades seamlessly. All in one platform. Geographic restrictions, eligibility, and terms apply.
    Get started with Nexo.
  • $300 in Free Credit Towards Top Cloud Services Icon
    $300 in Free Credit Towards Top Cloud Services

    Build VMs, containers, AI, databases, storage—all in one place.

    Start your project in minutes. After credits run out, 20+ products include free monthly usage. Only pay when you're ready to scale.
    Get Started
  • 1
    Kaldi

    Kaldi

    kaldi-asr/kaldi is the official location of the Kaldi project

    ...Kaldi is designed for researchers who need a highly customizable environment to experiment with new algorithms, as well as for practitioners who want robust, production-ready ASR pipelines. It includes extensive tools for data preparation, feature extraction, acoustic and language modeling, decoding, and evaluation. With its modular design, Kaldi allows users to adapt the system to a wide range of languages and domains. As one of the most influential projects in speech recognition, it has become a foundation for much of the modern work in ASR.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 2
    3FS

    3FS

    A high-performance distributed file system

    The 3FS repository (standing likely for “Feature 3F System” or similar) is focused on providing a feature extraction and transformation framework tailored to deep and large models, especially in token-based systems. Its primary aim is to support efficient and scalable feature transformation pipelines—especially for inference environments—by batching, caching, and integrating feature-based modules like segmenters, sparse retrievers, and scorers seamlessly.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    BrowserOS

    BrowserOS

    Agentic browser; privacy-first alternative to ChatGPT Atlas

    BrowserOS is an open-source, agentic web browser built on a Chromium base that integrates AI agents directly into the browsing experience. Rather than just doing standard browsing, it places AI intelligence at the core: you can connect your own API keys (for e.g., OpenAI, Anthropic, Google Gemini) or run local models (via e.g., Ollama) so that your browsing data and automation stay on your machine — privacy and control are emphasized throughout. The interface remains familiar to users of...
    Downloads: 35 This Week
    Last Update:
    See Project
  • 4
    PaddleOCR-json

    PaddleOCR-json

    OCR offline image text recognition command line windows program

    PaddleOCR-json is an OCR engine based on the PaddleOCR project that provides a command-line interface and tools for extracting text from images and exporting results in structured JSON format. It wraps the PaddleOCR models, which are capable of detecting and recognizing text in a wide variety of languages and layouts, into a self-contained executable that can be run locally without needing a deep learning environment configured manually. This makes it practical for developers or system...
    Downloads: 5 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5
    eos

    eos

    A lightweight 3D Morphable Face Model library in modern C++

    eos is a lightweight 3D Morphable Face Model fitting library that provides basic functionality to use face models, as well as camera and shape fitting functionality. It's written in modern C++11/14. MorphableModel and PcaModel classes to represent 3DMMs, with basic operations like draw_sample(). Supports the Surrey Face Model (SFM), 4D Face Model (4DFM), Basel Face Model (BFM) 2009 and 2017, and the Liverpool-York Head Model (LYHM) out-of-the-box.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 6
    Simd Library

    Simd Library

    C++ image processing and machine learning library with using of SIMD

    The Simd Library is a free open-source image processing and machine learning library, designed for C and C++ programmers. It provides many useful high-performance algorithms for image processing such as pixel format conversion, image scaling and filtration, extraction of statistical information from images, motion detection, object detection and classification, neural networks. The algorithms are optimized with using of different SIMD CPU extensions. In particular, the library supports the following CPU extensions: SSE, AVX, AVX-512, and AMX for x86/x64, and NEON for ARM. The Simd Library has C API and also contains useful C++ classes and functions to facilitate access to C API. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 7
    Super Audio CD Decoder
    ...Decoder is capable of playing back Super Audio CD ISO images, DSDIFF, DSF and DSD WavPack files. DSD(DoP) and PCM output modes. Separate DSD Processor/DSD Converter plugins for track extraction into DSD/DST encoded files.
    Leader badge
    Downloads: 5,301 This Week
    Last Update:
    See Project
  • 8
    SecurePose

    SecurePose

    Automated Face Blurring, Kinematics Extraction and Leg dystonia Dx

    ...This validation establishes its effectiveness and usability in clinically recorded gait videos for face blurring and kinematics extraction. For installation, https://www.rishabh-bajpai.com/secureposeinstallation Tutorial Videos https://www.youtube.com/playlist?list=PLO4_jCYO5Ib23MoBpn-Wpj1_b6DAYlDwk Please cite the paper: https://arxiv.org/abs/2402.14143
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    GeoDMA

    GeoDMA

    Geographic feature extraction and data mining

    GeoDMA is a plugin for TerraView software, used for geographical data mining. With a single image, the user can perform segmentation, attributes extraction, normalization and classification.
    Downloads: 1 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10

    NeuroSuite

    Klusters, NeuroScope and NDManager

    ...NDManager (Neurophysiological Data Manager) is a simple graphical application designed to help neurophysiologists manage their experimental recording parameters (e.g., number of channels and sampling rate of the acquisition system) and process their data (data filtering, spike extraction, PCA, etc.)
    Leader badge
    Downloads: 26 This Week
    Last Update:
    See Project
  • 11
    LabPlot

    LabPlot

    Data Visualization and Analysis

    LabPlot is a FREE, open source and cross-platform Data Visualization and Analysis software accessible to everyone.
    Downloads: 29 This Week
    Last Update:
    See Project
  • 12

    UniversalTextExtractor

    Command-line toolset for extracting text from files

    Command-line toolset for extracting text from files (documents, images, archives) into SQLite with OCR support. Simple, expandable, one shell script only.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    DocWire SDK

    DocWire SDK

    Award-winning modern data processing SDK in C++20

    DocWire SDK, a standout C++20AI driven data processing tool, has received award from SourceForge and strong backing from Microsoft. It handles nearly 100 file types, empowering efficient text extraction, web data extraction, and document analysis. For businesses, the shift to DocWire SDK signifies a leap forward. It promises comprehensive document format support and the ability to extract valuable insights from email boxes, databases, and websites using cutting-edge AI. DocWire SDK aims to expand its capabilities, focusing on versatile data extraction, platform support, and seamless integration with various systems. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 14
    The 3D Toolkit provides algorithms and methods to process 3D point clouds. In includes automatic precise registration (6D simultaneous localization and mapping, 6D SLAM) and other tools, e.g., a fast 3D viewer, plane extraction software, etc.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    Ion Beam Simulator
    Library for ion optics, plasma extraction and space charge dominated ion beam transport.
    Leader badge
    Downloads: 17 This Week
    Last Update:
    See Project
  • 16
    jbig2enc

    jbig2enc

    JBIG2 Encoder

    ...JBIG2 encodes bi-level (1 bpp) images using a number of clever tricks to get better compression than G4. This encoder can: Generate JBIG2 files, or fragments for embedding in PDFs Generic region encoding Perform symbol extraction, classification and text region coding Perform refinement coding and, Compress multipage documents It uses the (Apache-ish licensed) Leptonica library: http://leptonica.com/
    Downloads: 88 This Week
    Last Update:
    See Project
  • 17

    Language-Aware String Extractor

    multi-encoding strings(1) replacement with language identification

    Enhanced version of the standard Unix strings(1) program which uses language models for automatic language identification and character-set identification, supporting over 1400 languages, dozens of character encodings, and 4800+ language/encoding pairs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    winLAME

    winLAME

    winLAME is an easy to use encoder for many audio formats, e.g. MP3.

    winLAME is an easy to use encoder for many audio formats, including MP3, Opus, Ogg Vorbis and more. winLAME lets you read in audio tracks from CDs or encode audio files from your hard drive. The intuitive wizard-style user interface makes it easy to set up encoding settings. Multicore CPUs are used to parallelize encoding tasks. Official GitHub project webpage: https://github.com/vividos/winLAME
    Downloads: 45 This Week
    Last Update:
    See Project
  • 19
    VideoSubFinder
    The main purpose of this program is to provide functionality for extract hardcoded subtitles (hardsub) from video. It provides two main features: 1) Autodetection of frames with hardcoded text (hardsub) on video with saving info about timing positions. 2) Generation of cleared from background text images, which allows with usage of OCR programs (like FineReader, Subtitle Edit, Google Drive) to generate complete subtitles with original text and timing. For working of this program on...
    Leader badge
    Downloads: 532 This Week
    Last Update:
    See Project
  • 20
    GXSM

    GXSM

    Scanning Probe Microscopy Controller and Data Visualization Software

    GXSM -- Gnome X Scanning Microscopy: A multi-channel image and vector-probe data acquisition and visualization system designed for SPM techniques (STM,AFM..), but also SPA-LEED/LEED/LEEM data analysis. A plug-in interface allows any user add-on data-processing and special hardware and instrument support. Latest: NC-AFM and related explorative methods as SQDM can be configured. High-Speed external PAC-PLL hardware option with digital DSP link. Based on several hardware options it supports...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 21
    RedtDec

    RedtDec

    RetDec is a retargetable machine-code decompiler based on LLVM

    The decompiler is not limited to any particular target architecture, operating system, or executable file format. ELF, PE, Mach-O, COFF, AR (archive), Intel HEX, and raw machine code supported. 32-bit: Intel x86, ARM, MIPS, PIC32, and PowerPC 64-bit: x86-64 supported. Demangling of symbols from C++ binaries (GCC, MSVC, Borland). Reconstruction of functions, types, and high-level constructs. Output in two high-level languages: C and a Python-like language. Generation of call graphs,...
    Downloads: 24 This Week
    Last Update:
    See Project
  • 22
    QXmlEdit

    QXmlEdit

    Simple XML editor and XSD viewer

    QXmlEdit is a simple XML editor written in qt. Its main features are unusual data visualization modes, nice XML manipulation and presentation and it is multi platform. It can split very big XML files into fragments, compare XML and XSD files, and has a graphical XSD viewers. Project site: http://qxmledit.org Source code hosted at GitHub (moved from Google Code) https://github.com/lbellonda/qxmledit Report issues at: https://github.com/lbellonda/qxmledit/issues Discussion...
    Leader badge
    Downloads: 118 This Week
    Last Update:
    See Project
  • 23
    7-Zip-JBinding

    7-Zip-JBinding

    Java wrapper for 7z archiver engine

    Native (JNI) cross-platform library to extract (password protected, multi-part) 7z Zip Rar Tar Split Lzma Iso HFS GZip Cpio BZip2 Z Arj Chm Lhz Cab Nsis Deb Rpm Wim Udf archives and create 7z, Zip, Tar, GZip & BZip2 from Java.
    Leader badge
    Downloads: 24 This Week
    Last Update:
    See Project
  • 24

    Aseryla2

    Aseryla2 code repositories

    This project describes a model of how the semantic human memory represents the information relevant to the objects of the world in text format. It provides a system and a GUI application capable of extracting and managing concepts and relations from English texts. https://aseryla2.sourceforge.io/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    pak

    PAK file editor for Quake engine games

    A utility for manipulating .PAK files used by Quake and Quake 2 engine games. Allows for creation of .PAK data files from directories, extraction, individual file/directory insertion and extraction, and file/directory deletion.
    Downloads: 7 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB