Showing 161 open source projects for "extraction"

  • 1
    Orange

    OpenResty/Nginx Gateway for API monitoring and management

    A gateway based on OpenResty (Nginx + Lua) for API monitoring and management. We recommend installing Orange with luarocks to reduce problems caused by dependency extensions in different operating system releases. System dependencies (openresty, resty-CLI, luarocks, etc.) are necessary to install Orange on different operating systems. By default, a Dashboard is provided to manage all Orange plugin data. All of Orange's plugins have open APIs that can be used to achieve more personalized...
    Downloads: 0 This Week
  • 2
    Enlive

    Selector-based templating and transformation system for Clojure

    Enlive is a Clojure library for HTML templating, transformation, and scraping, supporting composable manipulation of HTML/XML in a functional style. It allows selecting, transforming, and generating HTML fragments using CSS selectors, and supports server-side template composition, dynamic pages, and content rewriting. By default selector-transformation pairs are run sequentially. When you know that several transformations are independent, you can now specify (as an optimization) to process...
    Downloads: 0 This Week
  • 3
    Duckling (Old)

    Clojure library that parses text into structured data

    Duckling (the “old” archived version) is a natural language processing library (in Clojure) for parsing text into structured data: specifically, recognizing quantities such as dates, times, durations, measurements, and currencies in free-form text. To use Duckling in your project, you need just two functions: load! to load the default configuration, and parse to parse a string. See our blog post announcement for more...
    Downloads: 0 This Week
  • 4
    MITIE

    MITIE: library and tools for information extraction

    This project provides free (even for commercial use) state-of-the-art information extraction tools. The current release includes tools for performing named entity extraction and binary relation detection, as well as tools for training custom extractors and relation detectors. MITIE is built on top of dlib, a high-performance machine-learning library[1]. It makes use of several state-of-the-art techniques, including distributional word embeddings[2] and Structural Support Vector Machines[3]. ...
    Downloads: 0 This Week
  • 5
    Simd

    High performance image processing library in C++

    The Simd Library is a free open source image processing library designed for C and C++ programmers. It provides many useful high-performance algorithms for image processing, such as pixel format conversion, image scaling and filtration, extraction of statistical information from images, motion detection, object detection (HAAR and LBP classifier cascades) and classification, and neural networks. The algorithms are optimized using different SIMD CPU extensions. In particular, the library supports the following CPU extensions: SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2, AVX, AVX2 and AVX-512 for x86/x64, VMX (Altivec) and VSX (Power7) for PowerPC, and NEON for ARM. ...
    Downloads: 37 This Week
  • 6
    iText®, a JAVA PDF library

    PDF Library for Developers

    iText is an open-source PDF library available for Java and .NET (C#). iText allows you to effortlessly generate and manipulate standards-compliant PDF documents with a powerful and feature-rich SDK. With iText, you can create archivable and accessible PDFs, split and merge documents, fill and flatten forms, digitally sign documents, and more. iText add-ons enable additional functionality, such as PDF creation from HTML templates, secure redaction, OCR, and much more. The latest...
    Downloads: 164 This Week
  • 7
    pyhanlp

    Chinese word segmentation

    ...The project focuses on making HanLP’s capabilities accessible through a Python-friendly API surface, so you can integrate NLP steps into data pipelines, notebooks, and downstream ML or information-extraction code. In practice, it serves as a bridge layer: Python calls are translated into the corresponding HanLP operations, so you can keep your application logic in Python while relying on HanLP’s implementations. It is especially useful when you need a pragmatic “get results quickly” NLP layer for segmentation, tagging, entity extraction, parsing, or keyword-style tasks rather than experimenting with model training from scratch.
    Downloads: 0 This Week
  • 8
    Deeplearning-papernotes

    Summaries and notes on Deep Learning research papers

    Deeplearning-papernotes is an implementation of Convolutional Neural Networks for sentence and text classification in TensorFlow, based on a well-known research paper that applies CNN architectures to natural language processing tasks with strong performance in sentiment analysis and similar classification problems. The repository provides the complete network definition, including an embedding layer to convert words into dense representations, convolution and max-pooling layers to extract...
    Downloads: 0 This Week
  • 9
    TextTeaser

    TextTeaser is an automatic summarization algorithm

    TextTeaser is an automatic text summarization algorithm implemented in Python. It extracts the most important sentences from an article to generate concise summaries that retain the core meaning of the original text. The algorithm uses features such as sentence length, keyword frequency, and position within the document to determine which sentences are most relevant. By combining these features with a simple scoring mechanism, it produces summaries that are both readable and informative...
    Downloads: 1 This Week
  • 10
    Matlab/Octave Rotations Library

    Library for working with 3D rotations in Matlab/Octave

    ...The supported rotation representations are rotation matrices (Rotmat), quaternions (Quat), intrinsic ZYX Euler angles (Euler), fused angles (Fused) and tilt angles (Tilt). Operations such as composition, inversion, ZYX yaw extraction, fused yaw extraction, random generation, equality detection, vector rotation and much more have been implemented for each rotation representation. Tilt vector addition and the relative and absolute tilt phase spaces are also supported. Unit tests for each function are provided to ensure that the library performs as it should.
    Downloads: 1 This Week
  • 11
    Rotations Conversion Library

    Library for working with 3D rotations in C++

    The Rotations Conversion Library (RCL) is a collection of C++ functions that address common computations and numerical handling of rotations in 3D Euclidean space, including support for rotation matrices (`Rotmat`), quaternions (`Quat`), intrinsic ZYX Euler angles (`Euler`), fused angles (`Fused`) and tilt angles (`Tilt`). In addition to the core capability of converting between each of the representations, operations such as inversion, ZYX yaw extraction, fused yaw extraction, renormalisation, equality detection, vector rotation and much more have been implemented. Unit tests for each function are provided to ensure that the library performs exactly as it should. The implementation of the Rotations Conversion Library is based on the closely related Matlab/Octave Rotations Library (MORL), which is a highly tested reference implementation of the required algorithms (see https://sourceforge.net/projects/rotationslib/).
    Downloads: 0 This Week
  • 12
    nexmon

    The C-based Firmware Patching Framework for Broadcom/Cypress WiFi Chips

    The C-based Firmware Patching Framework for Broadcom/Cypress WiFi Chips enables Monitor Mode, Frame Injection, and much more. Nexmon is our C-based firmware patching framework for Broadcom/Cypress WiFi chips that enables you to write your own firmware patches, for example, to enable monitor mode with radiotap headers and frame injection. This repository mainly focuses on enabling monitor mode and frame injection on many chips. A real Wi-Fi jammer that allows to overlay ongoing frame...
    Downloads: 3 This Week
  • 13

    Specstractor

    Tool chain for the automated extraction/analysis of system invariants

    Specstractor is an automated tool chain for the extraction and analysis of system specifications in the form of system invariants. These automatically generated invariants yield useful insight into the actual system behavior and can reveal flaws and missing elements in the developer-maintained specifications. Specstractor consists of two main components: the Extractor, which uses data mining to extract invariants from automatically generated test cases of a system, and Insight, a web-based tool that gives the user high-level views and visualizations of the resulting invariants. ...
    Downloads: 0 This Week
  • 14

    LEP

    Literate programming for Eclipse

    Literate Eclipse Programming assists developers in writing structured, well-documented source code. LEP integrates automated extraction of source code and documentation into a familiar Eclipse environment. Compiling and debugging are fully supported.
    Downloads: 0 This Week
  • 15
    ttag

    Simple approach for JavaScript localization

    A modern JavaScript i18n localization library based on ES6 tagged templates and the good old GNU gettext. Just tag your strings to make them translatable. Use the simple ttag-cli tool for translation extraction. It integrates easily with almost any workflow because it uses a Babel plugin for string extraction, and it works well with TypeScript. It also allows you to place translations into the sources at build time. Gettext is a simple localization format with a rich ecosystem. ttag supports plurals, contexts, translator comments, and much more.
    Downloads: 1 This Week
  • 16
    aeneas

    Automagically synchronize audio and text (aka forced alignment)

    aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment). aeneas automatically generates a synchronization map between a list of text fragments and an audio file containing the narration of the text. In computer science this task is known as (automatically computing a) forced alignment.
    Downloads: 7 This Week
  • 17

    BTCore

    A Collection of Useful Java/Swing Code

    Newest feature: added one-line ".zip" extraction to the Util class, plus one-line ".zip" extraction from a URL (web) and one-line file download from a URL. BTCore is a library designed to be used with all of Banotech's software. It includes a tremendous amount of code snippets and classes that have been boiled down for ease of use by everyone. BTCore is, and always will be, free.
    Downloads: 0 This Week
  • 18

    Distant Speech Recognition

    Beamforming and Speech Recognition Toolkit

    BTK contains C++ and Python libraries that implement speech processing and microphone array techniques such as speech feature extraction, speech enhancement, speaker tracking, beamforming, dereverberation and echo cancellation. The Millennium ASR provides C++ and Python libraries for automatic speech recognition, implementing a weighted finite state transducer (WFST) decoder along with training and adaptation methods. These toolkits are meant to facilitate research and development in automatic distant speech recognition.
    Downloads: 0 This Week
  • 19
    CRFasRNN

    Semantic image segmentation method described in the ICCV 2015 paper

    CRF-RNN is a deep neural architecture that integrates fully connected Conditional Random Fields (CRFs) with Convolutional Neural Networks (CNNs) by reformulating mean-field CRF inference as a Recurrent Neural Network. This fusion enables end-to-end training via backpropagation for semantic image segmentation tasks, eliminating the need for separate, offline post-processing steps. Our work allows computers to recognize objects in images; what is distinctive about our work is that we also...
    Downloads: 0 This Week
  • 20
    Python Audio Tools are a collection of audio handling programs. These include programs for CD extraction, track conversion from one audio format to another, track renaming and retagging, track identification, CD burning from tracks, and more.
    Downloads: 28 This Week
  • 21

    eLibrary

    Personalized Search Engine for Commonly Used Files

    eLibrary (electric library) is a Java application for searching files and folders in an OS file system. It differs from general OS file search engines in that it personalizes the indexing setup, so users can choose which directories to index or remove from an existing index, and it can also suggest queries, much like Google's "Did you mean" feature. The customization of indexing and query suggestion greatly improves search speed and makes the user experience more comfortable. eLibrary can also extract...
    Downloads: 0 This Week
  • 22
    GUAJE FUZZY

    Free software for generating understandable and accurate fuzzy systems

    ...Thus, it is a free software tool (licensed under GPL-v3) that supports the design of interpretable and accurate fuzzy systems by combining several preexisting open source tools and taking advantage of the main strengths of each. It is a user-friendly, portable tool designed to make knowledge extraction and representation for fuzzy systems easier, paying special attention to interpretability issues. GUAJE lets the user define expert variables and rules, and also provides supervised and fully automatic learning capabilities. Both types of knowledge, expert and induced, are integrated under expert supervision, ensuring interpretability, simplicity and consistency of the knowledge base throughout the whole process. ...
    Downloads: 0 This Week
  • 23

    mwetoolkit

    THIS PROJECT MIGRATED TO https://gitlab.com/mwetoolkit/mwetoolkit3/

    The Multiword Expressions toolkit aids in the automatic identification and extraction of multiword units in running text. These include idioms (kick the bucket), noun compounds (cable car), phrasal verbs (take off, give up), etc. Even though it focuses on multiword expressions, the framework is quite complete and can also be useful in any corpus-based study in computational linguistics. The mwetoolkit can be applied to virtually any text collection, language, and MWE type. ...
    Downloads: 0 This Week
  • 24
    SrcME

    Source Code Metrics Extraction tool

    The SrcME (Source Code Metrics Extraction) tool helps extract various code metrics from source code in different programming languages (initially C/C++ and Java).
    Downloads: 0 This Week
  • 25
    IBEX_MDA

    An open infrastructure software to facilitate radiomics research.

    ...The latest stand-alone version can be downloaded at http://bit.ly/IBEX_MDAnderson. The latest source-code version can be downloaded at http://bit.ly/IBEXSrc_MDAnderson. IBEX (imaging biomarker explorer) is an open infrastructure software platform that flexibly supports common radiomics workflow tasks such as multimodality image data import and review, development of feature extraction algorithms, model validation, and consistent data sharing among multiple institutions. The IBEX software package was developed using the MATLAB and C/C++ programming languages. The software architecture uses modern model-view-controller, unit testing, and function handle programming concepts to isolate each quantitative imaging analysis task, to validate that the relevant data and algorithms are fit for use, and to plug in new modules.
    Downloads: 1 This Week