Showing 779 open source projects for "extraction"

View related business solutions
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1
    Roomba

    Roomba

    A Node.js tool to examine the correctness of Open Data Metadata

    Linked Open Data (LOD) has emerged as one of the largest collection of interlinked datasets on the web. Benefiting from this mine of data requires the existence of descriptive information about each dataset in the accompanying metadata. Such meta information is currently very limited to few data portals where they are usually provided manually thus giving little or bad quality insights. To address this issue, we propose a scalable automatic approach for extracting, validating and generating...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2
    Concept Extractor

    Concept Extractor

    Concept Extractor

    CXTRACTOR is a generic concept extractor that integrates state-of-the-art term/concept extraction methods. cxtractor is designed to be easy to run from a command line. Example $java -jar cxtractor-xx.jar "Sertoli-Leydig cell tumor is a cancer that starts in the female ovaries. The cancer cells produce and release a male sex hormone..." Sertoli-Leydig cell tumor (C0206723, Sertoli-Leydig Cell Tumor) is a cancer (C0027651, Neoplasms) that starts in the female (C0015780, Female) ovaries (C0029939, Ovary). ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    OpenSearchServer Extractor

    OpenSearchServer Extractor

    A RESTFul/JSON Web Service for text and metata extraction

    An open source RESTFul Web Service for text , meta-data extraction and analysis. oss-text-extractor supports various binary formats: Word processor (doc, docx, odt, rtf) Spreadsheet (xls, xlsx, ods) Presentation (ppt, pptx, odp) Publishing (pdf, pub) Web (rss, html/xhtml) Medias (audio, images) Others (vsd, text)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    ...The FOSSology Project is a Free Open Source Software (FOSS) project built around an open and modular architecture for analyzing software for open source software governance. Existing modules include license scanning, copyright and user identification, license classification and meta data extraction.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5

    STELLA Data Reduction Pipeline

    The STELLA Data-Reduction Pipeline for all kinds of Spectra

    ...If you use any of these programs to reduce data for a publication you must cite the paper "A Fast and Portable Reimplementation of Piskunov and Valenti's Optimal Extraction Algorithm with improved Cosmic Ray Removal and Optimal Sky Subtraction" by A. Ritter, E. A. Hyde, and Q. A. Parker, published in PASP 126, February 2014.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    ezSIFT

    An easy-to-use standalone SIFT library written in C/C++

    *************************************************************************** Updated 06/28/2018 The ezSift project has moved to https://github.com/robertwgh/ezSIFT *************************************************************************** The SIFT (scale-invariant feature transform) algorithm is considered to be one of the most robust local feature detector and description methods. Most of the open-source SIFT implementations rely on some 3rd-party libraries. Some of them even rely...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    igafem

    igafem

    Open source 3D Matlab Isogeometric Analysis Code

    Isogeometric analysis (IGA) is a fundamental step forward in computational mechanics that offers the possibility of integrating methods for analysis into Computer Aided Design (CAD) tools and vice versa. The benefits of such an approach are evident, since the time taken from design to analysis is greatly reduced leading to large savings in cost and time for industry. The tight coupling of CAD and analysis within IGA requires knowledge from both fields and it is one of the goals of the...
    Downloads: 8 This Week
    Last Update:
    See Project
  • 8

    SyntheticWSI

    Tools to generate and visualize artificial whole slide images

    ...Collection of tools to help generate artificial Whole Slide Images (WSIs). A WSI is stored as a ZIP archive of JPG tiles, and this software contains a tool to visualize this format. SVS files can be used directly for texture extraction (thanks to the included Bio-Formats library). Main source files in package fr.unistra.wsi.synthetic.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    DCTFinder

    DCTFinder

    Extract title and creation time from web page.

    ...DCTFinder is a system that parses a web page and extracts from its content the title and the creation date of this web page. DCTFinder combines heuristic title detection, supervised learning with Conditional Random Fields (CRFs) for document date extraction, and rule-based creation time recognition. DCTFinder is released under CeCILL free software license agreement. The system is described in the following paper (see 'Files' section): Xavier Tannier. "Extracting News Web Page Creation Time with DCTFinder". Proceedings of the 9th Language Resources and Evaluation Conference. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Custom VMs From 1 to 96 vCPUs With 99.95% Uptime Icon
    Custom VMs From 1 to 96 vCPUs With 99.95% Uptime

    General-purpose, compute-optimized, or GPU/TPU-accelerated. Built to your exact specs.

    Live migration and automatic failover keep workloads online through maintenance. One free e2-micro VM every month.
    Try Free
  • 10
    cdr-view

    cdr-view

    Webface witch extract CDR (Call Detail Records) from MySQL base.

    Webface witch extract CDR (Call Detail Records ) directly from MySQL base. May be used as SOHO billing system.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    Simple general-purpose metadata extraction API with support for popular multimedia metadata formats such as EXIF and ID3.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    Freya Desktop Tools

    Small utilities for the unix desktop, based on the Enlightenment libs.

    ...The tools are developed using the Enlightenment Foundation Libraries, but should work outside of Enlightenment well too. The first available programm is a beta release of said file extraction tool, Sif, which in itself is a simple frontend for atool.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    EplSite ETL

    ETL Based on Perl With WEB Interface

    EplSite ETL is a tool to do easy the data migrations, doing extraction, transformation, validation and load in a very fast way. It was built by people involved in data migrations so, it contains the necessary to do the migration(Extract Transformation, validation and load) and do it well.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    CUbRIK

    CUbRIK

    Human-enhanced time-aware multimedia search

    The CUbRIK project provides a modular framework and distributed system architecture for flexible design and implementation of multimedia search applications. The framework supports hybrid workflows that combines automatic computation with CROWD-enabled and GWAP-enabled human computation.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15

    CSV*Extractor PRO (command line)

    Extract your scalar data from major relational databases.

    Windows command line tool for data extraction in CSV format. Supports 14 major databases: DB2 Advanced Enterprise Server DB2 Advanced Workgroup Server DB2 Developer Edition DB2 Enterprise Server DB2 Express DB2 Express C DB2 Workgroup Server Exadata Infobright Informix IDS Informix Innovator C MariaDB MySQL Oracle Oracle XE PostgreSQL SAP Sybase ASE SQL Lite SQL Server Enterprise SQL Server Express Sybase IQ Sybase SQL Anywhere TimesTen
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16

    MITIE

    Free and state-of-the-art information extraction tools

    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    CMIS Input plugin for Pentaho

    CMIS Input plugin for Pentaho

    Allows querying Content Management Systems that use the CMIS.

    ...Imagine using the information extracted for statistical purposes, for creating reports and, more generally, to analyse your document archives in a way unthinkable until now with the current tools available. All this is possible within the Pentaho Suite, the Open Source Business Intelligence platform, which is useful to the extraction and analysis of structured and semi-structured data. With this goal (the extraction and analysis of data) has been designed and developed the CMIS Input plugin for Pentaho Data Integration (Kettle) that allows querying Content Management Systems that use the CMIS interoperability standard. The data, once extracted, can be stored and analyzed and perhaps presented in customized reports be published in various formats for the end user (PDF, Excel, etc..).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    TextHunter

    User friendly toolkit to extract structured data from free text.

    ...To address this, TextHunter provides two key tools: - An efficient annotation interface to help rapidly code large volumes of documents - A means to automatically generate information extraction algorithms, so that manual review of the entire document set is unnecessary Designed to run on desktop hardware, TextHunter is already being used in the UK Mental Health Biomedical Research centre (http://brc.slam.nhs.uk/) to speed research in several major projects. Fast, powerful and free, TextHunter is a general purpose information extraction tool, suitable for any domain.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    EODG

    EODG is a web application to upload/download files from a local folder

    The EO Data Gateway application provide web access to a local EO repository both for upload and download of Earth Observation data products. EODG has been designed to be a simple input/output gateway to the ESA Earth Observation data stored in the ESA Grid-Processing On Demand systems. Anyway, it can be used for sharing any other of files. EODG provides per user control (via groups) on files download and uploads (number of concurrent downloads, number of files downloaded per month,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Xtreme Media Player

    Xtreme Media Player

    Xtreme Media Player is a free cross-platform media player.

    ...It provides the user with a graphical interface for choosing music files and playlists - and includes support for many audio file formats including .spx, .snd, .aifc, .aif, .wav, .au, .flac, .mp1, .mp2, .mp3, .ogg, .aac, and .m4a. The code is very modular to allow the straightforward extraction of core modules (such as the audio engine, the FFT analysis, the playlist management module, and several visualizations) to use in other projects as external libraries. A key feature of XtremeMP is the capability to view visualizations (on-screen graphics controlled by the music’s audio). These can have scientific/technical purposes such as depicting some properties of the audio (such as the Oscilloscope, Spectrum, Stereogram, and Spectrogram visualizations).
    Downloads: 3 This Week
    Last Update:
    See Project
  • 21
    This computer-aided diagnosis (CAD) software (MATLAB toolbox) has been developed for automated prediction of tuberculosis (TB) from chest X-ray (CXRs) of patients. This toolbox was developed by incorporating more diversified global features extraction methods such as Gist and PHOG. It is effective in discriminating between CXR(s) of non-TB and TB patients. It contains two modules: Training and Prediction Modules. The latter (Prediction Module) predicts input digital CXR(s) image as TB or non-TB so it will useful for general user. The former module (Training module) enables user to develop a model trained on his/her own TB and nonTB chest radiographs. ...
    Downloads: 6 This Week
    Last Update:
    See Project
  • 22
    musicinformationretrieval.com

    musicinformationretrieval.com

    Instructional notebooks on music information retrieval

    musicinformationretrieval.com is a collection of instructional materials for music information retrieval (MIR). These materials contain a mix of casual conversation, technical discussion, and Python code. These pages, including the one you're reading, are authored using Colab notebooks.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Row-Bean

    Row-Bean

    CSV reader writer - bean mapping - easy bean extraction from CSV file

    Row-Bean is a CSV-Bean JAVA API . Row-Bean provides CSV reader an writer. More ever provides a mechanism to map csv file content to java beans and revers. For each use, a XML description must describe the wished mapping. Another possibility consists in use Annotations. Use under maven : <!-- row bean with annotations...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    The National Library of New Zealand's Metadata Extraction Tool automatically extracts preservation-related metadata from digital files, then output that metadata in XML formats. It can be used through a graphical user interface or command-line interface. Please take the latest code from 'https://github.com/DIA-NZ/Metadata-Extraction-Tool.git'. The code on source forge will not be updated henceforth as it is moved to github.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 25

    pyll

    Set of scripts to help on package deploy on WebMethods Integration Srv

    Pyll can be described by a set of tools developed to help on package deployment on a growing environment of 20+ Integration Servers. Actually pyll can deploy over than 100+ packages for 20+ Integration servers under 2/3 hours ( update mode ). Pyll works for WebMethods Integration Server version 7.x and 8.x.
    Downloads: 0 This Week
    Last Update:
    See Project