Showing 32 open source projects for "data processing"

View related business solutions
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 1

    MarDRe

    MapReduce-based tool to remove duplicate DNA reads

    MarDRe is a de novo MapReduce-based parallel tool to remove duplicate and near-duplicate DNA reads through the clustering of single-end and paired-end sequences from FASTQ/FASTA datasets. This tool allows bioinformatics to avoid the analysis of not necessary reads, reducing the time of subsequent procedures with the dataset. MarDRe is the Big Data counterpart of ParDRe (link above), which employs HPC technologies (i.e., hybrid MPI/multithreading) to reduce runtime on multicore systems. Instead, MarDRe takes advantage of the MapReduce programming model to significantly improve ParDRe performance on distributed systems, especially on cloud-based infrastructures. Written in pure Java to maximize cross-platform compatibility, MarDRe is built upon the open-source Apache Hadoop project, the most popular distributed computing framework for Big Data processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    HSRA

    Hadoop spliced read aligner for RNA-seq data

    ...This tool allows bioinformatics researchers to efficiently distribute their mapping tasks over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, which is a distributed computing framework for scalable Big Data processing. HSRA currently supports single-end and paired-end read alignments from FASTQ/FASTA datasets. Moreover, our tool uses the Hadoop Sequence Parser (HSP) library (link above) to efficiently read the input datasets stored on the Hadoop Distributed File System (HDFS), being able to process datasets compressed with Gzip and BZip2 codecs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    High-Throughput Tabular Data Processor
    ...HTDP can also import, process and convert Variant Call Format (VCF) files ver. 4.0, 4.1 and 4.2 (http://samtools.github.io/hts-specs/VCFv4.2.pdf). HTDP provides quick filtering functionality and can process data consisting of single or multiple input files. Citation: Madanecki P, Bałut M, Buckley PG, Ochocka JR, Bartoszewski R, Crossman DK, et al. (2018) High-Throughput Tabular Data Processor – Platform independent graphical tool for processing large data sets. PLoS ONE 13(2): e0192858. https://doi.org/10.1371/journal.pone.0192858
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4
    Protein Microarray Analyser

    Protein Microarray Analyser

    Protein microarray data processing and normalization

    The Protein Microarray Analyser software presented here includes the following tools: (1) neighbourhood background correction, (2) net intensity correction, (3) user-defined noise threshold, (4) user-defined CV threshold amongst replicates and (5) assay controls, (6) composite ‘pin-to-pin’ normalization amongst sub-arrays, and (7) ‘array-to-array’ normalization amongst whole arrays.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 5

    MOIRAI

    Simple Scientific Workflow System for CAGE Analysis

    ...After mapping, a CAGE peak on the genome indicates the position of an active transcriptional start site (TSS) and the number of reads correspond to its expression level. CAGE is prominently used in both the FANTOM and ENCODE project. MOIRAI is a compact yet flexible workflow system designed to carry out the main steps in data processing and analysis of CAGE data. MOIRAI has a graphical interface allowing wet-lab researchers to create, modify and run analysis workflows. Embedded within the workflows are graphical quality control indicators allowing users assess data quality and to quickly spot potential problems. MOIRAI package comes with three main workflows allowing users to map, annotate and perform an expression analysis over multiple samples.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. ...
    Leader badge
    Downloads: 10 This Week
    Last Update:
    See Project
  • 7
    Visualization of Protein-Ligand Graphs

    Visualization of Protein-Ligand Graphs

    Compute protein graphs. Moved to https://github.com/MolBIFFM/PTGLtools

    NOTE: Project moved to https://github.com/MolBIFFM/PTGLtools. The Visualization of Protein-Ligand Graphs (VPLG) software package computes and visualizes protein graphs. It works on the super-secondary structure level and uses the atom coordinates from PDB files and the SSE assignments of the DSSP algorithm. VPLG is command line software. If you do not like typing commands, try our PTGL web server: http://ptgl.uni-frankfurt.de/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8
    IAP.G2P

    IAP.G2P

    IAP - the Integrated Analysis Pipeline

    The Integrated Analysis Platform (IAP) has been designed and developed to support the analysis of large-scale image data sets of different camera systems. It aims in bridging different data domains and in integrating different approaches to data analysis and post-processing.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 9
    Microscopy Image Processing Package
    XMIPP is an image processing suite for 3D-reconstruction of biological specimens from transmission electron microscopy data. NOTE that stable releases (including binaries) are longer stored at Sourceforge, but are available at http://xmipp.cnb.csic.es
    Downloads: 6 This Week
    Last Update:
    See Project
  • Add Two Lines of Code. Get Full APM. Icon
    Add Two Lines of Code. Get Full APM.

    AppSignal installs in minutes and auto-configures dashboards, alerts, and error tracking.

    Works out of the box for Rails, Django, Express, Phoenix, and more. Monitoring exceptions and performance in no time.
    Start Free
  • 10
    aisconvert
    Toolkit for processing genetic data. Currently supports (command-line): Half-IBD (Identity by descent) aka HIRs - between 2 or any number of files (in distances and cM); RAW2PED, PED2RAW conversions; regions of homozygousity and other converters.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 11
    XMLPipeDB is a suite of tools for building relational databases from XML sources with minimal manual processing of the data. While the applicability is general, our motivation was to facilitate the management of biological data from different sources.
    Downloads: 3 This Week
    Last Update:
    See Project
  • 12
    scipion-xmipp

    scipion-xmipp

    Image processing framework to integrate EM software packages.

    Scipion is an image processing framework to obtain 3D models of macromolecular complexes using Electron Microscopy (3DEM). It integrates several software packages and presents an unified interface for both biologists and developers. Scipion allows to execute workflows combining different software tools, while taking care of formats and conversions. Additionally, all steps are tracked and can be reproduced later on. Xmipp is a well-known package in the EM image processing. It is integrated...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    MZmine 2

    MZmine 2

    A framework for differential analysis of mass spectrometry data

    MZmine 2 is a framework written in Java for differential analysis of mass spectrometry data.
    Leader badge
    Downloads: 28 This Week
    Last Update:
    See Project
  • 14

    iMir

    Integrated pipeline for HT miRNA-Seq data analysis

    Processing of smallRNA-Seq data to gather biologically relevant information requires application of multiple statistical and bioinformatics tools from different sources, each focusing on a specific step of the analysis pipeline. The analytical workflow can be challenging for the continuous interventions by the operator, a critical factor when large numbers of datasets need to be analyzed at once.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    xAffect

    xAffect

    Sensor Middleware

    xAffect is a software framework for online recording, processing and storage of multi sensor data. It was developed at FZI Research Center for Information Technology, Karlsruhe, Germany. xAffect can be used as a flexible, customizable middleware between physiological sensors and applications which require online feature computation (e.g. biofeedback or affective computing). Moreover, it can also be used as a standalone application for data acquisition and visualization.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Java Machine Learning Library is a library of machine learning algorithms and related datasets. Machine learning techniques include: clustering, classification, feature selection, regression, data pre-processing, ensemble learning, voting, ...
    Downloads: 11 This Week
    Last Update:
    See Project
  • 17

    IQuant

    A pipeline for quantitative proteomics based upon isobaric tags

    IQuant is an automated pipeline for quantitative proteomics based upon isobaric tags. It integrates post-processing tool of protein identification and advanced statistical algorithms to process the MS/MS signals generated from the peptides labeled by isobaric tags for quantification. IQuant can run from a graphical user interface (GUI) as well as a command-line interface and work with both Windows and Linux system. This website contains the IQuant software, an example data labeled by iTRAQ-8plex for testing and a user's manual. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18

    BioDare

    BioDare is Biological Data Repository focused on timeseries data

    BioDare (Biological Data Repository) was developed under the multi-site ROBuST project (http://hallidaylab.bio.ed.ac.uk/ROBuST.html) to support data exchange inside the project. It is a web application which allows data-sharing (including public dissemination), data-processing and analysis, with the main focus on time-series data produced in circadian experiments.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    iBRAIN2 Workflow Manager

    A system for automated analysis and data handling for RNAi screens

    The iBRAIN2 software system for RNAi high-content screening integrates automated analysis and customizable data management. It enables robust and complex parallel processing on computer cluster infrastructure and allows for reliable storage of primary and resulting data sets.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    IDL specified API for manipulating and processing CellML 1.0 and 1.1. Includes C++ implementation. Accessible from a C++ program, or from any language for which a CORBA language mapping is available Also, a Java wrapper of the API is available.
    Leader badge
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    CallSim

    CallSim

    low-volume read processing base corrector

    CallSim is a software application that provides evidence for the validity of base calls believed to be sequencing errors and it is applicable to Ion Torrent and 454 data. CallSim provides a final classification or rescue of a base/indel in reads, where putative variants have been identified via typical SNP/indel workflows. plots rendered by: JFreeChart library http://sourceforge.net/projects/jfreechart/
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    SemaRule Navigator
    SemaRule Navigator is an Integrated Suite of Open-Source and Free-License Software, placing Semantic and Text Analysis Technologies in the toolbox of Researchers, Students, and Enterprises.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    This is a Java-based project for complex event extraction from text and co-reference resolution. Currently the code can read BioNLP shared task format (http://2011.bionlp-st.org/) and i2b2 Natural Language Processing for Clinical Data shared task format (https://www.i2b2.org/NLP/DataSets/Main.php). Event extraction includes finding events and the parameters for an event in a text. The method is based on SVM but other ML algorithms can be adopted. The method details are explained in the following paper: Ehsan Emadzadeh, Azadeh Nikfarjam, and Graciela Gonzalez. 2011. ...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    clinicalStudyTracker

    Clinical study tracker tracks case study members through activities.

    Developed at Medical Research Council UK to meet a common clinical trial need to track various types of clinical data. Users log dates of completed tasks against participant ID. Simple, useable, reduces lost data! Makes graphical progress reports.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    Knime (http://www.knime.org) nodes for sequence bioinformatics. Sequime is an eclipse plug-in for the KNIME data mining platform, providing additional nodes for reading, processing and visualizing sequence information.
    Downloads: 0 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • Next
MongoDB Logo MongoDB