Showing 106 open source projects for "data processing"

View related business solutions
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • Gemini 3 and 200+ AI Models on One Platform Icon
    Gemini 3 and 200+ AI Models on One Platform

    Access Google's best plus Claude, Llama, and Gemma. Fine-tune and deploy from one console.

    Build generative AI apps with Vertex AI. Switch between models without switching platforms.
    Start Free
  • 1
    Free DELTA

    Free DELTA

    Software tools for processing taxonomic descriptions in DELTA format

    The Free DELTA Project is a free, open-source, software alternative for a generic system to the processing of taxonomic descriptions based on the DELTA (DEscription Language for TAxonomy) format.
    Leader badge
    Downloads: 546 This Week
    Last Update:
    See Project
  • 2
    BioXTAS RAW

    BioXTAS RAW

    Processing and analysis of Small Angle X-ray Scattering (SAXS) data.

    BioXTAS RAW is a program for analysis of Small-Angle X-ray Scattering (SAXS) data. The software enables: creation of 1D scattering profiles from 2D detector images, standard data operations such as averaging and subtraction, analysis of radius of gyration (Rg) and molecular weight, and advanced analysis using GNOM and DAMMIF as well as electron density reconstructions using DENSS. It also allows easy processing of inline SEC-SAXS data and data deconvolution using the evolving factor analysis (EFA) or the regularized alternating least squares (REGALS) methods. ...
    Leader badge
    Downloads: 99 This Week
    Last Update:
    See Project
  • 3
    PANDA

    PANDA

    A comprehensive and flexible quantification tool for proteomics data

    ...On the levels of spectra, peptides and proteins, PANDA works out a few quantitative filters and new scores for quantification confidence. Third, PANDA is designed for processing proteomics big data in parallel.
    Downloads: 20 This Week
    Last Update:
    See Project
  • 4

    Subread

    High-performance read alignment, quantification and mutation discovery

    The Subread software package is a tool kit for processing next-gen sequencing data. It includes Subread aligner, Subjunc exon-exon junction detector and featureCounts read summarization program. Subread aligner can be used to align both gDNA-seq and RNA-seq reads. Subjunc aligner was specified designed for the detection of exon-exon junction. For the mapping of RNA-seq reads, Subread performs local alignments and Subjunc performs global alignments.
    Leader badge
    Downloads: 1,435 This Week
    Last Update:
    See Project
  • AI-generated apps that pass security review Icon
    AI-generated apps that pass security review

    Stop waiting on engineering. Build production-ready internal tools with AI—on your company data, in your cloud.

    Retool lets you generate dashboards, admin panels, and workflows directly on your data. Type something like “Build me a revenue dashboard on my Stripe data” and get a working app with security, permissions, and compliance built in from day one. Whether on our cloud or self-hosted, create the internal software your team needs without compromising enterprise standards or control.
    Try Retool free
  • 5
    ReViMS

    ReViMS

    ReViMS, a 3D volume rendering tool for light-sheet/confocal microscopy

    Reconstruction and Visualization from Multiple Sections (ReViMS), an open-source, user-friendly software for automatically estimating volume and several other features of 3D multicellular aggregates (i.e., cancer spheroid, zebrafish, fruit fly). ReViMS requires a z-stack of 2D binary masks, obtained by segmenting a sequence of fluorescent images acquired by scanning the aggregate along the z axis, using a confocal or a light-sheet fluorescent microscope. It provides a number of tools...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    GromacsProSuite

    Graphical User Interface for Gromacs

    This tool is an integrated graphical interface that simplifies molecular dynamics simulations using Gromacs. It provides a structured, tab-based environment to set up, execute, and analyze simulations data without complex command-line operations. The software automates tasks such as topology generation, solvation, ion addition, minimization, equilibration, and production runs while executing GROMACS commands in the background. Built-in monitoring tracks CPU, RAM, and disk usage to ensure stable performance during parallel processing. ...
    Leader badge
    Downloads: 13 This Week
    Last Update:
    See Project
  • 7

    DataPrep

    Python-based data preprocessing tool

    DataPrep v0.2 is a Tkinter-based GUI application/tool designed to assist users in data preprocessing, multicollinearity removal, and feature selection for a wide range of applications in Cheminformatics, Bioinformatics, Data Analysis, Feature Selection, Molecular Modeling, Machine Learning, and Quantitative-structure-property relationship (QSPR) studies. It includes functionality to load, process, and save datasets with support for different preprocessing & multicollinearity removal...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    miRge3

    Comprehensive analysis of small RNA sequencing data

    An update to Python package to perform comprehensive analysis of small RNA sequencing data, including miRNA annotation, A-to-I editing, novel miRNA detection, isomiR analysis, visualization through IGV, processing Unique Molecular Identifieres (UMI), tRF detection and producing interactive graphical output. miRge3.0 is developed in python v3.8 and is a recent update of our previous version miRge2.0. This build includes command line interface (CLI) and cross-platform Graphical User Interface (GUI). ...
    Downloads: 5 This Week
    Last Update:
    See Project
  • 9
    MAF

    MAF

    Bash Framework to combine many preprocessing and alignment tools

    ...The platform enables researcher to apply multiple sequence aligners and further pre- and postprocessing tools in flexible combinations to analyze the subtle to remarkable differences between different sequence alignment and processing algorithms applied to the same data basis. Cite the following publication: https://doi.org/10.3390/mps8030065
    Downloads: 0 This Week
    Last Update:
    See Project
  • Try Google Cloud Risk-Free With $300 in Credit Icon
    Try Google Cloud Risk-Free With $300 in Credit

    No hidden charges. No surprise bills. Cancel anytime.

    Use your credit across every product. Compute, storage, AI, analytics. When it runs out, 20+ products stay free. You only pay when you choose to.
    Start Free
  • 10
    dataMAPPs

    dataMAPPs

    R based pipeline for MHC-associated peptide proteomics (MAPPs) data

    dataMAPPs allows routine and efficient processing of data from immunogenicity studies applying the MAPPs peptidomics technology to detect potential MHCI- or MHC-II epitopes as presented by dendritic cells (DC). It features quality control of the raw data, across-sample/across-donor normalization and visualization of results in a heatmap style (heatMAPPs). dataMAPPs' core is a generic R library that can be tailored to specific projects via dedicated control scripts which also allow reproducible recalculation of results. ...
    Downloads: 1 This Week
    Last Update:
    See Project
  • 11
    An open source framework for LC-MS based proteomics and metabolomics. OpenMS offers data structures and algorithms for the processing of mass spectrometry data. The library is written in C++. Our source code and wiki lives on GitHub (https://github.com/OpenMS/OpenMS).
    Downloads: 8 This Week
    Last Update:
    See Project
  • 12

    MarDRe

    MapReduce-based tool to remove duplicate DNA reads

    MarDRe is a de novo MapReduce-based parallel tool to remove duplicate and near-duplicate DNA reads through the clustering of single-end and paired-end sequences from FASTQ/FASTA datasets. This tool allows bioinformatics to avoid the analysis of not necessary reads, reducing the time of subsequent procedures with the dataset. MarDRe is the Big Data counterpart of ParDRe (link above), which employs HPC technologies (i.e., hybrid MPI/multithreading) to reduce runtime on multicore systems. Instead, MarDRe takes advantage of the MapReduce programming model to significantly improve ParDRe performance on distributed systems, especially on cloud-based infrastructures. Written in pure Java to maximize cross-platform compatibility, MarDRe is built upon the open-source Apache Hadoop project, the most popular distributed computing framework for Big Data processing.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13

    HSRA

    Hadoop spliced read aligner for RNA-seq data

    ...This tool allows bioinformatics researchers to efficiently distribute their mapping tasks over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, which is a distributed computing framework for scalable Big Data processing. HSRA currently supports single-end and paired-end read alignments from FASTQ/FASTA datasets. Moreover, our tool uses the Hadoop Sequence Parser (HSP) library (link above) to efficiently read the input datasets stored on the Hadoop Distributed File System (HDFS), being able to process datasets compressed with Gzip and BZip2 codecs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    High-Throughput Tabular Data Processor
    ...HTDP can also import, process and convert Variant Call Format (VCF) files ver. 4.0, 4.1 and 4.2 (http://samtools.github.io/hts-specs/VCFv4.2.pdf). HTDP provides quick filtering functionality and can process data consisting of single or multiple input files. Citation: Madanecki P, Bałut M, Buckley PG, Ochocka JR, Bartoszewski R, Crossman DK, et al. (2018) High-Throughput Tabular Data Processor – Platform independent graphical tool for processing large data sets. PLoS ONE 13(2): e0192858. https://doi.org/10.1371/journal.pone.0192858
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    CSBB-v3.0

    CSBB-v3.0

    CSBB - Computational Suite for Bioinformaticians and Biologists

    CSBB is a command line-based bioinformatics suite to analyze biological data acquired through varied avenues of biological experiments. CSBB is implemented in Perl, while it also leverages the use of R, java, python and ruby in background for specific modules. Major focus of CSBB is to allow users from biology and bioinformatics community, to get benefited by performing down-stream analysis tasks while eliminating the need to write programming code. CSBB is currently available on Linux,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    HCS Analyzer

    HCS Analyzer

    High Content Screening Analyzer

    HCS Analyzer is an open source software dedicated to High Content Screening data processing and analysis.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 17
    Protein Microarray Analyser

    Protein Microarray Analyser

    Protein microarray data processing and normalization

    The Protein Microarray Analyser software presented here includes the following tools: (1) neighbourhood background correction, (2) net intensity correction, (3) user-defined noise threshold, (4) user-defined CV threshold amongst replicates and (5) assay controls, (6) composite ‘pin-to-pin’ normalization amongst sub-arrays, and (7) ‘array-to-array’ normalization amongst whole arrays.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 18
    CognitionMaster

    CognitionMaster

    object-based image analysis framework

    Wienert S, Heim D, Kotani M, Lindequist B, Stenzinger A, Ishii M, Hufnagl P, Beil M, Dietel M, Denkert C, Klauschen F. CognitionMaster: an object-based image analysis framework. Diagn Pathol 2013, 8:34
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19

    irayMol

    Molecular visualization

    Analyses and Interactive visualizations of the structures, functions and actions of biomolecules including molecular surface computation, and protein-ligand interface and protein-ligand docking where the ligand could a small compound, a nucleic acid, membrane and other proteins, written in C++/Qt/openGL/GLSL with more than 125,000 lines of codes. In addition to Qt, the only external library needed is GSL everything else is coded from the scratch.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    datasw

    datasw

    datasw, a tool for rapid processing of HPLC-SAXS data.

    Small-angle X-ray scattering (SAXS) in solution is a common low-resolution method which can efficiently complement high-resolution information obtained by crystallography or NMR. Sample monodispersity is key to reliable SAXS data interpretation and model building. Beamline setups with inline high-performance liquid chromatography (HPLC) are particularly useful for accurate profiling of heterogeneous samples. The program DATASW performs averaging of individual data frames from HPLC-SAXS...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21

    MOIRAI

    Simple Scientific Workflow System for CAGE Analysis

    ...After mapping, a CAGE peak on the genome indicates the position of an active transcriptional start site (TSS) and the number of reads correspond to its expression level. CAGE is prominently used in both the FANTOM and ENCODE project. MOIRAI is a compact yet flexible workflow system designed to carry out the main steps in data processing and analysis of CAGE data. MOIRAI has a graphical interface allowing wet-lab researchers to create, modify and run analysis workflows. Embedded within the workflows are graphical quality control indicators allowing users assess data quality and to quickly spot potential problems. MOIRAI package comes with three main workflows allowing users to map, annotate and perform an expression analysis over multiple samples.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    Vappio is a framework for building virtual appliances that supports distributed data processing in cloud computing environments using Sun Grid Engine or Hadoop. The primary target application of Vappio is bioinformatics.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    Identifies rRNA, contaminants, trims in a standard fashion etc. Maintains read pairs. Built for fire-and-forget high throughput projects (terabytes of data). Uses pbzip2, bowtie2, fastx_toolkit, samtools, fastqc, Trimmomatic (optional)
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24

    P3BSseq

    Parallel processing pipeline for analysis of bisulfite sequencing data

    Bisulfite sequencing (BSseq) processing is among the most cumbersome next generation sequencing (NGS) applications. Though some BSseq processing tools are available, they are scattered, require puzzling parameters and are running-time and memory-usage demanding. We have developed P3BSseq, a parallel processing pipeline for fast, accurate and automatic analysis of BSseq reads that trims, aligns, annotates, records the intermediate results, performs bisulfite conversion quality assessment,...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. ...
    Leader badge
    Downloads: 11 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB