Showing 365 open source projects for "data"

  • 1
    IntEnz is the name for the Integrated relational Enzyme database. IntEnz contains data curated and approved by the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology (NC-IUBMB).
    Downloads: 1 This Week
  • 2
    NOVA

    Analysis and visualization of complexome profiling data.

    ...Many additional functions like zooming, searching for proteins, image export, and automatic file format recognition support intuitive handling for biologists. Giese, et al. NOVA: a software to analyze complexome profiling data. Bioinformatics, 2015, 31(3): 440-441
    Downloads: 3 This Week
  • 3
    BioModels Database is a data resource that allows biologists to store, search and retrieve published mathematical models of biological interest. Models presented are annotated and linked to relevant data resources and are available in various formats.
    Downloads: 0 This Week
  • 4
    Mass-Up

    MALDI-TOF data analysis tool

    Mass-Up is an open-source mass spectrometry utility for proteomics, designed to support the preprocessing and analysis of MALDI-TOF mass spectrometry data. It includes several tools and operations to load, preprocess and analyze MALDI-TOF data.
    Downloads: 6 This Week
  • 5
    PPIXpress

    specific protein interaction networks from transcript expression

    ...Since a simple reduction of the networks to the subset of expressed genes only scratches the surface of higher organisms’ regulatory capabilities, we propose the advanced method PPIXpress, which exploits expression data at the transcript level and can thus also reveal alterations in protein connectivity caused by alternative splicing. The original publication can be found at https://bioinformatics.oxfordjournals.org/content/32/4/571 .
    Downloads: 0 This Week
  • 6

    PTESFinder

    Post-Transcriptional Exon Shuffling (PTES) Identification Pipeline

    PTESFinder is a computational pipeline for identifying Post-transcriptional Exon Shuffling events from high-throughput RNAseq data. PTESFinder leverages the power of established RNASeq tools and systematically excludes all known classes of false positive structures by applying stringent filters designed to specifically target these false positives. PTESFinder compares alignment qualities of reads mapping to putative PTES structures with qualities of the same reads when mapped to genomic regions and canonically spliced transcripts. ...
    Downloads: 0 This Week
  • 7
    CompleXChange

    differential analysis of combinatorial protein complexes

    The increasing wealth of transcriptomic data and current computational tools make it possible to infer how protein interactomes and complexomes may be assembled in specific samples. With CompleXChange, this information can be exploited to conduct quantitative differential analyses of the dynamic protein complexome. The corresponding publication can be found at https://doi.org/10.1186/s12859-019-2852-z.
    Downloads: 0 This Week
  • 8

    MarDRe

    MapReduce-based tool to remove duplicate DNA reads

    MarDRe is a de novo MapReduce-based parallel tool to remove duplicate and near-duplicate DNA reads through the clustering of single-end and paired-end sequences from FASTQ/FASTA datasets. This tool allows bioinformaticians to avoid analyzing unnecessary reads, reducing the time of subsequent procedures with the dataset. MarDRe is the Big Data counterpart of ParDRe (link above), which employs HPC technologies (i.e., hybrid MPI/multithreading) to reduce runtime on multicore systems. MarDRe instead takes advantage of the MapReduce programming model to significantly improve on ParDRe's performance on distributed systems, especially on cloud-based infrastructures. Written in pure Java to maximize cross-platform compatibility, MarDRe is built upon the open-source Apache Hadoop project, the most popular distributed computing framework for Big Data processing.
    Downloads: 0 This Week
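As a rough illustration of the idea behind the MarDRe entry (clustering reads, then dropping duplicates and near-duplicates), here is a minimal single-machine sketch in Python. It is not MarDRe's code or its actual algorithm: the function names, the prefix-based bucketing, and the Hamming-distance threshold are all assumptions chosen for illustration.

```python
from collections import defaultdict


def hamming(a, b):
    """Count mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def remove_duplicates(reads, prefix_len=4, max_mismatches=1):
    """Drop exact and near-duplicate reads (hypothetical helper, not MarDRe's API)."""
    # "Map" step: key every read by its prefix so that candidate
    # duplicates end up in the same bucket.
    buckets = defaultdict(list)
    for read in reads:
        buckets[read[:prefix_len]].append(read)

    # "Reduce" step: inside each bucket, keep a read only if it is not
    # within max_mismatches of a read that was already kept.
    kept = []
    for prefix in sorted(buckets):
        representatives = []
        for read in buckets[prefix]:
            is_duplicate = any(
                len(read) == len(rep) and hamming(read, rep) <= max_mismatches
                for rep in representatives
            )
            if not is_duplicate:
                representatives.append(read)
        kept.extend(representatives)
    return kept


reads = ["ACGTACGT", "ACGTACGT", "ACGTACGA", "TTTTACGT", "ACGTAAAA"]
unique_reads = remove_duplicates(reads)
```

In a real MapReduce job the bucketing would be done by the framework's shuffle phase and each bucket would be reduced on a different node; the sketch only mirrors that split in two loops.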
  • 9

    HSRA

    Hadoop spliced read aligner for RNA-seq data

    ...This tool allows bioinformatics researchers to efficiently distribute their mapping tasks over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, which is a distributed computing framework for scalable Big Data processing. HSRA currently supports single-end and paired-end read alignments from FASTQ/FASTA datasets. Moreover, our tool uses the Hadoop Sequence Parser (HSP) library (link above) to efficiently read the input datasets stored on the Hadoop Distributed File System (HDFS), being able to process datasets compressed with Gzip and BZip2 codecs.
    Downloads: 0 This Week
  • 10
    GMOL

    A tool for 3D genome structure visualization

    ...It allows users to view the genome structure at multiple scales, including: global, chromosome, loci, fiber, nucleosome, and nucleotide. This software was built upon the pre-existing Jmol package by Prof. Cheng's group. The software is developed in Prof. Jianlin Cheng's Bioinformatics, Data Mining and Machine Learning Laboratory in the Computer Science Department at the University of Missouri - Columbia, USA. The project is supported by the National Science Foundation (grant no. DBI1149224). If you use GMOL in your research, please cite: Nowotny, Jackson, Avery Wells, Oluwatosin Oluwadare, Lingfei Xu, Renzhi Cao, Tuan Trieu, Chenfeng He, and Jianlin Cheng. ...
    Downloads: 7 This Week
  • 11

    mirplant

    miRPlant: An Integrated Tool for Identification of Plant miRNA

    Please cite: An J, Lai J, Sajjanhar A, Lehman ML, Nelson CC: miRPlant: an integrated tool for identification of plant miRNA from RNA sequencing data. BMC Bioinformatics 2014, 15(1):275. We will create an index for you if you tell us which plants you are interested in (j.an@qut.edu.au).
    Downloads: 0 This Week
  • 12
    seqMINER
    A genome-wide mapping data interpretation platform for NGS (ChIP-seq). A tutorial can be found at: http://genomeast.igbmc.fr/wiki/doku.php?id=training:seqminer
    Downloads: 1 This Week
  • 13
    OpenChrom
    OpenChrom is a tool for gas chromatography and mass spectrometry. The focus is to handle data files from different GC/MS and GC/FID systems and vendors. Its functionality and algorithms can be extended using a flexible plugin approach, based on Eclipse RCP.
    Downloads: 48 This Week
  • 14
    ...This copy of the code will remain but all new code updates and releases will be from the new site. Java code developed by the Australian ICGC team for operating on next-generation sequencing data. This code is currently being maintained and expanded by the QIMR Berghofer Genome Informatics team (http://www.qimrberghofer.edu.au/lab/genome-informatics/) More details and documentation can be found on the wiki: http://sourceforge.net/p/adamajava/wiki/Home/
    Downloads: 4 This Week
  • 15
    PPICompare

    detection of rewiring events in protein interaction networks

    PPICompare detects statistically significant rewiring events in protein-protein interaction networks, even if they are caused by alternative splicing, and reports detailed information about them. The input data needs to be constructed with PPIXpress (see https://sourceforge.net/projects/ppixpress/). The original publication can be found at https://bmcsystbiol.biomedcentral.com/articles/10.1186/s12918-017-0400-x.
    Downloads: 0 This Week
  • 16
    High-Throughput Tabular Data Processor
    High-Throughput Tabular Data Processor (HTDP) is a Java application intended to facilitate data exploration and reduction tasks in large text files resulting from high-throughput technologies, e.g. massively parallel sequencing or microarrays. The software has been optimized for microarray and deep parallel sequencing data, but it accepts any character-delimited tabular data set.
    Downloads: 2 This Week
  • 17

    Taxoblast

    Taxoblast is a pipeline to identify contamination in genomic sequence

    Raw genomic sequences are frequently contaminated with sequences of other organisms. Their identification is essential for the interpretation of genomic data. In this context, it is essential to distinguish between horizontal gene transfers and contamination. The genomic context of sequences can help distinguish the two scenarios. Taxoblast splits genomic scaffolds into sub-sequences of defined length and determines the closest related taxon for each of them. It then summarizes this information for the entire scaffold, taking into account the taxonomic ontology. ...
    Downloads: 0 This Week
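The split-and-summarize step described in the Taxoblast entry can be sketched roughly as follows. This is only an illustration under stated assumptions, not Taxoblast's implementation: `split_scaffold`, `summarize_scaffold` and the `assign_taxon` callback are hypothetical names, the taxon lookup (a BLAST search in the real pipeline) is replaced by a toy stand-in, and the summary is a plain majority vote that ignores the taxonomic ontology Taxoblast takes into account.

```python
from collections import Counter


def split_scaffold(sequence, window):
    """Cut a scaffold into consecutive sub-sequences of at most `window` bases."""
    return [sequence[i:i + window] for i in range(0, len(sequence), window)]


def summarize_scaffold(sequence, window, assign_taxon):
    """Return the most frequent taxon among the windows and its share of windows.

    `assign_taxon` is a hypothetical callback mapping a sub-sequence to a
    taxon label; in a real pipeline it would wrap a similarity search.
    """
    votes = Counter(assign_taxon(chunk) for chunk in split_scaffold(sequence, window))
    if not votes:
        return None, 0.0
    taxon, count = votes.most_common(1)[0]
    return taxon, count / sum(votes.values())


def toy_assign_taxon(chunk):
    """Toy stand-in for a taxon lookup: label windows by GC content."""
    gc = chunk.count("G") + chunk.count("C")
    return "Bacteria" if gc > len(chunk) / 2 else "Metazoa"


scaffold = "ATATATAT" + "GCGCGCGC" + "ATATATAT"
taxon, share = summarize_scaffold(scaffold, 8, toy_assign_taxon)
```

A scaffold whose windows mostly agree but contain an isolated stretch with a different label is the kind of pattern where genomic context helps separate horizontal gene transfer from contamination.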
  • 18

    BisSNP

    Bisulfite-seq/NOMe-seq SNPs & cytosine methylation caller

    ...It uses Bayesian inference with either manually specified or automatically estimated methylation probabilities for different cytosine contexts (not only CpG, CHH, and CHG in Bisulfite-seq, but also GCH etc. in other bisulfite-treated sequencing) to determine genotypes and methylation levels simultaneously. It works for both single-end and paired-end reads. Specificity and sensitivity have been validated with the Illumina IM SNP array. At the default threshold on 30X data (Phred scale score > 20), it could detect 92.21% of heterozygous SNPs with a 0.14% false positive rate. Cytosine calling is not based only on the reference context, so it can detect non-reference cytosine contexts. Google group for help: http://goo.gl/zL7Nj
    Downloads: 5 This Week
  • 19
    Protein Microarray Analyser

    Protein microarray data processing and normalization

    The Protein Microarray Analyser software presented here includes the following tools: (1) neighbourhood background correction, (2) net intensity correction, (3) user-defined noise threshold, (4) user-defined CV threshold amongst replicates and (5) assay controls, (6) composite ‘pin-to-pin’ normalization amongst sub-arrays, and (7) ‘array-to-array’ normalization amongst whole arrays.
    Downloads: 0 This Week
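To make tool (4) in the Protein Microarray Analyser entry concrete, a user-defined CV threshold amongst replicates could look something like the sketch below. It is an assumption-laden illustration rather than the tool's actual procedure: the function names, the data layout (a dict mapping spot names to replicate intensities), the use of the sample standard deviation, and the 0.2 default are all hypothetical.

```python
from statistics import mean, stdev


def coefficient_of_variation(values):
    """CV = sample standard deviation divided by the mean."""
    m = mean(values)
    return stdev(values) / m if m else float("inf")


def filter_by_cv(replicates, max_cv=0.2):
    """Keep only spots whose replicate intensities agree within max_cv."""
    return {
        spot: values
        for spot, values in replicates.items()
        # At least two replicates are needed to estimate a spread.
        if len(values) > 1 and coefficient_of_variation(values) <= max_cv
    }


intensities = {
    "spot_A": [100, 102, 98],   # tight replicates, CV = 0.02
    "spot_B": [100, 200, 50],   # widely scattered replicates
}
reliable = filter_by_cv(intensities)
```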
  • 20
    This site hosts the source code for the C++ version of the Broker for SBW, the NOM module, the advanced simulation suite, analysis applications and model editors.
    Downloads: 3 This Week
  • 21
    DicomReader is a simple Java decoder for DICOM files. It handles both the headers and the images within; the data (headers and pixel-value images) are saved as clear-text ASCII files. A PGM version of the image files is also provided.
    Downloads: 0 This Week
  • 22
    Specify Software

    Biodiversity Database Platform

    ...National Science Foundation and the State of Kansas. A web browser application, Specify 7, is available on GitHub. As of 2015, 450 biological collections worldwide used Specify Software for collections data management. An iPad app, Specify Insight, is also available.
    Downloads: 0 This Week
  • 23

    MOIRAI

    Simple Scientific Workflow System for CAGE Analysis

    ...After mapping, a CAGE peak on the genome indicates the position of an active transcriptional start site (TSS), and the number of reads corresponds to its expression level. CAGE is prominently used in both the FANTOM and ENCODE projects. MOIRAI is a compact yet flexible workflow system designed to carry out the main steps in the processing and analysis of CAGE data. MOIRAI has a graphical interface that allows wet-lab researchers to create, modify and run analysis workflows. Embedded within the workflows are graphical quality control indicators that allow users to assess data quality and quickly spot potential problems. The MOIRAI package comes with three main workflows that allow users to map, annotate and perform an expression analysis over multiple samples.
    Downloads: 0 This Week
  • 24
    Jillion
    Java bioinformatics library to analyze and convert genomic sequence and assembly data. This library was created and used by the J. Craig Venter Institute (JCVI).
    Downloads: 0 This Week
  • 25

    spectralHMM

    A spectral method for inferring selection from time series data

    ...This software implements the algorithms described in the following paper: Steinrücken, M., Bhaskar, A. and Song, Y.S. A novel spectral method for inferring general diploid selection from time series genetic data. Annals of Applied Statistics, Vol. 8, No. 4 (2014) 2203-2222
    Downloads: 0 This Week