Showing 301 open source projects for "data science"

  • 1

    PTESFinder

    Post-Transcriptional Exon Shuffling (PTES) Identification Pipeline

    PTESFinder is a computational pipeline for identifying Post-transcriptional Exon Shuffling events from high-throughput RNA-seq data. PTESFinder leverages established RNA-seq tools and systematically excludes all known classes of false-positive structures by applying stringent filters that specifically target them. PTESFinder compares alignment qualities of reads mapping to putative PTES structures with qualities of the same reads when mapped to genomic...
    Downloads: 0 This Week
  • 2
    CompleXChange

    differential analysis of combinatorial protein complexes

    The increasing wealth of transcriptomic data and current computational tools make it possible to infer how protein interactomes and complexomes may be assembled in specific samples. With CompleXChange, this information can be exploited to conduct quantitative differential analyses of the dynamic protein complexome. The corresponding publication can be found at https://doi.org/10.1186/s12859-019-2852-z.
    Downloads: 0 This Week
  • 3

    MarDRe

    MapReduce-based tool to remove duplicate DNA reads

    MarDRe is a de novo MapReduce-based parallel tool to remove duplicate and near-duplicate DNA reads through the clustering of single-end and paired-end sequences from FASTQ/FASTA datasets. This tool allows bioinformaticians to avoid analyzing unnecessary reads, reducing the time of subsequent processing of the dataset. MarDRe is the Big Data counterpart of ParDRe (link above), which employs HPC technologies (i.e., hybrid MPI/multithreading) to reduce runtime on multicore systems....
    Downloads: 0 This Week
  • 4

    HSRA

    Hadoop spliced read aligner for RNA-seq data

    HSRA is a MapReduce-based parallel tool for mapping reads from RNA sequencing (RNA-seq) experiments. RNA-seq analyses typically begin by mapping reads to a reference genome to determine the location from which the reads originated, which is a very time-consuming step. This tool allows bioinformatics researchers to efficiently distribute their mapping tasks over the nodes of a cluster by combining a fast multithreaded spliced aligner (HISAT2) with Apache Hadoop, which is a...
    Downloads: 0 This Week
  • 5
    GMOL

    A tool for 3D genome structure visualization

    ...It allows users to view the genome structure at multiple scales, including: global, chromosome, loci, fiber, nucleosome, and nucleotide. This software was built upon the pre-existing Jmol package by Prof. Cheng's group. The software is developed in Prof. Jianlin Cheng's Bioinformatics, Data Mining and Machine Learning Laboratory in the Computer Science Department at the University of Missouri - Columbia, USA. The project is supported by the National Science Foundation (grant no. DBI1149224). If you use GMOL in your research, please cite: Nowotny, Jackson, Avery Wells, Oluwatosin Oluwadare, Lingfei Xu, Renzhi Cao, Tuan Trieu, Chenfeng He, and Jianlin Cheng. ...
    Downloads: 1 This Week
  • 6
    seqMINER
    A genome-wide mapping data interpretation platform for NGS (ChIP-seq). A tutorial can be found at: http://genomeast.igbmc.fr/wiki/doku.php?id=training:seqminer
    Downloads: 3 This Week
  • 7
    OpenChrom
    OpenChrom is a tool for gas chromatography and mass spectrometry. The focus is to handle data files from different GC/MS and GC/FID systems and vendors. Its functionality and algorithms can be extended using a flexible plugin approach, based on Eclipse RCP.
    Downloads: 4 This Week
  • 8
    PPICompare

    detection of rewiring events in protein interaction networks

    PPICompare detects statistically significant rewiring events in protein-protein interaction networks, even if they are caused by alternative splicing, and reports detailed information about them. The input data must be constructed with PPIXpress (see https://sourceforge.net/projects/ppixpress/). The original publication can be found at https://bmcsystbiol.biomedcentral.com/articles/10.1186/s12918-017-0400-x.
    Downloads: 2 This Week
  • 9
    As of 2018-06-28, this project has moved to https://github.com/AdamaJava. This copy of the code will remain, but all new code updates and releases will be from the new site. Java code developed by the Australian ICGC team for operating on next-generation sequencing data. This code is currently being maintained and expanded by the QIMR Berghofer Genome Informatics team (http://www.qimrberghofer.edu.au/lab/genome-informatics/). More details and documentation can be found on the...
    Downloads: 0 This Week
  • 10

    Taxoblast

    Taxoblast is a pipeline to identify contamination in genomic sequence

    Raw genomic sequences are frequently contaminated with sequences from other organisms. Identifying them is essential for the interpretation of genomic data. In this context it is essential to distinguish between horizontal gene transfers and contamination. The genomic context of sequences can help distinguish the two scenarios. Taxoblast splits genomic scaffolds into sub-sequences of defined length and for each of them determines the closest related taxon. It then summarizes this...
    Downloads: 0 This Week
  • 11
    Protein Microarray Analyser

    Protein microarray data processing and normalization

    The Protein Microarray Analyser software presented here includes the following tools: (1) neighbourhood background correction, (2) net intensity correction, (3) user-defined noise threshold, (4) user-defined CV threshold amongst replicates and (5) assay controls, (6) composite ‘pin-to-pin’ normalization amongst sub-arrays, and (7) ‘array-to-array’ normalization amongst whole arrays.
    Downloads: 0 This Week
  • 12

    BisSNP

    Bisulfite-seq/NOMe-seq SNPs & cytosine methylation caller

    Now on GitHub: https://github.com/dnaase/Bis-tools/tree/master/Bis-SNP BisSNP is a package based on the Genome Analysis Toolkit (GATK) map-reduce framework for genotyping in bisulfite-treated massively parallel sequencing (Bisulfite-seq, NOMe-seq and RRBS) on the Illumina platform. It uses Bayesian inference with either manually specified or automatically estimated methylation probabilities for different cytosine contexts (not only CpG, CHH, CHG in Bisulfite-seq, but also GCH et al. in other...
    Downloads: 1 This Week
  • 13
    DicomReader is a simple Java decoder for DICOM files. It handles both the headers and the images they contain; the data (headers and pixel-value images) are saved as plain ASCII text files. A PGM version of the image files is also provided.
    Downloads: 0 This Week
  • 14
    Specify Software

    Biodiversity Database Platform

    Specify is a biological collections and species occurrence database management platform for zoological museums, herbaria and other biodiversity specimen repositories. Specify is supported by grants from the Division of Biological Infrastructure, U.S. National Science Foundation and the State of Kansas. A web browser application, Specify 7, is available on GitHub. As of 2015, 450 biological collections worldwide used Specify Software for collections data management. An iPad app, Specify Insight, is also available.
    Downloads: 0 This Week
  • 15

    MOIRAI

    Simple Scientific Workflow System for CAGE Analysis

    Cap analysis of gene expression (CAGE) is a sequencing-based technology to capture the 5’ ends of RNAs in a biological sample. After mapping, a CAGE peak on the genome indicates the position of an active transcriptional start site (TSS) and the number of reads corresponds to its expression level. CAGE is prominently used in both the FANTOM and ENCODE projects. MOIRAI is a compact yet flexible workflow system designed to carry out the main steps in data processing and analysis of CAGE data....
    Downloads: 2 This Week
  • 16
    Jillion
    Java bioinformatics library to analyze and convert genomic sequence and assembly data. This library was created and used by the J. Craig Venter Institute (JCVI).
    Downloads: 0 This Week
  • 17

    spectralHMM

    A spectral method for inferring selection from time series data

    ***WARNING*** This software was migrated to: https://github.com/popgenmethods/spectralHMM Support and updates will only be available at this new address. This software implements the algorithms described in the following paper: Steinrücken, M., Bhaskar, A. and Song, Y.S. A novel spectral method for inferring general diploid selection from time series genetic data. Annals of Applied Statistics, Vol. 8, No. 4 (2014) 2203-2222
    Downloads: 0 This Week
  • 18
    MSqBAT

    Label-free protein quantification for LC-MS

    MSqBAT is a freely available all-platform software application for label-free quantification of proteins from LC-MS data. It was developed in the lab of Dr. Christoph Rösli at the Heidelberg Institute for Stem Cells and Experimental Medicine (HI-STEM) and the German Cancer Research Center (DKFZ). Its main features are 1) Label-free, MS1-based quantification 2) Supports both LC-MALDI-MS and LC-ESI-MS data 3) Supports both GeLC-MALDI-MS and GeLC-ESI-MS data 4) Convenient,...
    Downloads: 0 This Week
  • 19
    Maui

    Maui is the Maltcms User Interface

    Maui is the Maltcms User Interface, a rich client application for Chromatography-Mass Spectrometry and related research areas.
    Downloads: 0 This Week
  • 20
    An open source workbench for chemo- and bioinformatics built on the Eclipse Rich Client Platform (RCP).
    Downloads: 2 This Week
  • 21
    Molecular Simulation Grid

    Provides high performance computing power and state of the art tools

    MoSGrid focuses on the configuration and provision of Grid services for molecular simulations, the annotation of the results with metadata, and their provision for data mining and knowledge generation. It is based on Liferay technology together with gUSE.
    Downloads: 0 This Week
  • 22
    metawatt

    Binner for assembled metagenomes

    The Metawatt binner is a graphical binning tool that makes use of multivariate statistics of tetranucleotide frequencies and differential coverage based binning. It also performs taxonomic assessment of binning quality (via diamond BLASTx). Created bins can be edited and exported as FASTA. Metawatt is implemented in Java Swing and minimally depends on Diamond, HMMer3.1, BBMap, Prodigal and the Batik library for the export of SVG graphics. Citation: Strous M, Kraft B, Bisdorf R,...
    Downloads: 0 This Week
  • 23
    Maltcms
    Maltcms, the Modular Application Toolkit for Chromatography Mass-Spectrometry, is a Java API for preprocessing, alignment, analysis and visualization of data stored in open file formats used in Proteomics and Metabolomics research.
    Downloads: 0 This Week
  • 24
    PANTHER project: software for modeling of protein sequence and function evolution, and tools for applying these data to the analysis of genome data, expression data and coding SNPs. Details available at http://www.pantherdb.org.
    Downloads: 2 This Week
  • 25

    BioC

    We describe a simple XML format to share text documents and annotations

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: simple code to hold/read/write data and perform sample processing; BioC-formatted corpora; and BioC tools that work with BioC corpora. BioC goals: simplicity, interoperability, broad use, and reuse. There should be little investment required to learn to use a format or a software module to process that format. We are...
    Downloads: 3 This Week