Showing 120 open source projects for "data science"

View related business solutions
  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1

    Webapp

    A web interface to the SnowyOwl gene prediction pipeline

    Downloads: 0 This Week
    Last Update:
    See Project
  • 2

    hrefinder

    Detection of homologous recombination events from SNP data

    This software detects homologous recombination events (HREs) from SNP data. Based on SNP alleles calls and locations, it breaks the genomes into locally colinear blocks, and looks for cases where SNPs do not agree with the vertical pattern of inheritance in a phylogeny. It applies a dynamic programming algorithm to model whether changes within a block are likely a result of mutations, sequencing errors, or HRE. We use information from the nearby SNPs, so that if 1-2 alleles in a series of...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 3
    ReadWrapper is a Python toolset to prepare and package raw sequencing data ("reads") for submission to the Sequence Read Archive (SRA, http://trace.ncbi.nlm.nih.gov/Traces/sra/).
    Downloads: 0 This Week
    Last Update:
    See Project
  • 4

    AMBIENT

    Find active modules in metabolic networks using high-throughput data

    IMPORTANT: Since publication of the AMBIENT method in BMC Sys Bio, several updates have been made. If you wish to use the version used in the paper it is v0.6.3, however I recommend using the latest version which works in the same way but with additional options and has stability and performance improvements. Thanks for your interest! AMBIENT (Active Modules for Bipartite Networks) is a Python module that uses simulated annealing to find areas of a metabolic network (modules) that have...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Save Up to 91% on Cloud Compute With Spot VMs Icon
    Save Up to 91% on Cloud Compute With Spot VMs

    Automatic sustained-use discounts. One free VM per month. No negotiation needed.

    Run batch jobs at 60-91% off with Spot VMs. Long-running workloads get automatic discounts with sustained use.
    Try Free
  • 5

    GS junior Webserver

    GS junior Webserver

    GS junior Webserver is a web based file server to easily access sequencing data of
    Downloads: 0 This Week
    Last Update:
    See Project
  • 6

    Mix

    Mix is a tool to combine multiple assemblies from NGS data.

    Mix is a tool that combines two or more draft assemblies, without relying on a reference genome and has the goal to reduce contig fragmentation and thus speed-up genome finishing. The proposed algorithm builds an extension graph where vertices represent extremities of contigs and edges represent existing alignments between these extremities. These alignment edges are used for contig extension. The resulting output assembly corresponds to a path in the extension graph that maximizes the...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 7
    GEPETTO - Gene Prioritization in Java

    GEPETTO - Gene Prioritization in Java

    GEPETTO (GEne Prioritization ExTended TOol)

    GEPETTO (GEne PrioriTization ExTended TOol) is an original open-source framework, distributed under the LGPL license, for gene selection and prioritization on a desktop computer that ensures confidentiality of personal data. It takes advantage of the data integration capabilities in the SM2PH-Central Framework(KD4v,MSV3d,BIRD,..), combined with in-house developed gene prioritization methods. It currently incorporates six prioritization modules, based on gene sequence, protein-protein...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 8

    SpiKeDeteKt

    An automatic spike detection program to be used with new KlustaKwik

    This is an automatic spike detection program which takes account of probe geometry and produces a .mask file to be used with the new masked version of KlustaKwik. We recommend you use Python 2.6 or 2.7, e.g. a free academic version can be obtained from Entthought Python. The input files for SpiKeDeteKt are: .dat (raw data file) .probe (probe file, described below - user constructed) parameters.py (optional - otherwise it uses defaultparameters.py) SpiKeDeteKt outputs...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 9
    GUI for DEDA

    GUI for DEDA

    GUI for DEmography Data Analysis

    <This project has been completely rewrote and transformed into a new one: https://sourceforge.net/projects/deday/. 2013/06/26> The graphic user interface for DEDA (DEmography Data Analysis), a scientific software package fitting survivalship data to a number of distributions using maximum likelihood (ML) method. Currently, Weibull (2p), Gompertz and Gompertz-Makeham are supported. IMPORTANT NOTICE: Only the GUI is provided here. In order to perform the analysis, one also need the DEDA...
    Downloads: 0 This Week
    Last Update:
    See Project
  • Build Agents and Models on One Platform Icon
    Build Agents and Models on One Platform

    Everything you need to build production-ready agents and models. Access 200+ Google and third-party AI models and tools.

    Gemini Enterprise Agent Platform is Google Cloud's comprehensive platform for developers to build, scale, govern, and optimize agents and models. Choose from Google's most advanced models and third-party models like Anthropic's Claude Model Family.
    Try It Free
  • 10
    CoNIFER

    CoNIFER

    Homepage for CoNIFER (Copy Number Inference From Exome Reads)

    CoNIFER uses exome sequencing data to find copy number variants (CNVs) and genotype the copy-number of duplicated genes.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 11

    u/sbmv2012

    Taxonomy assignment of metazoans using a python based pipeline

    The aim of this project is to create an automated pipeline for taxonomic assignment of DNA sequences obtained from environmental samples. We develop a series of python scripts to process the raw sequence data obtained from benthic environmental samples and to taxonomical assignment of these sequences and finally to integrate all data in a relational database.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 12

    QUASR

    Cross-platform NGS processing and analysis pipeline in Python

    QUASR is a lightweight pipeline written to process and analyse next-generation sequencing (NGS) data from Illumina, 454, and Ion Torrent platforms. Although originally written for viral data, it is generic enough to work on any NGS dataset. Functions include: duplicate removal demultiplexing primer-removal quality-assurance (QA) graphing quality control (QC) consensus-generation minority-variant determination minority-variant graphing The main current version is 6.X, which is...
    Downloads: 0 This Week
    Last Update:
    See Project
  • 13
    GWCNV is a genome-wide algorithm for detecting CNV associations with diseases. It works directly on a transformation of intensity data. It is powerful and sensitive in detecting small CNV associations, and retains high power for large CNVs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 14
    The RDXplorer is a computational tool for copy number variants (CNV) detection in whole human genome sequence data using read depth (RD) coverage. CNV detection is based on the Event-Wise Testing (EWT) algorithm recently published by our group.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 15
    AnnTools is an efficient, fast and robust bioinformatics tool annotating SNP and CNV calls generated from sequencing and microarray data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 16
    Febrl (Freely Extensible Biomedical Record Linkage) does data standardisation (segmentation and cleaning) and probabilistic record linkage ("fuzzy" matching) of one or more files or data sources which do not share a unique record key or identifier.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 17
    PyBact is an open source software written in Python for Bacterial Identification. The software generates simulated data matrix which accurately represents the probabilistic positive/negative results of the tested biochemical test.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 18
    This is a c-library that provides tools for advanced analysis of electrophysiological data. It features denoising, unsupervised classification, time-frequency analysis, phase-space analysis, neural networks, time-warping and more.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 19
    Flow Investigation using N-Dimensions
    Flow Investigation using N-Dimensions (FIND) is a program designed for analysis and visualization of Flow Cytometry data. FIND focuses specifically on automated population discovery (clustering) methods. The project targets both users and developers.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 20
    Image analysis program that extract the movements of the bacteria from digitised movies and data processing associated.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 21
    H-mito is a mitochondrial DNA haplogroup prediction tool based on the phylogenetic tree http://www.phylotree.org/. Ancillary scripts: mitoP.py to extract mutation lists and clustal-2-fasta.zip to join, align through clustalw and convert data.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 22
    The Protein Geometry Database hosts the development code for a flexible database for searching protein geometry, as well as a library for accessing this data for protein modeling & refinement programs.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 23
    A database and a web front end for physiologic data on animal feeding, developed with PostgreSQL and Django at NESCent (http://www.nescent.org) for Mammalian Feeding Working Group (http://www.feedexp.org). Current development of the codebase is on Github at http://github.com/NESCent/feedingdb.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 24
    qips is a software package for analyzing ChIP-seq ("Chromatin ImmunoPrecipitation on sequencing") data. It finds enriched regions of arbitrary lengths and is therefore especially suited for analyzing ChIP-seq of histone marks or polymerase.
    Downloads: 0 This Week
    Last Update:
    See Project
  • 25
    A collection of tools for working with the comparative data analysis ontology including import/export facilities for common phylogenetic file formats, and also a triple-store framework.
    Downloads: 0 This Week
    Last Update:
    See Project
Auth0 Logo