Showing 405 open source projects for "bioinformatics"

  • 1
    Volcano

    A Cloud Native Batch System (Project under CNCF)

    Volcano is a batch system built on Kubernetes. It provides a suite of mechanisms commonly required by many classes of batch and elastic workloads, including machine learning/deep learning, bioinformatics/genomics, and other "big data" applications. These applications typically run on generalized domain frameworks such as TensorFlow, Spark, Ray, PyTorch, and MPI, which Volcano integrates with. Volcano builds upon a decade and a half of experience running a wide variety of high-performance workloads at scale using several systems and platforms, combined with best-of-breed ideas and practices from the open-source community. ...
    Downloads: 275 This Week
  • 2
    Prokka

    Rapid prokaryotic genome annotation

    Prokka is a command-line software tool for rapid annotation of prokaryotic genomes (bacteria and archaea). Given a FASTA file of contigs, it predicts genes, rRNAs, tRNAs, and other functional elements, then assigns functions by comparing to reference protein databases and HMM profiles. It outputs GenBank, GFF, and other formats compatible with downstream tools and genome browsers. Prokka handles common complications—overlapping ORFs, frameshifts, alternate start codons—while providing...
    Downloads: 3 This Week
  • 3
    AlphaFold 3

    AlphaFold 3 inference pipeline

    ...This repository provides the complete inference pipeline for running AlphaFold 3, though access to the model parameters is restricted and must be obtained directly from Google under specific terms of use. The system is designed for scientific research applications in structural biology, biochemistry, and bioinformatics, enabling accurate modeling of proteins, ligands, and covalent modifications. Users can perform local predictions via Docker containers, integrating AlphaFold 3’s inference process with provided JSON input configurations. The software includes flexible options for running both data preprocessing and GPU-accelerated inference, allowing users to adapt to available computational resources.
    Downloads: 6 This Week
  • 4
    Codon

    A high-performance, zero-overhead, extensible Python compiler

    ...The Codon framework is fully modular and extensible, allowing for the seamless integration of new modules, compiler optimizations, domain-specific languages and so on. We actively develop Codon extensions for a number of domains such as bioinformatics and quantitative finance.
    Downloads: 2 This Week
  • 5
    nextflow

    A DSL for data-driven computational pipelines

    Nextflow is a powerful workflow orchestration tool designed for the development and execution of data-intensive pipelines, especially in the field of bioinformatics. It enables reproducible and portable workflows that can run seamlessly across different environments including local, HPC clusters, and cloud platforms. With support for containers and software package managers, Nextflow makes it easy to define complex data processes using a concise DSL and manage dependencies efficiently.
    Downloads: 0 This Week
  • 6
    audioFlux

    A library for audio and music analysis, feature extraction

    audioFlux is a deep learning tool library for audio and music analysis and feature extraction. It can be used for deep learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc. It supports dozens of time-frequency analysis transformation methods and hundreds of corresponding time-domain and frequency-domain feature combinations. Its output can be fed to deep learning networks for training, and it is used to study various tasks in the audio field such as classification, separation, Music Information Retrieval (MIR), ASR, etc.
    Downloads: 0 This Week
  • 7
    MedgeClaw

    Open-source AI research assistant for biomedicine

    MedgeClaw is a specialized AI-powered research assistant tailored for biomedical and scientific workflows, built on top of OpenClaw and Claude Code architectures. It integrates a large library of domain-specific skills, enabling it to perform complex analyses in areas such as genomics, drug discovery, and clinical research. The system connects conversational interfaces with computational environments, allowing users to initiate research tasks through messaging platforms while the backend...
    Downloads: 0 This Week
  • 8
    DGL

    Python package built to ease deep learning on graph

    ...Scale to giant graphs via multi-GPU acceleration and distributed training infrastructure. DGL empowers a variety of domain-specific projects including DGL-KE for learning large-scale knowledge graph embeddings, DGL-LifeSci for bioinformatics and cheminformatics, and many others. We are keen on bringing graphs closer to deep learning researchers. We want to make it easy to implement the graph neural network model family. We also want to make the combination of graph-based modules and tensor-based modules (PyTorch or MXNet) as smooth as possible. DGL provides a powerful graph object that can reside on either CPU or GPU. ...
    Downloads: 0 This Week
  • 9

    Lep-MAP3

    ...In particular, it supports whole-genome sequencing data even at low sequencing depth. If you use Lep-MAP3, please cite: P. Rastas. Lep-MAP3: Robust linkage mapping even for low-coverage whole genome sequencing data. Bioinformatics, 2017, 33(23):3726-3732. https://doi.org/10.1093/bioinformatics/btx494. See also Lep-Anchor, which anchors genomes using Lep-MAP3 linkage maps: http://sourceforge.net/projects/lep-anchor
    Downloads: 21 This Week
  • 10

    Lep-Anchor

    Linkage map guided genome anchoring

    Lep-Anchor has been developed to efficiently anchor genomes into chromosomes by using all the information provided by Lep-MAP3 and the additional information provided by long reads and contig-contig alignments to link contigs and to collapse haplotypes. Lep-Anchor supports millions of markers over multiple maps. Pasi Rastas, Lep-Anchor: Automated construction of linkage map anchored haploid genomes, Bioinformatics, btz978, https://doi.org/10.1093/bioinformatics/btz978
    Downloads: 1 This Week
  • 11
    Armadillo

    fast C++ library for linear algebra & scientific computing

    * Fast C++ library for linear algebra (matrix maths) and scientific computing
    * Easy to use functions and syntax, deliberately similar to Matlab / Octave
    * Uses template meta-programming techniques to increase efficiency
    * Provides user-friendly wrappers for OpenBLAS, Intel MKL, LAPACK, ATLAS, ARPACK, SuperLU and FFTW libraries
    * Useful for machine learning, pattern recognition, signal processing, bioinformatics, statistics, finance, etc.
    * Downloads: http://arma.sourceforge.net/download.html
    * Documentation: http://arma.sourceforge.net/docs.html
    * Bug reports: http://arma.sourceforge.net/faq.html
    * Git repo: https://gitlab.com/conradsnicta/armadillo-code
    Downloads: 2,727 This Week
  • 12
    TomoJ

    ImageJ plugin to perform Electron Tomography

    ...registration is described in: Journal of Structural Biology: X. 2020, Volume 4. "Improvements on marker-free images alignment for electron tomography." C.O.S. Sorzano et al. https://doi.org/10.1016/j.yjsbx.2020.100037. BMC Bioinformatics. 2009 Apr 27;10:124. "Marker-free image registration of electron tomography tilt-series." C.O.S. Sorzano et al. The reconstruction part was described in: BMC Bioinformatics. 2007 Aug 6;8:288. "TomoJ: tomography software for three-dimensional reconstruction in transmission electron microscopy." Messaoudi C et al.
    Downloads: 19 This Week
  • 13

    CpGtools

    Python package to analyze DNA methylation data

    The CpGtools package provides a number of Python programs to annotate, QC, visualize, and analyze DNA methylation data generated from Illumina HumanMethylation450 BeadChip (450K) / MethylationEPIC BeadChip (850K) arrays or from RRBS / WGBS.
    Downloads: 1 This Week
  • 14

    Fastq-Fasta Converter

    A bioinformatics tool that converts FASTQ into FASTA

    Fastq-Fasta Converter is a desktop application that converts FASTQ files into FASTA files.
    Downloads: 0 This Week
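
The conversion that Fastq-Fasta Converter describes can be sketched in a few lines. This is only an illustration and not the project's own code: the function name `fastq_to_fasta` is hypothetical, and the sketch assumes unwrapped FASTQ in which every record occupies exactly four lines (header, sequence, `+` separator, qualities).

```python
from typing import Iterable, Iterator


def fastq_to_fasta(lines: Iterable[str]) -> Iterator[str]:
    """Yield FASTA lines from 4-line FASTQ records (hypothetical helper).

    Assumption: each record is exactly four lines and sequences are not wrapped.
    """
    record = []
    for raw in lines:
        line = raw.rstrip("\n")
        if not line and not record:
            continue  # skip blank lines between records
        record.append(line)
        if len(record) == 4:
            header, seq, plus, _qual = record
            if not header.startswith("@") or not plus.startswith("+"):
                raise ValueError(f"malformed FASTQ record: {header!r}")
            yield ">" + header[1:]  # FASTA header keeps the read name
            yield seq               # quality line is dropped
            record = []
    if record:
        raise ValueError("truncated FASTQ record at end of input")
```

Passing an open file handle as `lines` streams the conversion record by record, so memory use stays flat for large inputs.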
  • 15

    CpDB tutorial

    Hands-on automatic annotation tutorial in Linux

    This tutorial was conceived for a five-hour course teaching automatic annotation to students with basic knowledge of Linux and some bioinformatics background. It is part of a book chapter available at https://doi.org/10.5772/23724. This is an alternative mirror for the site presented in the book chapter, which is no longer available. The parser's source code is available at sourceforge.net/projects/cpdb/.
    Downloads: 0 This Week
  • 16
    com-hfg

    Java library for XML, HTML, or SVG generation + bioinformatics classes

    The com.hfg (hairyfatguy.com) library is a collection of utility classes that make it easy to construct XML, HTML, XHTML, or SVG in an object-oriented way from within your Java application or webapp. It also contains other IO utilities and bioinformatics classes for phylogenetic trees, taxonomy, and multi-chain proteins. Requires JDK 1.8.
    Downloads: 0 This Week
  • 17
    Avogadro

    An intuitive molecular editor and visualization tool

    Avogadro is an advanced molecular editor designed for cross-platform use in computational chemistry, molecular modeling, bioinformatics, materials science and related areas. It offers a flexible rendering framework and a powerful plugin architecture.
    Downloads: 1,141 This Week
  • 18
    CIERA

    Crop Information Engine and Research Assistant

    CIERA is a data management system for multi-crop breeding programs. It handles the genealogy, phenotypic and genotypic data of most modern breeding programs. It is distributed under the GNU General Public License, Version 3. *** This site is under construction, as of June 2024. ***
    Downloads: 0 This Week
  • 19
    Bandicoot

    fast C++ library for GPU linear algebra & scientific computing

    * Fast GPU linear algebra library (matrix maths) for the C++ language, aiming towards a good balance between speed and ease of use
    * Provides high-level syntax and functionality deliberately similar to Matlab
    * Provides an API that is aiming to be compatible with Armadillo for easy transition between CPU and GPU linear algebra code
    * Useful for algorithm development directly in C++, or quick conversion of research code into production environments
    * Distributed under the permissive Apache 2.0 license, useful for both open-source and proprietary (closed-source) software
    * Can be used for machine learning, pattern recognition, computer vision, signal processing, bioinformatics, statistics, finance, etc.
    * Downloads: http://coot.sourceforge.io/download.html
    * Documentation: http://coot.sourceforge.io/docs.html
    * Bug reports: http://coot.sourceforge.io/faq.html
    * Git repo: https://gitlab.com/conradsnicta/bandicoot-code
    Downloads: 4 This Week
  • 20
    QUAST

    Quality Assessment Tool for Genome Assemblies

    ...The tool also produces web-based reports condensing all information in one easy-to-navigate file. The QUAST paper and its three follow-up papers (MetaQUAST, Icarus, QUAST-LG) were published in Bioinformatics; the latest paper (WebQUAST) appeared in Nucleic Acids Research.
    Downloads: 38 This Week
  • 21
    PANDA

    A comprehensive and flexible quantification tool for proteomics data

    PANDA is a comprehensive and flexible tool for quantitative proteomics data analysis, built on our years of work in quantitative proteomics. It introduces several novelties. First, we incorporate the algorithms of LFQuant (Proteomics 2012, 12, (23-24), 3475-84) and SILVER (Bioinformatics 2014, 30, (4), 586-7) into PANDA. Second, we apply the state-of-the-art concept of quantification reliability in this quantitative workflow: at the spectrum, peptide, and protein levels, PANDA computes several quantitative filters and new scores for quantification confidence. Third, PANDA is designed to process large proteomics datasets in parallel.
    Downloads: 7 This Week
  • 22
    Gemi

    PCR primers / probes design from multiple & degenerate sequences

    Please do not hesitate to contact us if you need help: https://sites.google.com/view/hsa23/ Gemi is an automated, fast, and easy-to-use bioinformatics tool with a user-friendly interface for designing primers and probes for polymerase chain reaction (PCR). Gemi accepts multiple aligned and long DNA and RNA sequences containing degenerate nucleotides (non-A/C/G/T bases). Gemi can be used for quantitative, real-time and conventional PCR (qPCR, rt-PCR, etc.), and Sanger sequencing. Gemi can parse large sequence datasets efficiently. ...
    Downloads: 7 This Week
  • 23

    DataPrep

    Python-based data preprocessing tool

    DataPrep v0.2 is a Tkinter-based GUI tool designed to assist users with data preprocessing, multicollinearity removal, and feature selection for a wide range of applications in cheminformatics, bioinformatics, data analysis, molecular modeling, machine learning, and quantitative structure-property relationship (QSPR) studies. It can load, process, and save datasets, and supports different preprocessing and multicollinearity removal strategies with customizable parameter settings.
    Downloads: 0 This Week
  • 24
    UniversalMer

    A CLI k-mer counting tool for multiple sizes of k at once.

    ...The program counts and summarizes the exact frequency of all k-mers from 1-mer to a user-defined maximum length (kmax). Analyzing the k-mer spectrum across multiple values of k can be done in seconds. This program is designed for bioinformatics researchers and scientists.
    Downloads: 0 This Week
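
The idea of counting exact k-mer frequencies for every k from 1 up to a maximum can be illustrated with a naive sketch. It is not UniversalMer's implementation and makes no claim about its speed or algorithm: the function name `count_kmers` is hypothetical, and only the `kmax` parameter name is borrowed from the description above.

```python
from collections import Counter
from typing import Dict


def count_kmers(seq: str, kmax: int) -> Dict[int, Counter]:
    """Return {k: Counter of k-mers} for k = 1..kmax (hypothetical helper).

    Naive sliding-window approach: one pass over the sequence per value of k.
    """
    seq = seq.upper()
    counts: Dict[int, Counter] = {}
    for k in range(1, kmax + 1):
        # A sequence of length n contains n - k + 1 overlapping k-mers.
        counts[k] = Counter(seq[i:i + k] for i in range(len(seq) - k + 1))
    return counts
```

For example, `count_kmers("ACGTAC", 2)[2]` holds the five overlapping 2-mers of the sequence, with `AC` counted twice.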
  • 25

    AngClust

    AngClust: Angle-based feature clustering for time series

    Citation: Aimin Li, Siqi Xiong, Junhuai Li, Saurav Mallik, Yajun Liu, Rong Fei, Hongfang Zhou, Guangming Liu. AngClust: Angle Feature-Based Clustering for Short Time Series Gene Expression Profiles. January 2022. IEEE/ACM transactions on computational biology and bioinformatics / IEEE, ACM. DOI: 10.1109/TCBB.2022.3192306
    Full text: https://ieeexplore.ieee.org/document/9833353/ https://pubmed.ncbi.nlm.nih.gov/35853049/
    Highlights
    * We proposed a novel clustering algorithm based on angular features for short-term gene expression profiles.
    * We defined three indicators to identify significant clusters: (i) the fluctuation degree of expression levels, (ii) homogeneity, and (iii) the degree of clustering while the clusters are functionally significant...
    Downloads: 0 This Week