Python Bio-Informatics Software

View 66 business solutions

Browse free open source Python Bio-Informatics Software and projects below. Use the toggles on the left to filter open source Python Bio-Informatics Software by OS, license, language, programming language, and project status.

  • Go From AI Idea to AI App Fast Icon
    Go From AI Idea to AI App Fast

    One platform to build, fine-tune, and deploy ML models. No MLOps team required.

    Access Gemini 3 and 200+ models. Build chatbots, agents, or custom models with built-in monitoring and scaling.
    Try Free
  • MongoDB Atlas runs apps anywhere Icon
    MongoDB Atlas runs apps anywhere

    Deploy in 115+ regions with the modern database for every enterprise.

    MongoDB Atlas gives you the freedom to build and run modern applications anywhere—across AWS, Azure, and Google Cloud. With global availability in over 115 regions, Atlas lets you deploy close to your users, meet compliance needs, and scale with confidence across any geography.
    Start Free
  • 1
    PyRx - Virtual Screening Tool

    PyRx - Virtual Screening Tool

    Virtual Screening software for Computational Drug Discovery

    PyRx is a Virtual Screening software for Computational Drug Discovery that can be used to screen libraries of compounds against potential drug targets. PyRx enables Medicinal Chemists to run Virtual Screening from any platform and helps users in every step of this process - from data preparation to job submission and analysis of the results. While it is true that there is no magic button in the drug discovery process, PyRx includes docking wizard with easy-to-use user interface which makes it a valuable tool for Computer-Aided Drug Design. PyRx also includes chemical spreadsheet-like functionality and powerful visualization engine that are essential for Rational Drug Design. Please visits PyRx home page to learn more about PyRx and watch videos on how to use it.
    Leader badge
    Downloads: 2,370 This Week
    Last Update:
    See Project
  • 2
    MzDOCK - Multiple Ligand Docking Tool

    MzDOCK - Multiple Ligand Docking Tool

    MzDOCK is A Virtual Screening Tool For Drug Discovery Research

    - Molecular Docking Virtual Screening Tool To Aid In Drug Discovery Research. - Published in Wiley, Journal of Computational Chemistry . Link: https://onlinelibrary.wiley.com/doi/abs/10.1002/jcc.27390 - Developed with Synchronized functioning of Python and Batch scripts -Integerated With Pymol-open-source for visualizing interaction (PSE file) generated from MzDOCK - Integrated with Molecule Drawing Tool - JSME Editor - If you are facing any issues or for queries . Contact us - kabeermuzammil614@gmail.com MzDOCK Development - Available on WindowsOS and Linux ubuntu 22.04 and more - Software Authorship - Muzammil Kabier
    Downloads: 84 This Week
    Last Update:
    See Project
  • 3
    Free DELTA

    Free DELTA

    Software tools for processing taxonomic descriptions in DELTA format

    The Free DELTA Project is a free, open-source, software alternative for a generic system to the processing of taxonomic descriptions based on the DELTA (DEscription Language for TAxonomy) format.
    Leader badge
    Downloads: 351 This Week
    Last Update:
    See Project
  • 4
    BioXTAS RAW

    BioXTAS RAW

    Processing and analysis of Small Angle X-ray Scattering (SAXS) data.

    BioXTAS RAW is a program for analysis of Small-Angle X-ray Scattering (SAXS) data. The software enables: creation of 1D scattering profiles from 2D detector images, standard data operations such as averaging and subtraction, analysis of radius of gyration (Rg) and molecular weight, and advanced analysis using GNOM and DAMMIF as well as electron density reconstructions using DENSS. It also allows easy processing of inline SEC-SAXS data and data deconvolution using the evolving factor analysis (EFA) or the regularized alternating least squares (REGALS) methods. Active source code is now maintained on github: https://github.com/jbhopkins/bioxtasraw To install: Check the instructions available at: http://bioxtas-raw.readthedocs.io/en/latest/install.html and in the Files tab. User guides: RAW guides are available at: http://bioxtas-raw.readthedocs.io/ and in the Files tab. To contact us, see: https://bioxtas-raw.readthedocs.io/en/latest/help.html
    Leader badge
    Downloads: 99 This Week
    Last Update:
    See Project
  • Train ML Models With SQL You Already Know Icon
    Train ML Models With SQL You Already Know

    BigQuery automates data prep, analysis, and predictions with built-in AI assistance.

    Build and deploy ML models using familiar SQL. Automate data prep with built-in Gemini. Query 1 TB and store 10 GB free monthly.
    Try Free
  • 5

    MAGeCK

    Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout

    Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout (MAGeCK) is a computational tool to identify important genes from the recent genome-scale CRISPR-Cas9 knockout screens technology. For instructions and documentations, please refer to the wiki page. MAGeCK is developed by Wei Li and Han Xu from Dr. Xiaole Shirley Liu's lab at Dana-Farber Cancer Institute/Harvard School of Public Health, and is maintained by Wei Li lab at Children's National Medical Center. We thank the support from Claudia Adams Barr Program in Innovative Basic Cancer Research and NIH/NHGRI to develop MAGeCK.
    Leader badge
    Downloads: 92 This Week
    Last Update:
    See Project
  • 6
    PowerTalk automatically speaks Microsoft PowerPoint presentations. For presenters who find speaking difficult, audiences containing people with visual impairments and fun educational uses. Uses synthesised computer speech provided with Windows
    Downloads: 24 This Week
    Last Update:
    See Project
  • 7
    An open source framework for LC-MS based proteomics and metabolomics. OpenMS offers data structures and algorithms for the processing of mass spectrometry data. The library is written in C++. Our source code and wiki lives on GitHub (https://github.com/OpenMS/OpenMS).
    Downloads: 26 This Week
    Last Update:
    See Project
  • 8
    CBFlib
    CBFLIB is a library of ANSI-C functions providing a simple mechanism for accessing Crystallographic Binary Files (CBF files) and Image-supporting CIF (imgCIF) files. The CBFLIB API is loosely based on the CIFPARSE API for mmCIF files.
    Leader badge
    Downloads: 56 This Week
    Last Update:
    See Project
  • 9
    Open Drug Discovery Toolkit (ODDT)

    Open Drug Discovery Toolkit (ODDT)

    Modular and comprehensive toolkit for use in cheminformatics

    Open Drug Discovery Toolkit (ODDT) is modular and comprehensive toolkit for use in cheminformatics, molecular modeling etc. ODDT is written in Python, and makes extensive use of Numpy/Scipy. You can use any supported toolkit united under common API (for reference see Pybel or Cinfony). All methods and software based on Pybel/Cinfony should be drop-in compatible with ODDT toolkits. In contrast to its predecessors, which were aimed to have minimalistic API, ODDT introduces extended methods and additional handles. These extensions allow to use of toolkits at all its grace and some features may be backported from others to introduce missing functionalities. Most important and handy property of Molecule in ODDT are Numpy dictionaries containing most properties of supplied molecule. Some of them are straightforward, other require some calculation, ie. atom features. Dictionaries are provided for major entities of molecules.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Full-stack observability with actually useful AI | Grafana Cloud Icon
    Full-stack observability with actually useful AI | Grafana Cloud

    Our generous forever free tier includes the full platform, including the AI Assistant, for 3 users with 10k metrics, 50GB logs, and 50GB traces.

    Built on open standards like Prometheus and OpenTelemetry, Grafana Cloud includes Kubernetes Monitoring, Application Observability, Incident Response, plus the AI-powered Grafana Assistant. Get started with our generous free tier today.
    Create free account
  • 10
    Protenix

    Protenix

    A trainable PyTorch reproduction of AlphaFold 3

    Protenix is an open-source, trainable PyTorch reimplementation of AlphaFold 3, developed by ByteDance with the goal of democratizing high-accuracy protein structure prediction for computational biology and drug-discovery research. Protenix provides a complete pipeline for turning protein sequences (with optional MSA / sequence alignment) or structural inputs (e.g. PDB/CIF) into full 3D atomic-level structure predictions. It supports both “full” models and lightweight variants such as “Protenix-Mini,” offering a trade-off between speed/compute cost and predictive accuracy — making structure prediction accessible even in resource-constrained environments. The project also includes support for constraints (e.g., specifying residue- or atom-level contact constraints, or pocket constraints) to guide predictions toward biologically or experimentally relevant conformations, which enhances its utility for tasks like modeling complexes, ligands, or antibody–antigen interactions.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 11
    The Sashimi project hosts the Trans-Proteomic Pipeline (TPP), a mature suite of tools for mass-spec (MS, MS/MS) based proteomics: statistical validation, quantitation, visualization, and converters from raw MS data to the open mzML/mzXML formats.
    Leader badge
    Downloads: 36 This Week
    Last Update:
    See Project
  • 12

    CPAT

    RNA coding potential assessment tool

    Using RNA-seq, tens of thousands of novel transcripts and isoforms have been identified (Djebali, et al Nature, 2012 , Carbili et al, Gene & Development, 2011) The discovery of these hidden transcriptome rejuvenate the need of distinguishing coding and noncoding RNA. However, Most previous coding potential prediction methods heavily rely on alignment, either pairwise alignment to search for protein evidence or multiple alignments to calculate phylogenetic conservation score (such as CPC , PhyloCSF and RNACode ). This is because most previously identified transcripts including protein coding RNA and short, housekeeping/regulatory RNAs such as snRNAs, snoRNA and tRNA are highly conserved. While still very useful, these approaches have several limitations: Most lncRNAs are less conserved and tend to be lineage specific which greatly limit the discrimination power of alignment-based methods. For example, of 550 lncRNAs detected from zebrafish, only 29 of them had detectable sequence
    Leader badge
    Downloads: 34 This Week
    Last Update:
    See Project
  • 13
    Pysces

    Pysces

    PySCeS is the Python Simulator of Cellular Systems

    PySCeS is the Python Simulator of Cellular Systems. For a network of coupled reactions it does a stoichiometric matrix analysis, calculates the time course and steady state, and does a complete control analysis.
    Leader badge
    Downloads: 8 This Week
    Last Update:
    See Project
  • 14
    BioNeMo

    BioNeMo

    BioNeMo Framework: For building and adapting AI models

    BioNeMo is an AI-powered framework developed by NVIDIA for protein and molecular generation using deep learning models. It provides researchers and developers with tools to design, analyze, and optimize biological molecules, aiding in drug discovery and synthetic biology applications.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 15
    Evo 2

    Evo 2

    Genome modeling and design across all domains of life

    Evo 2 is a DNA language model system designed for long-context genome modeling and biological sequence design across all domains of life. The project models DNA at single-nucleotide resolution and supports context windows of up to one million base pairs, which places it in a class of models built for very large genomic reasoning tasks. According to the repository, it uses the StripedHyena 2 architecture, was pretrained with Savanna, and was trained autoregressively on the OpenGenome2 dataset containing 8.8 trillion tokens. The codebase is focused on local inference and generation through the Vortex inference stack rather than serving as a full training framework alone, although it also points users to training and fine-tuning resources. It supports multiple ways of working with the model, including forward passes, embeddings, generation workflows, notebooks, hosted APIs, and self-hosted deployment through NVIDIA NIM.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 16

    miRge3

    Comprehensive analysis of small RNA sequencing data

    An update to Python package to perform comprehensive analysis of small RNA sequencing data, including miRNA annotation, A-to-I editing, novel miRNA detection, isomiR analysis, visualization through IGV, processing Unique Molecular Identifieres (UMI), tRF detection and producing interactive graphical output. miRge3.0 is developed in python v3.8 and is a recent update of our previous version miRge2.0. This build includes command line interface (CLI) and cross-platform Graphical User Interface (GUI). For more details refer to documentation link below.
    Downloads: 26 This Week
    Last Update:
    See Project
  • 17

    FusionCatcher

    Somatic fusion-genes finder for RNA-seq data

    FusionCatcher searches for novel/known somatic fusion genes, translocations, and chimeras in RNA-seq data (paired-end reads from Illumina NGS platforms like Solexa and HiSeq) from diseased samples. The aims of FusionCatcher are: - very good detection rate for finding candidate fusion genes, - very easy to use (i.e. no a priori knowledge of databases and bioinformatics is needed in order to run FusionCatcher), - very good detection of challenging fusion genes, like for example IGH fusions, CIC fusions, DUX4 fusions, CRLF2 fusions, TCF3 fusions, etc. - to be as automatic as possible (i.e. the FusionCatcher will choose automatically the best parameters in order to find candidate fusion genes, e.g. finding automatically the adapters, building the exon-exon junctions automatically based on the length of the input reads, etc.) while providing the best possible detection rate for finding fusion genes.
    Leader badge
    Downloads: 25 This Week
    Last Update:
    See Project
  • 18
    QUAST

    QUAST

    Quality Assessment Tool for Genome Assemblies

    QUAST performs fast and convenient quality evaluation and comparison of genome assemblies. It is maintained by the Gurevich lab at HIPS (https://helmholtz-hips.de/en/hmsb). For the most up-to-date description, please visit http://quast.sf.net. Below are just some highlights. QUAST computes several well-known metrics, including contig accuracy, the number of genes discovered, N50, and others, as well as introducing new ones, like NA50 (see details in the paper and manual). A comprehensive analysis results in summary tables (in plain text, tab-separated, and LaTeX formats) and colorful plots. The tool also produces web-based reports condensing all information in one easy-to-navigate file. QUAST and its three follow-up papers (MetaQUAST, Icarus, QUAST-LG) papers were published in Bioinformatics; the last paper (WebQUAST) is out in Nucl Acid Research.
    Leader badge
    Downloads: 24 This Week
    Last Update:
    See Project
  • 19
    STochastic Engine for Pathway Simulation
    Stochastic simulator for signaling pathways occuring in biological cells. Handles reaction-diffusion processes in 3D tetrahedral meshes. Please check our new repository: https://github.com/CNS-OIST/STEPS
    Downloads: 4 This Week
    Last Update:
    See Project
  • 20

    BioC

    We describe a simple XML format to share text documents and annotation

    A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.
    Leader badge
    Downloads: 16 This Week
    Last Update:
    See Project
  • 21
    CBMPy

    CBMPy

    PySCeS Constraint Based Modelling

    PySCeS CBMPy is a new platform for constraint based modelling and analysis. It has been designed using principles developed in the PySCeS simulation software project: usability, flexibility and accessibility. CBMPy supports the latest standards for encoding CBM models encoding, SBML L3 FBC, COBRA as well as MIRIAM compliant RDF and custom annotations. Its architecture is both extensible and flexible using data structures that are intuitive to the biologist while transparently translating these into the underlying mathematical structures. . CBMPy implements popular analyses such as FBA, FVA, element/charge balancing, network analysis and model editing as well as advanced methods developed for the ecosystem modelling. CBMPy supports user interaction via: - interactive console or as a library for advanced use - GUI, visual representation of the model, analysis methods - a SOAP based webAPI exposes high level functionality via web services
    Downloads: 16 This Week
    Last Update:
    See Project
  • 22
    raxmlGUI
    RELEASE NOTE: Get raxmlGUI 2.0 at the NEW PROJECT LOCATION: https://antonellilab.github.io/raxmlGUI/ raxmlGUI is a graphical user interface to RAxML, one of the most popular and widely used software for phylogenetic inference using maximum likelihood. A userfriendly graphical front-end for phylogenetic analyses using RAxML (Stamatakis, 2006). Please cite: Silvestro, Michalak (2012) - raxmlGUI: a graphical front-end for RAxML. Organisms Diversity and Evolution 12, 335-337. DOI: 10.1007/s13127-011-0056-0
    Downloads: 16 This Week
    Last Update:
    See Project
  • 23
    This is a sophisticated & integrated simulation and analysis environment for dynamical systems models of physical systems (ODEs, DAEs, maps, and hybrid systems). It supports symbolic math, optimization, continuation, data analysis, biological apps...
    Downloads: 4 This Week
    Last Update:
    See Project
  • 24
    A univariate and multivariate analysis UI. This project is no longer under development. Please use as you wish.
    Leader badge
    Downloads: 15 This Week
    Last Update:
    See Project
  • 25

    MToolBox

    A bioinformatics pipeline to analyze mtDNA from NGS data

    MToolBox is a highly automated bioinformatics pipeline to reconstruct and analyze human mitochondrial DNA from high throughput sequencing data. MToolBox includes an updated computational strategy to assemble mitochondrial genomes from Whole Exome and/or Genome Sequencing (PMID: 22669646) and an improved fragment-classify tool (PMID:22139932) for haplogroup assignment, functional and prioritization analysis of mitochondrial variants. MToolBox provides pathogenicity scores, profiles of genome variability and disease-associations for mitochondrial variants. MToolBox provides also a Variant Call Format file (version 4.0) featuring, for the first time, allele-specific heteroplasmy. Please, check out the most recent updates of the source code with the Github repository of MToolBox: https://github.com/mitoNGS/MToolBox or visit the Web version of MToolBox @ MSeqDR: https://mseqdr.org/mtoolbox.php
    Leader badge
    Downloads: 12 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next
MongoDB Logo MongoDB