A file format for exchanging computational models in systems biology
The Systems Biology Markup Language (SBML) is an XML-based description language for representing computational models in systems biology. Visit the project web site to learn more.
The Sashimi project hosts the Trans-Proteomic Pipeline (TPP), a mature suite of tools for mass-spec (MS, MS/MS) based proteomics: statistical validation, quantitation, visualization, and converters from raw MS data to the open mzML/mzXML formats.
An open source framework for LC-MS based proteomics and metabolomics. OpenMS offers data structures and algorithms for the processing of mass spectrometry data. The library is written in C++.
CBFLIB is a library of ANSI-C functions providing a simple mechanism for accessing Crystallographic Binary Files (CBF files) and Image-supporting CIF (imgCIF) files. The CBFLIB API is loosely based on the CIFPARSE API for mmCIF files.
A Fast, OpenSource Program for Docking Ligands to Proteins and N.Acids
rDock is a fast and versatile Open Source docking program that can be used to dock small molecules against proteins and nucleic acids. It is designed for High Throughput Virtual Screening (HTVS) campaigns and Binding Mode prediction studies. rDock is mainly written in C++ and accessory scripts and programs are written in C++, perl or python languages. The full rDock software package requires less than 50 MB of hard disk space and it is compilable in all Linux computers. Thanks to its design...
The ProteoWizard Library is a set of software libraries and tools for rapid development of proteomics data analysis software. The libraries are cross-platform and built from the ground up using modern C++ techniques.
Suite of community detection algorithms based on Modularity
... Review E, 74(3):036104, 2006.  B. Ball et al, An efficient and principled method for detecting communities in networks, 2011. The suite is based upon the fast community algorithm implemented by Aaron Clauset <email@example.com>, Chris Moore, Mark Newman, and the R IGraph library Copyright (C) 2007 Gabor Csardi <firstname.lastname@example.org>. It also makes of the classes available from Numerical Recipies 3rd Edition W. Press, S. Teukolsky, W. Vetterling, B. Flanne
Antimony is a human-readable, human-writable model definition language
Antimony is a human-readable, human-writable model definition language. libAntimony is a library that will read and write Antimony and SBML files and provides an API for other programs to import Antimony models into their own internal formats.
RoadRunner represents a state-of-the-art simulation library for SBML models. It is fast, portable and accurate. This projects hosts the library and tools using the library.
Several RNA and protein structure volume calculation tools.
Phaistos is a framework for all-atom Monte Carlo simulations of proteins. It incorporates several advanced probabilistic models of protein structure for conformational sampling, efficient move-algorithms and the OPLS and PROFASI forcefields.
Generation of molecular formulas by high-resolution MS and MS/MS data
This is an open source implementation of molecular formula calculation by high resolution MS and MS/MS data as described in M. Meringer, S. Reinker, J. Zhang, A. Muller: MS/MS Data Improves Automated Determination of Molecular Formulas by Mass Spectrometry. MATCH Commun. Math. Comput. Chem. 65, 259-290, 2011. The software user manual is available here: https://www.researchgate.net/publication/307964728_MOLGEN-MSMS_Software_User_Manual Example MS and MS/MS data files for GenForm can be...
C++ header only library with AI and bioinformatics algorithms
C++ header only library, small and fast; Naive Bayesian Classifier, Decision Tree Classifier (ID3), DNA/RNA nucleotide second structure predictor, timeseries management, timeseries prediction, generic Evolutionary Algorithm, generic Hill Climbing algorithm and others.
Parallel pairwise correlation computation on Intel Xeon Phi clusters
The first parallel and distributed library for pairwise correlation/dependence computation on Intel Xeon Phi clusters. This library is written in C++ template classes and achieves high speed by exploring the SIMD-instruction-level and thread-level parallelism within Xeon Phis as well as accelerator-level parallelism among multiple Xeon Phis. To facilitate balanced workload distribution, we have proposed a general framework for symmetric all-pairs computation by building provable bijective...
Saguaro Genome-Wide is a program to detect signatures of selection within populations, strains, or species. It takes SNPs or nucleotides as input, and creates statistical local phylogenies for each region in the genome.
polyglot language framework to analyze genetic data
This software solves the Phylogeny problem using Median Networks. Exact and heuristic methods available. Input: an alignment and a reversible stepmatrix, output: (sub)optimal trees and their combinations. Steiner problem solver included.
Analyses and Interactive visualizations of the structures, functions and actions of biomolecules including molecular surface computation, and protein-ligand interface and protein-ligand docking where the ligand could a small compound, a nucleic acid, membrane and other proteins, written in C++/Qt/openGL/GLSL with more than 125,000 lines of codes. In addition to Qt, the only external library needed is GSL everything else is coded from the scratch.
Multi-technology, INDEL aware error correction for NGS data
MuffinEC is an error correction software capable of handling all types of errors (insertion deletions, mismatches and unknown bases). It officially supports four technologies (Illumina, 454, ion Torrent and PacBio - experimental) and it also has a generic setup for others (old and/or new). It is released under LGPL version 3.0. MuffinEC can use multicore systems, thanks to its OpenMP implementation. We are developing the 2nd version of MuffinEC. The beta version is already available online....
Extremely fast NGS read counter
... bam_file(s) > output Requires bamtools API library at run time, and c++0x for compile. git clone https://github.com/pezmaster31/bamtools cd bamtools mkdir build cd build cmake .. make export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:path to/lib g++ -I bamtools/include/ -L bamtools/lib/ -o fast_count_multi fast_count_multi.cpp -lz -lbamtools -fpermissive -pthread -std=c++0x
Multiscale Neuroscience and Systems Biology Simulator
Moose is the core of a modern software platform for the simulation of neural systems ranging from subcellular components and biochemical reactions to complex models of single neurons, large networks, and systems-level processes. We have moved Github.com. This should be your source for the latest version of the code.
Application to estimate haplotypes and other genetics data, based on maximum likehood approach. It can handle present-absent as well as multiallelic loci.
Standard Bioinformatics Library
set of classes used to handle input/outputs of bioinformatics data files and basic operations as multiple alignment and phylogenetic reconstruction.
The General Hidden Markov Model Library (GHMM) is a C library with additional Python bindings implementing a wide range of types of Hidden Markov Models and algorithms: discrete, continous emissions, basic training, HMM clustering, HMM mixtures.