IRAMUTEQ (Interface de R pour les Analyses Multidimensionnelles de Textes et de Questionnaires): an R interface for multidimensional analysis of texts and questionnaires. Data-processing software for text corpora or individuals/characters tables. Notably supports "ALCESTE"-type analyses.
SimpleITK: a simplified layer built on top of the Insight Toolkit (ITK), intended to facilitate its use in rapid prototyping, education and interpreted languages.
Repository of desktops and programs for the Cygwin platform
A large repository of open-source programs built for Cygwin, including X11 desktops, language interpreters, multimedia frameworks, cross-compiler toolchains, and much more. Also hosted here are the cygport tool for building Cygwin packages, its releases, and the Ports Git repositories; Ports packages are available from the website.
Bio7 is an integrated development environment for ecological modelling and contains powerful tools for model creation, scientific image analysis and statistical analysis. The application is based on an RCP-Eclipse-Environment (Rich-Client-Platform), whose plug-in structure and customization options offer great flexibility in configuration and extensibility.
Java-based scientific graphics
Java-based scientific graphics with support for Java, Groovy, MATLAB, Python, the R statistical environment, Scala and SciLab.
Copy Number Analysis for Targeted Resequencing (CONTRA) is a tool for copy number variation (CNV) detection in targeted resequencing data, such as whole-exome capture data.
A multi-modeling and simulation environment to study complex systems
VLE is a multi-modeling and simulation environment to study complex dynamic systems. VLE is based on the discrete event specification DEVS and implements the DSDE formalism (a merge of Dynamic Structure DEVS, DSDEVS, with Parallel DEVS, PDEVS). VLE provides a complete set of C++ libraries, called VFL (VLE Foundation Libraries), to develop DEVS models, retrieve simulation results, and launch simulations on a cluster. Models can be developed with the DEVS formalism or with classical mathematical formalisms: ordinary differential equations with Euler, Runge-Kutta or QSS integrators, and finite state automata (FDDEVS, UML state chart, hybrid Petri net). The VLE environment provides an IDE to develop C++ models and DEVS coupled models. VLE also has three ports for using the VFL from the Python, Java and R programming languages.
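As a rough illustration of the simplest integrator named above, the following Python sketch applies fixed-step explicit Euler to an ordinary differential equation. It does not use VLE or the VFL; the function names and the decay model are assumptions chosen only for the example.

```python
def euler_integrate(f, y0, t0, t1, steps):
    """Integrate dy/dt = f(t, y) from t0 to t1 with fixed-step explicit Euler.

    Returns the list of (t, y) points visited, including the initial one.
    """
    if steps <= 0:
        raise ValueError("steps must be positive")
    h = (t1 - t0) / steps
    t, y = t0, y0
    trajectory = [(t, y)]
    for i in range(steps):
        y = y + h * f(t, y)      # Euler update: follow the slope for one step
        t = t0 + (i + 1) * h     # recompute t from the index to limit drift
        trajectory.append((t, y))
    return trajectory


def decay(t, y):
    """Hypothetical test model: exponential decay dy/dt = -0.5 * y."""
    return -0.5 * y


if __name__ == "__main__":
    points = euler_integrate(decay, y0=1.0, t0=0.0, t1=4.0, steps=400)
    t_end, y_end = points[-1]
    print(f"y({t_end:.1f}) is approximately {y_end:.4f}")
```

Explicit Euler is first-order accurate, so halving the step size roughly halves the error; Runge-Kutta schemes trade more function evaluations per step for higher accuracy.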
Aberration detection in tumour exome
Aberration Detection in Tumour Exome (ADTEx) is a tool for copy number variation (CNV) detection for whole-exome data from paired tumour/matched normal samples.
A graphical user interface to R for use in Clinical Chemistry
This project is a graphical user interface to the R statistical programming language designed for use in Clinical Chemistry. It allows the user to perform Passing-Bablok, Deming and linear regression and to produce high-quality images in any file format for publication. The front end is written in Python 3.3 and PyQt4, and the form was designed using Qt4 Designer. The statistical analysis is written in R. The compiled binary was made with cx_freeze. This software is free and open-source. It is released under the GNU Public License and comes with absolutely no warranty.
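For readers unfamiliar with Deming regression, the sketch below shows the standard closed-form estimate in plain Python. It is not the project's R code; the function name `deming_fit` and the parameter `delta` (the assumed ratio of the y-error variance to the x-error variance) are illustrative choices.

```python
import math


def deming_fit(x, y, delta=1.0):
    """Return (slope, intercept) of a Deming regression of y on x.

    delta is the assumed ratio of the measurement-error variance in y
    to that in x; delta = 1.0 gives orthogonal regression.
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length (at least 2)")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    # Sample variances and covariance.
    sxx = sum((xi - mean_x) ** 2 for xi in x) / (n - 1)
    syy = sum((yi - mean_y) ** 2 for yi in y) / (n - 1)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y)) / (n - 1)
    if sxy == 0:
        raise ValueError("x and y are uncorrelated; slope is undefined")
    diff = syy - delta * sxx
    slope = (diff + math.sqrt(diff ** 2 + 4 * delta * sxy ** 2)) / (2 * sxy)
    intercept = mean_y - slope * mean_x
    return slope, intercept


if __name__ == "__main__":
    # Hypothetical paired measurements from two methods.
    method_a = [1.0, 2.0, 3.0, 4.0, 5.0]
    method_b = [1.1, 2.0, 3.2, 3.9, 5.1]
    print(deming_fit(method_a, method_b))
```

Unlike ordinary linear regression, this estimate allows for measurement error in both variables, which is why it is common in method-comparison studies.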
Library for creating and manipulating graphs
This is a library for creating and manipulating graphs, with a focus on fast operations on large, sparse graphs.
Bayesian estimation of diversification rates
BayesRate is a program to estimate speciation and extinction rates from dated phylogenies in a Bayesian framework. The methods are described in: Silvestro, D., Schnitzler, J. and Zizka, G. (2011) A Bayesian framework to estimate diversification rates and their variation through time and space. BMC Evolutionary Biology, 11, 311; and Silvestro, D., Zizka, G. and Schulte, K. (2014) Disentangling the effects of key innovations on the diversification of Bromelioideae (Bromeliaceae). Evolution, 68, 163-175.
Metaboflux is a generic approach for predicting flux distribution in metabolic networks under multiple constraints derived from experiments, to increase the biological relevance of the model.
TCP Experiment Automation Controlled Using Python
TEACUP automates many aspects of running TCP performance experiments in a specially-constructed physical testbed. TEACUP enables repeatable testing of different TCP algorithms over a range of emulated network path conditions, bottleneck rate limits and bottleneck queuing disciplines. TEACUP utilises a text-based configuration file to define experiments as combinations of parameters specifying desired network path and end host conditions. When multiple values are provided (e.g. for TCP congestion control algorithm), an experiment is made up of multiple tests. For each experiment and test, TEACUP collects a range of data, such as tcpdump files of traffic seen or TCP stack information (e.g. using Web10G). TEACUP also collects a variety of metadata from the end hosts and bottleneck router, such as the actual OS/kernel version(s) used. TEACUP also provides some simple tools for analysing the results of experiments, such as plotting a flow's experienced RTT over time.
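The expansion of one experiment into multiple tests, as described above, amounts to taking every combination of the listed parameter values. The Python sketch below illustrates the idea only; the parameter names and values are hypothetical and do not reproduce TEACUP's actual configuration syntax.

```python
from itertools import product

# Hypothetical experiment definition: each parameter lists the values to test.
parameters = {
    "tcp_cc": ["newreno", "cubic"],   # congestion control algorithms
    "delay_ms": [10, 50],             # emulated one-way path delay
    "rate_mbit": [10],                # bottleneck rate limit
    "aqm": ["pfifo", "codel"],        # bottleneck queuing discipline
}


def expand_tests(params):
    """Return one dict per test, covering every combination of values."""
    names = list(params)
    value_lists = [params[name] for name in names]
    return [dict(zip(names, combo)) for combo in product(*value_lists)]


if __name__ == "__main__":
    tests = expand_tests(parameters)
    for number, test in enumerate(tests, start=1):
        print(number, test)
```

With the values above, the single experiment expands into 2 x 2 x 1 x 2 = 8 tests.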
A Python library bringing together utilities I've written over the years for work in bioinformatics and biostatistics, which should be generally applicable outside these fields as well. Focuses on string processing, DBI, and math.
Bayesian Estimation of Speciation and Extinction from Fossil Data
PyRate is a Python program to estimate speciation, extinction, and preservation rates from fossil occurrence data using a Bayesian framework. The method was described by D Silvestro, J Schnitzler, LH Liow, A Antonelli, and N Salamin in Systematic Biology (http://sysbio.oxfordjournals.org/content/early/2014/02/08/sysbio.syu006.abstract). Please download the most up-to-date code from the "PyRate code" tab on this page or from https://github.com/dsilvestro/PyRate. An updated manual can be found at https://github.com/dsilvestro/PyRate/wiki.
Syzygy is a toolkit for the analysis of targeted (pooled/individual) resequencing datasets.
Non-Coding RNA PROfiling from sRNA-seq
ncPRO-seq is a tool for annotation and profiling of ncRNAs from small RNA sequencing data. It aims to interrogate and perform detailed analysis on small RNAs derived from annotated non-coding regions in miRBase, piRBase, Rfam and repeatMasker, and regions defined by users. The ncPRO pipeline also has a module to identify regions significantly enriched with short reads that cannot be classified as known ncRNA families. Docker version: download and run the Dockerfile (see the "Files" section). GitHub: https://github.com/jbrayet/ncpro-seq
A collection of bioinformatics-related software created by members of Cancer Research UK (possibly among others).
ConDEnSE (Confidential Data Enabled Statistical Exploration) will be a web-based environment for statistical analysis of confidential data from various database sources, based on Plone and R, and using the Jackknife method of confidentiality protection.
Differential Expression Analysis for Pathways
This project contains the source code associated with the PLoS Computational Biology publication: "Differential Expression Analysis for Pathways". The paper text can be found here: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002967
Django-dataplot enables developers using the Django web framework to seamlessly integrate data-driven graphical plots into their web pages.
Data analysis and circular statistics with OpenElectrophy and R
Set of tools for basic analysis of electrophysiological data. The Python classes show how to call OpenElectrophy functions and save data. The R library applies circular statistics to spike phase data and saves the best von Mises fit and the Rayleigh statistics to disk. The wavelet coherence analysis is done in R with the package "sowas". Check the module R_coherence to see how we solved that problem. These packages may be useful for people who are starting to use OpenElectrophy and circular statistics in R. If you find errors, please report them. The project is still in development, so updates will follow.
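To give a sense of the Rayleigh statistics mentioned above, here is a minimal Python sketch of the Rayleigh test for non-uniformity of phases. It is not the project's R library; the function name is an assumption, and the p-value uses only the first-order large-sample approximation exp(-Z).

```python
import cmath
import math


def rayleigh_test(phases):
    """Rayleigh test on a sequence of phases given in radians.

    Returns (r_bar, z, p_approx, mean_phase), where r_bar is the mean
    resultant length, z = n * r_bar**2 is the Rayleigh statistic, and
    p_approx = exp(-z) is a first-order approximation of the p-value.
    """
    n = len(phases)
    if n == 0:
        raise ValueError("at least one phase is required")
    # Sum the phases as unit vectors in the complex plane.
    resultant = sum(cmath.exp(1j * phase) for phase in phases)
    r_bar = abs(resultant) / n
    z = n * r_bar ** 2
    p_approx = math.exp(-z)
    mean_phase = cmath.phase(resultant)  # circular mean direction
    return r_bar, z, p_approx, mean_phase


if __name__ == "__main__":
    # Hypothetical spike phases clustered around 0.3 rad.
    spike_phases = [0.1, 0.25, 0.3, 0.35, 0.5, 0.2, 0.4]
    print(rayleigh_test(spike_phases))
```

A mean resultant length near 1 indicates tightly locked phases, while a value near 0 is consistent with phases spread uniformly around the circle.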
Weather and Climate Risk Management
GEPETTO (GEne Prioritization ExTended TOol)
GEPETTO (GEne PrioriTization ExTended TOol) is an original open-source framework, distributed under the LGPL license, for gene selection and prioritization on a desktop computer that ensures confidentiality of personal data. It takes advantage of the data integration capabilities of the SM2PH-Central Framework (KD4v, MSV3d, BIRD, ...), combined with gene prioritization methods developed in-house. It currently incorporates six prioritization modules, based on gene sequence, protein-protein interactions, gene expression, disease-causing probabilities, genomic context. GEPETTO is written in Java/Python and built on a modular architecture, so the user can easily modify and extend it to include alternative scoring methods and new data sources. We intend to extend the system from gene-level to variant-level prioritization by exploiting the variant data in the MSV3d database. Contact: email@example.com or firstname.lastname@example.org
Ready-to-use appliance for GIS, Data Mining and Analysis, Modelling...
GISBox by Citiviz is a ready-to-use virtual appliance containing an assortment of powerful, free and open source GIS, Data Mining and Analysis, Modelling and Programming software and packages. The pre-configured and tested GISBox lets you try out, within a few minutes, a variety of software used by data scientists, engineers, physicists, mathematicians, geomatics engineers, and geographers from around the globe, without installing anything other than the Oracle VirtualBox™ software (https://www.virtualbox.org/) on your Linux, Windows or Mac computer. The GISBox is free and open, so it can be freely distributed, duplicated and passed around, even with your own datasets and in-house applications. If you are reading this and are affiliated with the Ecole polytechnique de Lausanne - EPFL (http://www.epfl.ch) in Switzerland, please use this link for downloading or support information: http://gisbox.epfl.ch. Happy (geo)Data Hacking! Brought to you by Citiviz' Builders