Unicode-XML-TEI text/corpus analysis platform
TXM is a free and open-source cross-platform Unicode & XML based text/corpus analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://perso.ens-lyon.fr/serge.heiden/txm/files/software/TXM/0.7.7 TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP full text search engine (http://cwb.sourceforge.net) and a range of statistical functions (factorial analysis, classification, cooccurrency analysis, etc.) based on R packages (http://www.r-project.org). Read the scientific background at the Textométrie project web site http://textometrie.ens-lyon.fr/?lang=en. Read a full description at the TEI Tools wiki http://wiki.tei-c.org/index.php/TXM.
R packages for PK/PD modeling, BE/BA, drug stability, ivivc, etc.
These R packages are developed for data analysis of PK/PD modeling, bioequivalence/bioavailability (BE/BA), drug stability, in-vitro and in-vivo correlation (ivivc), as well as therapeutic drug monitoring (TDM). They include bear, ivivc, PKfit, stab and tdm.
OpenM++: open source microsimulation platform
OpenM++ is an open source microsimulation platform inspired by and compatible with Modgen. OpenM++, compared to its closed source predecessor Modgen, has advantages like portability, scalability and open source. It is not a copy of Modgen, but a new, functionally equivalent implementation of the publicly available language specification.
R package for modelling anthropogenic deforestation
phcfM is an R package for modelling anthropogenic deforestation. It was named after the REDD+ pilot-project 'programme holistique de conservation des forêts à Madagascar'. phcfM includes two main functions: (i) demography(), to model the population growth with time in a hierarchical Bayesian framework using population census data and Gaussian linear mixed models and (ii) deforestation(), to model the deforestation process in a hierarchical Bayesian framework using land-cover change data and Binomial logistic regression models with variable time-intervals between land-cover observations. The two functions use embedded Gibbs samplers written in C++ with the Scythe statistical library to reduce computational time.
Kaplan-Meier for Windows
KMWin (Kaplan-Meier for Windows) is a convenient tool for graphical presentation of results from Kaplan-Meier survival time analysis. The programme is based on the statistical software environment R and provides an easy to use graphical interface. As an introduction, see http://dx.plos.org/10.1371/journal.pone.0038960#s2.
The 'runjags' R package and standalone JAGS extension module
This package provides high-level interface utilities for MCMC models via Just Another Gibbs Sampler (JAGS), facilitating the use of parallel (or distributed) processors for multiple chains, automated control of convergence and sample length diagnostics, and evaluation of the performance of a model using drop-k validation or against simulated data. Template model specifications can be generated using a standard lme4-style formula interface to assist users less familiar with the BUGS syntax. A JAGS extension module provides additional distributions including the Pareto family of distributions, the DuMouchel prior and the half-Cauchy prior.
BigBang/Horizon is a proteomics data analysis pipeline with focus on the shotgun LC/MSMS workflow.
A population-based method for DNA copy number analysis: recurrent copy number aberration indentification in multiple samples (with no need of single-sample calling). Developed for a quick analysis of high resolution and large population data.
Suite of community detection algorithms based on Modularity
- MixtureModel_v1r1: overlapping community algorithm , which includes novel partition density and fuzzy modularity metrics. - OpenMP versions of algorithms in  are available to download. - Main suite containing three community detection algorithms based on the Modularity measure containing: Geodesic and Random Walk edge Betweenness  and Spectral Modularity . Collaborator: Theologos Kotsos.  M. Newman & M. Girvan, Physical Review, E 69 (026113), 2004.  M. Newman, Physical Review E, 74(3):036104, 2006.  B. Ball et al, An efficient and principled method for detecting communities in networks, 2011. The suite is based upon the fast community algorithm implemented by Aaron Clauset <email@example.com>, Chris Moore, Mark Newman, and the R IGraph library Copyright (C) 2007 Gabor Csardi <firstname.lastname@example.org>. It also makes of the classes available from Numerical Recipies 3rd Edition W. Press, S. Teukolsky, W. Vetterling, B. Flanne
An R Package for Environmental Statistics
EnvStats is an R package for environmental statistics. It is the open-source successor to the commercial module for S-Plus© called "EnvironmentalStats for S-Plus", which was first released in April, 1997. The EnvStats package, along with the R software environment, provides comprehensive and powerful software for environmental data analysis. EnvStats brings the major environmental statistical methods found in the literature and regulatory guidance documents into one statistical package, along with an extensive hypertext help system that explains what these methods do, how to use these methods, and where to find them in the environmental statistics literature. Also included are numerous built-in data sets from regulatory guidance documents and the environmental statistics literature. EnvStats combined with other R packages (e.g., for spatial analysis) provides the environmental scientist, statistician, researcher, and technician with tools to “get the job done!”
Monte Carlo permutation method for SNP multiple test correlation
MCPerm: A Monte Carlo permutation method for multiple test correlation in case-control association study Traditional permutation (TradPerm) test is an important non-parametric analysis method which can be treated as the gold standard for multiple testing corrections in case-control association study. However, it relies on the original single nucleotide polymorphism (SNP) genotypes and phenotypes data to perform a large number of random shuffles, and thus it is computationally intensive, especially for genome-wide association study (GWAS). To improve the calculation speed without changing the size of the TradPerm p-value, we developed a Monte Carlo permutation (MCPerm) method as an efficient alternative to TradPerm. Methods: MCPerm does not need to shuffle the original genotypes and phenotypes data. It uses Monte Carlo method, employs two-step hypergeometric distribution to generate the random number of genotypes (AA, Aa and aa) in cases and controls.
R packages supporting parallel computing.
Sistema Estadístico basado en R
Priotelus (Sistema Estadístico basado en R) es un software libre para realizar análisis estadísticos utilizando R. Su interfaz limpia y sus potentes funcionalidades tienen como objetivo facilitarle su trabajo, enfocándose cada vez más en estudios completos y evitar la realización de muchos cálculos parciales para poder completar un estudio. Los análisis estadísticos se realizan utilizando el lenguaje y entorno de programación para análisis estadístico y gráfico, R, con más de 6000 paquetes desarrollados y avalados por su amplia comunidad científica. Funcionalidades: -Carga de datos desde los formatos XLS, CSV y TXT. -Exportación de datos hacia los formatos XLS, CSV, TXT y PDF. -Construcción y edición de tablas de datos. -Salva de un proyecto con el estado actual de nuestro trabajo. -Cifrado de un proyecto utilizando el método AES con una clave de 256 bits. -Graficado de datos, utilizando diferentes tipos de gráficas.
computing f2 bootstrap CI BCA
Computing similarity factor (f2) bootstrap bias corrected and accelerated confidence interval
An R package implementation of a consensus clustering methodology. This package allows users to perform re-sampling statistics based clustering using multiple clustering algorithms to assess the robustness of both clusters and members of clusters.
R package for hierarchical species distribution models
hSDM is an R package for hierarchical species distribution models. Such models allows interpreting the observations (occurrence and abundance of a species) as a result of several hierarchical processes including ecological processes (habitat suitability, spatial dependence and anthropogenic disturbance) and observation processes (species detectability). Hierarchical species distribution models are essential for accurately characterizing the environmental response of species, predicting their probability of occurrence, and assessing uncertainty in the model results.
Tools to analyse and use passport data for biological collections.
R interface to the Corpus Query Protocol
Implements the Corpus Query Protocol as a package for the R statistical environment. It allows to query linguistic corpora and manipulate the data as native R objects. It is based on the CWB software.
The TreeRank project is a R package implementing a Machine Learning algorithm to build tree-based ranking rules from data with binary labels, based on ROC optimization.