A cross-platform statistical package for econometric analysis
gretl is a cross-platform software package for econometric analysis, written in the C programming language.
SOFA is a statistics, analysis, and reporting program with an emphasis on ease of use, learn as you go, and beautiful output.
Machine Learning Python
mlpy is a Python module for Machine Learning built on top of NumPy/SciPy and of GSL. mlpy provides high-level functions and classes allowing, with few lines of code, the design of rich workflows for classification, regression, clustering and feature selection. mlpy is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License version 3. mlpy is available both for Python >=2.6 and Python 3.X.
Unicode-XML-TEI text/corpus analysis platform
TXM is a free and open-source cross-platform Unicode & XML based text/corpus analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP full text search engine (http://cwb.sourceforge.net) and a range of statistical functions (factorial analysis, classification, cooccurrency analysis, etc.) based on R packages (http://www.r-project.org). Read the scientific background at the Textométrie project web site http://textometrie.ens-lyon.fr/?lang=en. Read a full description at the TEI Tools wiki http://wiki.tei-c.org/index.php/TXM.
This material enables IBM SPSS Statistics users to run code written in the R language inside Statistics. Additional free items for R in Statistics and other materials are available from the SPSS Community at www.ibm.com/developerworks/spssdevcentral
Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout
Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout (MAGeCK) is a computational tool to identify important genes from the recent genome-scale CRISPR-Cas9 knockout screens technology. For instructions and documentations, please refer to the wiki page. MAGeCK is developed and maintained by Wei Li and Han Xu from Dr. Xiaole Shirley Liu's lab at Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard School of Public Health. We thank the support from Claudia Adams Barr Program in Innovative Basic Cancer Research to develop MAGeCK.
Statcato is a Java software application for elementary statistics. Its features include data and graph generation, probability distributions, descriptive statistics, confidence intervals, hypothesis tests, correlation, regression, and analysis of var
R packages for PK/PD modeling, BE/BA, drug stability, ivivc, etc.
These R packages are developed for data analysis of PK/PD modeling, bioequivalence/bioavailability (BE/BA), drug stability, in-vitro and in-vivo correlation (ivivc), as well as therapeutic drug monitoring (TDM). They include bear, ivivc, PKfit, stab and tdm.
A simple programmable spreadsheet for learning statistics.
Myrtle is a simple programmable spreadsheet and statistical analysis software specifically designed for learning statistics. It provides the standard spreadsheet functionality one would expect like multiple tabbed sheets, relative and absolute row and column referencing in formulas, and a large catalog of built-in functions. Functions specific to logic and computer science, mathematics, probability, and statistics are available. Student's can easily create, customize, and update plots and graphical summaries of their analyses. Myrtle offers a unique bookmarking facility which allows students to create and reuse named references to their favorite cell ranges. This can help students focus attention on the important relationships among particular rows or columns of data. Myrtle's graphics and reporting features allow students to report back to their instructors their mastery of course content.
A C++ library for principal component analysis
libpca is a C++ library for principal component analysis and related transformations. It comes with example and unit tests. libpca is successfully tested on Linux and MacOSX using g++ (>=4.6), clang++ (>=3.2), and icc (>=14.0). libpca requires Armadillo (>=3.2.4) which can be obtained as a pre-compiled package on most distributions or directly from http://arma.sourceforge.net. libpca is being developed by Christian Blume. Contact Christian at firstname.lastname@example.org for any questions or comments.
a Small (Matlab/Octave) Toolbox for Kriging
The STK is a (not so) Small Toolbox for Kriging. Its primary focus in on the interpolation / regression technique known as kriging, which is very closely related to Splines and Radial Basis Functions, and can be interpreted as a non-parametric Bayesian method using a Gaussian Process (GP) prior. The STK also provides tools for the sequential and non-sequential design of experiments. Even though it is, currently, mostly geared towards the Design and Analysis of Computer Experiments (DACE), the STK can be useful for other applications areas (such as Geostatistics, Machine Learning, Non-parametric Regression, etc.).
Software tool for Research in Computational Population Genetics
Development of exact and approximate methods (Importance Sampling and MCMC based) for computing likelihoods under the standard population genetic models of mutation,migration & recombination. Project issues are mainatined at https://freecode4susant.atlassian.net/browse/COALESCENT
Tail probability calculator for continuous random variable
A suite of Matlab functions that calculate the tail probability / cdf / pdf / quantile of linear combination of random variables in one of the following classes: (1) symmetric random variables with support on the real axis (normal, Student's t, uniform and triangular); (2) random variables with support on the positive real axis (chi-squared and log-Lambert W x chi-squared distributions; inverse gamma distribution is temporarily disabled due to numerical issues).
MinimPy is a desktop application program for sequential allocation of subjects to treatment groups in clinical trials by using the method of minimisation. Comprehensive reference help is available at: http://minimpy.sourceforge.net For those who have difficulty installing MinimPy, an online version is available at: http://qminim.sourceforge.net MinimPy has been full described in the foolowing article: Saghaei, M. and Saghaei, S. (2011) Implementation of an open-source customizable minimization program for allocation of patients to parallel groups in clinical trials. Journal of Biomedical Science and Engineering, 4, 734-739. doi: 10.4236/jbise.2011.411090. Available at: http://www.scirp.org/journal/PaperInformation.aspx?PaperID=8518
computing f2 bootstrap CI BCA
Computing similarity factor (f2) bootstrap bias corrected and accelerated confidence interval
KinetDS is a software for curve fitting particularly designed for kinetic (mechanistic and empirical) description of a substance dissolution from solid state. It was primarily designed for handling pharmaceutical dissolution tests
FDA's f2 computation with bootstrap technique
This program was developed as a help in establishing pharmaceutical equivalence by use of FDA f2 coefficient. It was designed to help with f2 computation in cases when intra- and inter-batch variability is large, namely RSD>10%. The use of statistical bootstrap technique allows to implement confidence interval (CI) into the f2 coefficients resulting in overcoming of their major drawback in the original metrics. The algorithm provides possible “worst case scenario” of f2 values, thus supporting claim about pharmaceutical equivalence. The target users are researchers from industry and academia dealing with pharmaceutical equivalence problem. The software is Open Source. It was developed in Lazarus environment, therefore source code is available in ObjectPascal.
Free Matlab toolbox to compute robust correlations
The Robust Correlation Toolbox is a free collection of Matlab functions allowing to visualize data in univariate and bivariate space, check assumptions of normality and homoscedasticity and compute Pearson's and Spearman's, percentage bend, and skipped correlations with bootstrapped confidence intervals - see http://www.frontiersin.org/Quantitative_Psychology_and_Measurement/10.3389/fpsyg.2012.00606/full
Math.NET aims to provide a self contained clean framework for symbolic mathematical (Computer Algebra System) and numerical/scientific computations, including a parser and support for linear algebra, complex differential analysis, system solving and more
Maximal Information-based Nonparametric Exploration
The minepy homepage has moved to http://minepy.readthedocs.io. The download page is now at https://github.com/minepy/minepy/releases.
Uranie is CEA's uncertainty analysis platform, based on ROOT
Uranie is a sensitivity and uncertainty analysis plateform based on the ROOT framework (http://root.cern.ch) . It is developed at CEA, the French Atomic Energy Commission (http://www.cea.fr). It provides various tools for: - data analysis - sampling - statistical modeling - optimisation - sensitivity analysis - uncertainty analysis - running code on high performance computers - etc. Thanks to ROOT, it is easily scriptable in CINT (c++ like syntax) and Python. Is is available both for Unix and Windows platforms (a dedicated platform archive is available on request). Note : if you have downloaded version 3.12 before the 8th of february, a patch exists for a minor bug on TOutputFileKey file, don't hesitate to ask us.
A Python package for estimating the statistical impact of features
This package let's you compute the statistical impact of features given a scikit-learn estimator. The computation is based on the mean variation of the difference between quantile and original predictions. The impact is reliable for regressors and binary classifiers. Currently, all features must consist only of pure-numerical, non-categorical values.
Java library of statistical distribution
A Java package that provides routines for various statistical distributions. Based on R version 2.14.1 (continuously updated; current as of R v3.3.0). The major difference is that JDistlib is thread safe. The library contains the density (pdf), cumulative (cdf), quantile, and random number generator (RNG) routines of the following distributions: Ansari, Beta, Binomial, Cauchy, Chi square, Exponential, Fisher's F, Gamma, Geometric, Hypergeometric, Kendall, Logistic, Log normal, Negative binomial, Noncentral beta, Noncentral chi square, Noncentral f, Noncentral t, Normal, Poisson, Sign Rank, Spearman, Student's T, Tukey, Uniform, Weibull, Wilcoxon, and many more. Normality tests, such as: Kolmogorov-Smirnov, Anderson-Darling, Cramer-Von Mises, D'Agostino-Pearson, Jarque Bera, Kolmogorov-Lilliefors, Shapiro-Francia, Shapiro-Wilk. And many others.
Suite of community detection algorithms based on Modularity
- MixtureModel_v1r1: overlapping community algorithm , which includes novel partition density and fuzzy modularity metrics. - OpenMP versions of algorithms in  are available to download. - Main suite containing three community detection algorithms based on the Modularity measure containing: Geodesic and Random Walk edge Betweenness  and Spectral Modularity . Collaborator: Theologos Kotsos.  M. Newman & M. Girvan, Physical Review, E 69 (026113), 2004.  M. Newman, Physical Review E, 74(3):036104, 2006.  B. Ball et al, An efficient and principled method for detecting communities in networks, 2011. The suite is based upon the fast community algorithm implemented by Aaron Clauset <email@example.com>, Chris Moore, Mark Newman, and the R IGraph library Copyright (C) 2007 Gabor Csardi <firstname.lastname@example.org>. It also makes of the classes available from Numerical Recipies 3rd Edition W. Press, S. Teukolsky, W. Vetterling, B. Flanne