A cross-platform statistical package for econometric analysis
gretl is a cross-platform software package for econometric analysis, written in the C programming language.
Unicode-XML-TEI text/corpus analysis platform
TXM is a free and open-source cross-platform Unicode & XML based text/corpus analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://perso.ens-lyon.fr/serge.heiden/txm/files/software/TXM/0.7.7 TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP full text search engine (http://cwb.sourceforge.net) and a range of statistical functions (factorial analysis, classification, cooccurrency analysis, etc.) based on R packages (http://www.r-project.org). Read the scientific background at the Textométrie project web site http://textometrie.ens-lyon.fr/?lang=en. Read a full description at the TEI Tools wiki http://wiki.tei-c.org/index.php/TXM.
Machine Learning Python
mlpy is a Python module for Machine Learning built on top of NumPy/SciPy and of GSL. mlpy provides high-level functions and classes allowing, with few lines of code, the design of rich workflows for classification, regression, clustering and feature selection. mlpy is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License version 3. mlpy is available both for Python >=2.6 and Python 3.X.
Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout
Model-based Analysis of Genome-wide CRISPR-Cas9 Knockout (MAGeCK) is a computational tool to identify important genes from the recent genome-scale CRISPR-Cas9 knockout screens technology. For instructions and documentations, please refer to the wiki page. MAGeCK is developed and maintained by Wei Li and Han Xu from Dr. Xiaole Shirley Liu's lab at Department of Biostatistics and Computational Biology, Dana-Farber Cancer Institute and Harvard School of Public Health. We thank the support from Claudia Adams Barr Program in Innovative Basic Cancer Research to develop MAGeCK.
a Small (Matlab/Octave) Toolbox for Kriging
The STK is a (not so) Small Toolbox for Kriging. Its primary focus in on the interpolation / regression technique known as kriging, which is very closely related to Splines and Radial Basis Functions, and can be interpreted as a non-parametric Bayesian method using a Gaussian Process (GP) prior. The STK also provides tools for the sequential and non-sequential design of experiments. Even though it is, currently, mostly geared towards the Design and Analysis of Computer Experiments (DACE), the STK can be useful for other applications areas (such as Geostatistics, Machine Learning, Non-parametric Regression, etc.).
Uranie is CEA's uncertainty analysis platform, based on ROOT
Uranie is a sensitivity and uncertainty analysis plateform based on the ROOT framework (http://root.cern.ch) . It is developed at CEA, the French Atomic Energy Commission (http://www.cea.fr). It provides various tools for: - data analysis - sampling - statistical modeling - optimisation - sensitivity analysis - uncertainty analysis - running code on high performance computers - etc. Thanks to ROOT, it is easily scriptable in CINT (c++ like syntax) and Python. Is is available both for Unix and Windows platforms (a dedicated platform archive is available on request). Note : if you have downloaded version 3.12 before the 8th of february, a patch exists for a minor bug on TOutputFileKey file, don't hesitate to ask us.
Maximal Information-based Nonparametric Exploration
The minepy homepage has moved to http://minepy.readthedocs.io. The download page is now at https://github.com/minepy/minepy/releases.
Kaplan-Meier for Windows
KMWin (Kaplan-Meier for Windows) is a convenient tool for graphical presentation of results from Kaplan-Meier survival time analysis. The programme is based on the statistical software environment R and provides an easy to use graphical interface. As an introduction, see http://dx.plos.org/10.1371/journal.pone.0038960#s2.
Statistics modules in Perl Data Language, with a quick-start guide for non-PDL people. They make the PDL shell work like R, but with PDL threading (fast automatic iteration) of procedures including t-test, linear regression, and k-means clustering.
Soccer tournament predictions & simulations
Unique football (soccer) World Cup, European Cup and league (e.g. Bundesliga) simulation - based on Elo coefficient method, it calculates probabilities for Championship, Final, Quarter-Final, etc - additional features: single-play prediction, Elo number calculator, odd quota estimation Impress your friends with soccer expertise - just download, install & run!
Library for optimization using a genetic algorithm or particle swarms
libfgen is a library that implements an efficient and customizable genetic algorithm (GA). It also provides particle swarm optimization (PSO) functionality and an interface for real-valued function minimization or model fitting. It is written in C, but can also be compiled with a C++ compiler. Both Linux and Windows are supported.
A collection of compact, powerful programs and functions, written in C, for the analysis and transformation of data. No object or structure dependencies, no fancy interfaces - just good tools.
Linear Programming (LP) has been used in the case of the Federalist Papers whose authorship had been disputed. The task here will be to use the Cell Broadband Engine to do likewise with papers whose authorship is disputed.
Calculates how much time and money you need to win in the lottery. It uses multiple gthreads and gtk2 to sort 1 to 50 numbers until the user wins. Asking for the cost and time beetwin games, it calculates if you or your grandchildren will be rich.
That project aims at studying and comparing typical airline IT methods, for instance RM-related algorithms. It works from a Unix/Linux/Mac command-line, and exposes basic APIs. It is being developed in C++, with Python wrappers for some components.
A population-based method for DNA copy number analysis: recurrent copy number aberration indentification in multiple samples (with no need of single-sample calling). Developed for a quick analysis of high resolution and large population data.
Stand-alone software tool for the interactive CE analysis of microarray data. The software is a user-friendly and allows on-the-fly study of CE
A collection of small utilities with a connection to cryptography and cryptanalysis. Written for efficiency and a small memory footprint.
A command line toolki to solve a problem your favorite program defines
Diagonal can be used for: - getting descriptive statistics such as mean/median/mode with your program producing a sample - finding a root of an equation your program defines - calculating a fixed point of a function your program defines - detecting a cycle of a fuction your program defines as well as - decoding a VCDIFF file
Facinas: Probabilistic Graphical Models is an extensive set of librairies, algorithms and tools for Probabilistic Inference and Learning and Reasoning under uncertainty. It implements all sort of Probabilistic Graphical Models using discrete and continuous distributions.
FastPval is multiple stage p-value computing software that computes empirical p-values from a large set of permutated/resampled background data.
GpaNom is a simple command line GPA calculator, written in C, with the goal of being fast and precise.
A versatile MCMC and downhill optimization engine
Hrothgar is a parallel minimizer and Markov Chain Monte Carlo generator by Andisheh Mahdavi of San Francisco State University. It has been used to solve optimization problems in astrophysics (galaxy cluster mass profiles) as well as in experimental particle physics (hadronic tau decays). It is probably adaptable enough to be applied to your merit function if you can write it in C.
Unix/Linux math calculator
An easy, small and handy math calculator for Unix/Linux systems. It can calculate easy and complex mathematical expressions passed as command line arguments.
Software for speech research. It includes programs and libraries for signal processing, along with general purpose scientific libraries. Most of the code is in Python, with C/C++ supporting code. Also, contains code releases corresponding to publishe