1. Create an object-oriented python script that can represent mathematical concepts and their properties. 2. Represent all numeric values exactly. 3. Provide a variety of formats to export or embed representations of the mathematical concepts.
MooGraph is an utility to produce interactive dynamical graphs from statistical data. Deliberately inspired to GapMinder, it represents the evolution of multivariate data in time by means of coloured bubbles of variable sizes on an (x,y) plot.
An R package to normalise, classify and analyses raw AFLP data.
Improved base-calling for homopolymer-sensitive next-gen data.
The current software contains the implementation for the 454 pyrosequencing platform.
Realtime bigdata tool for bit strings up to 2^63 based on AVL forest
Realtime bigdata tool at the bit level based on immutable AVL forest which can be run in memory or, in future versions, as a merkle forest like a blockchain. Main object is a sparse bit string (Bits) that efficiently scales up to 2^63 bits normally compressed as forest has duplicated substrings. Bits objects support reading bit, byte, short, int, or long (Java primitives) at any bit index in 64 bit range. Example: instead of building a class to hold a header and then data, represent all of that as Bits, subranges of them, and ints for sizes of its parts. Expansion ability for other kinds of compression, since Bits is a Java interface. Main functions on bits are substring, concat, number of 0 or 1 bits, and number of bits (size). All those operations can be done millions of times per second regardless of size because the AVL forest reuses existing branches recursively. Theres a scalar (originally for copy/pasting subranges of sounds) and a bit Java package. Sparse n dimensional matrix.
A Matlab software routine to perform Principal Component Analysis using Covariance, Correlation or Comedian as the criterion. Though, initially developed for experiments related to fretting wear but can be effectively used to interpret experimental data from any field. The attached files contain source code as well as a sample MATLAB (.mat) data file of 13 variables. It could be replaced to the data file of your choice. The code is open source but you are requested to give credits if used. Additionally, it also has some useful functions for exporting and generating publication quality figures for different kind of figures in MATLAB
Statastizard(c)2012 Nick Emblow is a program designed to guide its user into the right statistical test, perform the test, and report the results in a meaningful way. The idea of this is that the user won't actually need to know that they are using a mann-whitney U over a t-test, for example - it just tests for normality and homoscedastic of variance, and decides which test to use, then reports the results. Why? Because time after time, people come to me with some test that they've done that they think "proves" something - when in reality they've either not done a comparative test to start with (ex. a correlation), failed to test for confounding factors (such as normality etc) and as a result, failed to appropriately identify even the right test to do.
A statistics extension for Open Office with a feel of SPSS. Primarily meant for students of social sciences as well as other in a need of easy-to-use statistics software.
approximate Bayesian computation for stochastic differential equations
A MATLAB toolbox for approximate Bayesian computation (ABC) in stochastic differential equation models. It performs approximate Bayesian computation for stochastic models having latent dynamics defined by stochastic differential equations (SDEs) and not limited to the "state-space" modelling framework. Both one- and multi-dimensional SDE systems are supported and partially observed systems are easily accommodated. Variance components for the "measurement error" affecting the data/observations can be estimated. A 50-pages Reference Manual is provided with two case-studies implemented and discussed. The methodology is based on the research article available at http://arxiv.org/abs/1204.5459 Author's research page is http://www.maths.lth.se/matstat/staff/umberto/
Creates a data density plot of a 2 dimensional data distribution.
Shows the data density of a 2 dimensional distribution. The problem of showing data density visually is not mathematically well defined, and there are several methods. The program uses sum of reciprocal squared distance to calculate density at each point, with a smear factor to prevent points going to infinity. The smear factor also controls the amount of clustering. There are several options for colour output. Input is via a csv (comma-separated values) file. Now there's a nice GUI built in Baby X for Linux and Windows
The final build of this software now is distributed in R, embedded in "RedeR': an R/Bioconductor package for hierarchical and nested network analysis... more about RedeR: http://bioconductor.org/packages/2.9/bioc/html/RedeR.html
The fantail machine learning toolkit (Moved)
Moved to https://github.com/quansun/fantail-ml
A Java library to model and fit ARTA processes.
Utility classes from maps to search engine to random samplers
Collection of several multi-purpose Java libraries. --- knowceans-tools = collection of Java utility classes. --- Highlights: --- org.knowceans.util: IndexQuickSort, TableList: apply order of one array/list to others +++ Vectors, ArrayUtils: array convenience +++ RandomSamplers, CokusRandom, ArmSampler, Densities: random sampling and distributions +++ Arguments: command line parser +++ StopWatch, Which, ExternalProcess: runtime stuff +++ ParallelFor: OpenMP workalike +++ PatternString, NamedGroupRegex: regex convenience --- org.knowceans.corpus: CorpusSearcher: full-text search engine +++ LabelNumCorpus: svmlight corpus storage and filtering +++ NIPS corpus with text, authors, labels and citations --- org.knowceans.map: InvertibleHashMultiMap, BijectiveHashMap: implement n:m and 1:1 relations. --- Other libs: knowceans-arms = port of the Adaptive Rejection Metropolis Sampler (ARMS) for arbitrary distributions +++ lda-j = port of lda-c, implementing Latent Dirichlet Allocation (LDA)
External plugins for modnlp/teccli
This is a general project for modnlp/teccli plugins, with focus on text visualizaton.
Implementation in Python of some of the statistical methods provided by "asurv", the survival analysis software.
snlanalytic is a small Python script that takes a stem-and-leaf plot as input and returns basic statistics (sum, mean, median, mode) to the user.
Worst Cases is a python script that enables calculations on worst case objects. Like a calculator operating on worst cases (e.g. [1,2,3]) rather than on real numbers (2.35) WARNING: This program is released as is, and has never been extensively teste