jAgg - Java Aggregation Operations
jAgg is a Java 5.0 API that supports “group by” operations on Lists of Java objects: aggregate operations such as count, sum, max, min, avg, and many more. It supports Super Aggregation: Rollups, Cube, and Grouping Sets. It supports analytic operations such as lag/lead and row number and more. It also allows custom aggregate and analytic operations.
Statistical data analysis
Programa multiplataforma de libre distribución para el análisis estadístico y epidemiológico de datos. Free distribution cross-platform program for statistical and epidemiological analysis of data. Sitio web: http://www.sergas.es/Saude-publica/EPIDAT Souceforge: https://sourceforge.net/projects/epidat/ Wikipedia: https://es.wikipedia.org/wiki/Epidat
Implementation in Python of some of the statistical methods provided by "asurv", the survival analysis software.
Utility classes from maps to search engine to random samplers
Collection of several multi-purpose Java libraries. --- knowceans-tools = collection of Java utility classes. --- Highlights: --- org.knowceans.util: IndexQuickSort, TableList: apply order of one array/list to others +++ Vectors, ArrayUtils: array convenience +++ RandomSamplers, CokusRandom, ArmSampler, Densities: random sampling and distributions +++ Arguments: command line parser +++ StopWatch, Which, ExternalProcess: runtime stuff +++ ParallelFor: OpenMP workalike +++ PatternString, NamedGroupRegex: regex convenience --- org.knowceans.corpus: CorpusSearcher: full-text search engine +++ LabelNumCorpus: svmlight corpus storage and filtering +++ NIPS corpus with text, authors, labels and citations --- org.knowceans.map: InvertibleHashMultiMap, BijectiveHashMap: implement n:m and 1:1 relations. --- Other libs: knowceans-arms = port of the Adaptive Rejection Metropolis Sampler (ARMS) for arbitrary distributions +++ lda-j = port of lda-c, implementing Latent Dirichlet Allocation (LDA)
Realtime bigdata tool for bit strings up to 2^63 based on AVL forest
Realtime bigdata tool at the bit level based on immutable AVL forest which can be run in memory or, in future versions, as a merkle forest like a blockchain. Main object is a sparse bit string (Bits) that efficiently scales up to 2^63 bits normally compressed as forest has duplicated substrings. Bits objects support reading bit, byte, short, int, or long (Java primitives) at any bit index in 64 bit range. Example: instead of building a class to hold a header and then data, represent all of that as Bits, subranges of them, and ints for sizes of its parts. Expansion ability for other kinds of compression, since Bits is a Java interface. Main functions on bits are substring, concat, number of 0 or 1 bits, and number of bits (size). All those operations can be done millions of times per second regardless of size because the AVL forest reuses existing branches recursively. Theres a scalar (originally for copy/pasting subranges of sounds) and a bit Java package. Sparse n dimensional matrix.
A Multivariate Distributions Package for C++
A library based on Eigen and Boost to handle common multivariate distributions in C++
Differential Expression Analysis for Pathways
This project contains the source code associated with the PLoS Computational Biology publication: "Differential Expression Analysis for Pathways". The paper text can be found here: http://www.ploscompbiol.org/article/info%3Adoi%2F10.1371%2Fjournal.pcbi.1002967
neural network implementation in java
3-layer neural network for regression and classification with sigmoid activation function and command line interface similar to LibSVM. Quick Start: "java -jar nen.jar"
A statistics extension for Open Office with a feel of SPSS. Primarily meant for students of social sciences as well as other in a need of easy-to-use statistics software.
Graphical tool for data manipulation written in C++/QT.
Volbx is a graphical tool for data manipulation written in C++/QT. User can load data, filter, select, visualize, export and more using few clicks. Volbx reads, along with inner application format, xlsx and ods spreadsheet files. User can save output stream as csv or xlsx and plots as png files. Dynamic and interactive plots can be zoomed, moved, reset to initial state, quickly exported to clipboard and more. Application calculate dynamically such handy values as quantiles, standard deviation, average and more for given sample. This tool was originally created for real estate appraisal to give possibility, for people working in that area, to estimate past, current and future value of analysed properties. Project/start-up was dropped and decision about release of less specialized application was made. Most advanced and dedicated for real estates functionalities were cut but still some are quite useful. Used QT 5.3.2, QWT 6.1.1, QuaZip 0.7, Boost 1.55.0, MinGW 4.8.2
R package for hierarchical species distribution models
hSDM is an R package for hierarchical species distribution models. Such models allows interpreting the observations (occurrence and abundance of a species) as a result of several hierarchical processes including ecological processes (habitat suitability, spatial dependence and anthropogenic disturbance) and observation processes (species detectability). Hierarchical species distribution models are essential for accurately characterizing the environmental response of species, predicting their probability of occurrence, and assessing uncertainty in the model results.
A Matlab toolbox for interfacing with the pure JAVA numerical library Snifflib. This toolbox provides convenience m-files for interoperability with Snifflib from within an active Matlab session running a JAVA virtual machine.