Repository of desktops and programs for the Cygwin platform
A large repository of open-source programs built for Cygwin, including X11 desktops, language interpreters, multimedia frameworks, cross-compiler toolchains, and much more. Also hosted here is the cygport tool for building Cygwin packages. cygport releases and the Ports Git repositories are hosted here; Ports packages are available from the website.
Unicode-XML-TEI text/corpus analysis platform
TXM is a free and open-source cross-platform Unicode & XML based text/corpus analysis environment and graphical client, supporting Windows, Linux and Mac OS X. It can also be used online as a J2EE standard compliant web portal (GWT based) with access control built in. DOWNLOAD LATEST VERSION OF TXM : http://textometrie.ens-lyon.fr/spip.php?rubrique61&lang=en TXM offers a comprehensive range of analysis tools (concordances, collocate search, frequency lists, etc.) based on the powerfull CQP full text search engine (http://cwb.sourceforge.net) and a range of statistical functions (factorial analysis, classification, cooccurrency analysis, etc.) based on R packages (http://www.r-project.org). Read the scientific background at the Textométrie project web site http://textometrie.ens-lyon.fr/?lang=en. Read a full description at the TEI Tools wiki http://wiki.tei-c.org/index.php/TXM.
Scheduling lockages at ship locks with several parallel lock chambers
This Java software includes algorithms of combinatorical optimization for the NP-hard offline ship lock scheduling problem. Solutions and performed computations can be displayed graphically. Besides, there is a framework for generating test instances and running these in parallel, as well as R/JGR code for statistical evaluation. Some tools for estimating the quality of calculated solutions will be further improved. Initially the software was developed within a project of TU Berlin regarding the Kiel Canal. See project wiki for conditions that should be met by applications.
R packages for PK/PD modeling, BE/BA, drug stability, ivivc, etc.
These R packages are developed for data analysis of PK/PD modeling, bioequivalence/bioavailability (BE/BA), drug stability, in-vitro and in-vivo correlation (ivivc), as well as therapeutic drug monitoring (TDM). They include bear, ivivc, PKfit, stab and tdm.
IAP - the Integrated Analysis Pipeline
The Integrated Analysis Platform (IAP) has been designed and developed to support the analysis of large-scale image data sets of different camera systems. It aims in bridging different data domains and in integrating different approaches to data analysis and post-processing.
Experiment Design for Differential Abundance Analysis
EDDA is a tool for systematic assessment of the impact of experimental design and the statistical test used on the ability to detect differential abundance. EDDA can aid in the design of a range of common experiments such as RNA-seq, ChIP-seq, Nanostring assays, RIP-seq and Metagenomic sequencing, and enables researchers to comprehensively investigate the impact of experimental decisions on the ability to detect differential abundance. More details of EDDA can be found at Luo, Huaien et al. “The Importance of Study Design for Detecting Differentially Abundant Features in High-Throughput Experiments.” Genome Biology 2014;15(12):527 (http://www.ncbi.nlm.nih.gov/pubmed/25517037/). An accompanying web server (http://edda.gis.a-star.edu.sg/) is available for easy access to some functionality of EDDA. Additionally a Bioconductor package (http://www.bioconductor.org/packages/release/bioc/html/EDDA.html) is available for easy installation of EDDA R package.
GEPETTO (GEne Prioritization ExTended TOol)
GEPETTO (GEne PrioriTization ExTended TOol) is an original open-source framework, distributed under the LGPL license, for gene selection and prioritization on a desktop computer that ensures confidentiality of personal data. It takes advantage of the data integration capabilities in the SM2PH-Central Framework(KD4v,MSV3d,BIRD,..), combined with in-house developed gene prioritization methods. It currently incorporates six prioritization modules, based on gene sequence, protein-protein interactions, gene expression, disease-causing probabilities, genomic context). GEPETTO is written in Java/Python and supported by an advanced modular architecture, which means that it can easily be modified and extended by the user, in order to include alternative scoring methods and new data sources. We intend to extend the system from gene-level to variant-level prioritization, by exploiting the variant data in the MSV3D database. Contact: email@example.com or firstname.lastname@example.org
Tools to analyse and use passport data for biological collections.
A machine learning system for supervised document classification
An open source system for supervised document classification based on statistical machine learning techniques. On the contrary of the state of art classification techniques, MyNook just requires the title of the document, not the content itself.
An R package to normalise, classify and analyses raw AFLP data.
vipR is a program to screen for sequence variants (SNPs, deletions) in sequence data generated by high-throughput-sequencing platforms. Information on this and other projects can be found on: http://www.altmann.eu
Hoea is a python module for hierarchical ontology enrichment analysis, which facilitated GO (Gene Ontology)/KO (KEGG Orthology) enrichment analysis at any desktop.
Open Metaheuristic (oMetah) is a library aimed at the conception and the rigourous testing of metaheuristics (i.e. genetic algorithms, simulated annealing, ...). The code design is separated in components : algorithms, problems and a test report generator
The goal of this project is to develop a Python tool to make an in-depth quantitative analysis about Wikipedia, generating graphics and statistical results for each language version of Wikipedia.
Suite of community detection algorithms based on Modularity
- MixtureModel_v1r1: overlapping community algorithm , which includes novel partition density and fuzzy modularity metrics. - OpenMP versions of algorithms in  are available to download. - Main suite containing three community detection algorithms based on the Modularity measure containing: Geodesic and Random Walk edge Betweenness  and Spectral Modularity . Collaborator: Theologos Kotsos.  M. Newman & M. Girvan, Physical Review, E 69 (026113), 2004.  M. Newman, Physical Review E, 74(3):036104, 2006.  B. Ball et al, An efficient and principled method for detecting communities in networks, 2011. The suite is based upon the fast community algorithm implemented by Aaron Clauset <email@example.com>, Chris Moore, Mark Newman, and the R IGraph library Copyright (C) 2007 Gabor Csardi <firstname.lastname@example.org>. It also makes of the classes available from Numerical Recipies 3rd Edition W. Press, S. Teukolsky, W. Vetterling, B. Flanne
The 'runjags' R package and standalone JAGS extension module
This package provides high-level interface utilities for MCMC models via Just Another Gibbs Sampler (JAGS), facilitating the use of parallel (or distributed) processors for multiple chains, automated control of convergence and sample length diagnostics, and evaluation of the performance of a model using drop-k validation or against simulated data. Template model specifications can be generated using a standard lme4-style formula interface to assist users less familiar with the BUGS syntax. A JAGS extension module provides additional distributions including the Pareto family of distributions, the DuMouchel prior and the half-Cauchy prior.
DuffyRNAseq is an R package that implements an analysis pipeline for processing RNA-seq data from Illumina NextGen sequencers, to measure gene transcription and differential expression.
PAN And Core-gEnome Analysis
A tool to calculate the Pan-Genome of a set of annotated genomes
Classification Algorithm Based on a Bayesian method for Genomics
Xpose - an S/R based population PK/PD model building aid for NONMEM.
A plug-in for the gedit software bundled in the Gnome desktop. It creates a terminal titled "gedit Connected Terminal" at start-up. The plug-in associates copying to that terminal in gedit with the F5 key. This works well with the R interpreter.
An implementation of the Kernel-based Orthogonal Projections to Latent Structures (K-OPLS) method for MATLAB and R. The supplied functionality includes e.g. cross-validation, kernel parameter optimization, model diagnostics and plot tools.
In this very first part of the project OpenSpaceSyntax is a collection of functions in shall scripts oriented to the calculation of classic measures of a space syntax analisys procedure for urban fabrics and based on GIS-GRASS (grass.itc.it).
ALEXA-Seq is a method for using massively parallel paired-end transcriptome sequencing for 'alternative expression analysis'.
Software for visualizing and interpreting NMR data, with an emphasis on metabolomics. Go to the rNMR homepage at http://rnmr.nmrfam.wisc.edu for more details.