SciDAVis is a user-friendly data analysis and visualization program primarily aimed at high-quality plotting of scientific data. It strives to combine an intuitive, easy-to-use graphical user interface with powerful features such as Python scriptability.
Technical Analysis Library
Technical analysis library with indicators like ADX, MACD, RSI, Stochastic, TRIX... This is not an end-user GUI trading or charting application. It is instead targeted to application developers using either Excel, .NET, Mono, Java, Perl, Python or C/C++.
IRAMUTEQ : Interface de R pour les Analyses Multidimensionnelles de Textes et de Questionnaires. Logiciel de traitement de données pour des corpus texte ou de type individus/caractères. Permet notamment de réaliser des analyses de type "ALCESTE"
QtiPlot is a user-friendly, platform independent data analysis and visualization application similar to the non-free Windows program Origin.
GDL - GNU Data Language, a free IDL (Interactive Data Language, see http://ittvis.com/idl/) compatible incremental compiler.
This is a Python-based efficient implementation of several semantic similarity measures. The target is to enable fast and easy calculation of similarity between proteins and genes using the Gene Ontology (GO).
Everyday Software for the Researcher of Restricted Means: This project is intended to offer scientists and students with small budgets advanced versions of aqua-built desktop-editing, graphics and statistical applications for MacOSX.
A collection of software made by Milos Rancic.
HeWIT helps you create and fill out forms. It tells you if you've missed anything or made any mistakes. You can then send the form over email, upload to a web site, or pass it by memory stick to whoever needs it next.
Yet Another Audio Feature Extractor is a toolbox for audio analysis. Easy to use and efficient at extracting a large number of audio features simultaneously. WAV and MP3 files supported, or embedding in C++, Python or Matlab applications.
A windrose, also know as polar rose plot, is a special diagram for representing the distribution of meteorological datas, typically wind speeds by class and direction.
The Modular toolkit for Data Processing (MDP) is a Python data processing framework. From the user's perspective, MDP is a collection of supervised and unsupervised learning algorithms and other data processing units that can be combined into data processing sequences and more complex feed-forward network architectures. From the scientific developer's perspective, MDP is a modular framework, which can easily be expanded. The implementation of new algorithms is easy and intuitive. The new implemented units are then automatically integrated with the rest of the library. The base of available algorithms is steadily increasing and includes signal processing methods (Principal Component Analysis, Independent Component Analysis, Slow Feature Analysis), manifold learning methods ([Hessian] Locally Linear Embedding), several classifiers, probabilistic methods (Factor Analysis, RBM), data pre-processing methods, and many others.
Transana is a cross-platform tool for the transcription and qualitative analysis of audio and video data. It also provides the ability to identify and organize analytically interesting portions of videos, as well as attach keywords to those video clips.
We describe a simple XML format to share text documents and annotation
A minimalist approach to share text documents and data annotations. Allows a large number of different annotations to be represented. Project files contain: - simple code to hold/read/write data and perform sample processing. - BioC-formatted corpora - BioC tools that work with BioC corpora BioC goals - simplicity - interoperability - broad use - reuse There should be little investment required to learn to use a format or a software module to process that format. We are interested in reuse, and we focus on common NLP tasks that are broadly useful for textmining.
py-rrdtool is a Python module provides an interface to RRDTool, the well known graphing/logging tool.
A Rule-based Part-of-Speech and Morphological Tagging Toolkit
RDRPOSTagger is a robust, easy-to-use and language-independent rule-based toolkit for Part-of-Speech (POS) and morphological tagging. RDRPOSTagger obtains fast performance in both learning and tagging process. RDRPOSTagger also achieves a very competitive accuracy in comparison to the state-of-the-art results. RDRPOSTagger now supports pre-trained POS and morphological tagging models for Bulgarian, Czech, Dutch, English, French, German, Hindi, Italian, Portuguese, Spanish, Swedish, Thai and Vietnamese. Additionally, RDRPOSTagger supports the pre-trained Universal POS tagging models for 40 languages. See the full usage of RDRPOSTagger at: http://rdrpostagger.sourceforge.net/
Pretty Damn Quick (PDQ) analytically solves queueing network models of computer and manufacturing systems, data networks, etc., written in conventional programming languages. Generic or customized reports of predicted performance measures are output.
Autoplot is an interactive browser for data on the web
Autoplot is an interactive browser for data on the web. Give Autoplot a URL or local file name and it creates a sensible plot of the data. Autoplot allows you to interactively browse data stored in ascii, .cdf, netcdf, and many other formats.
This is a neat little tool to visualize binary data in a graphical form. It's really interesting to look at different types of files.
JBoost is a simple, robust system for classification. JBoost contains implementations of several boosting algorithms in an alternating decision tree framework. In addition, JBoost provides extensible software for adding more learning algorithms.
RISO: distributed, heterogeneous Bayesian belief networks. Belief network: a probability model defined on an acyclic directed graph; distributed: nodes can be on different hosts; and heterogeneous: allowing different types of conditional distributions.
Discrete wavelet methods for time series analysis using python
Several python libraries implement discrete wavelet transforms. However, none of them, or at least none that I know, is aimed at scientific use. This library aims at filling this gap, in particular considering discrete wavelet transform as described by Percival and Walden. This module started as translation of the wmtsa Matlab toolbox (http://www.atmos.washington.edu/~wmtsa/), so most naming conventions and most of the code structure follows their choices. The code uses a mix of python and cython for improved performance. The code reflects my needs and preferences, but contributions from others are welcome. The code has to some extent been tested, but bugs are to be expected.
BRAHMS is a Modular Execution Framework for dynamical systems. It knits together independently-authored software modules implementing dynamical processes into an integrated system, and supervises the deployment and execution of that system.
A ContextTree is a way of dynamically forming relationships between information: the same information can be viewed in different ways, depending on what you want from it.
A lightweight, browsing-based, 100% Python, federated data integration framework. Users may create custom schemas for disparate sources, query and expand results across sources to find related data; for use in fields such as bioinformatics and datamining