Distributed scheduled job framework
Big Data Stream Analytics Framework.
A portable SCADA/IoT platform centered on the MongoDB database server.
Integrated Comprehensive Data Architecture & Methodology
TensorBase is a new big data warehousing with modern efforts
World's first open source data quality & data preparation project
SZT‑bigdata is an open source project
osDQ dedicated to create apache spark based data pipeline using JSON
MapReduce-based tool to remove duplicate DNA reads
Data Mining and Machine Learning Algorithms based on MapReduce
sparse and dense matrix, linear algebra, visualization, big data
Log-linear analysis (data modelling) for high-dimensional data
Workflow Designer, Hive Editor, Pig Editor, File System Browser
Open Source Reporting & Data Visualization Platform
giServer the easy to use and extensible batch and integration server
PMML-compliant scoring engine and analytic toolkit