Big Data Stream Analytics Framework
Distributed scheduled job framework
World's first open source data quality & data preparation project
osDQ, dedicated to creating Apache Spark-based data pipelines using JSON
MapReduce-based tool to remove duplicate DNA reads
Sparse and dense matrices, linear algebra, visualization, big data
Log-linear analysis (data modelling) for high-dimensional data
Open Source Reporting & Data Visualization Platform
Workflow Designer, Hive Editor, Pig Editor, File System Browser
giServer, the easy-to-use and extensible batch and integration server