Big Data Stream Analytics Framework.
Distributed scheduled job framework
World's first open source data quality & data preparation project
osDQ is dedicated to creating Apache Spark-based data pipelines using JSON
DSTK - DataScience ToolKit for All of Us
Sparse and dense matrices, linear algebra, visualization, big data
giServer - the easy-to-use and extensible batch and integration server
Workflow Designer, Hive Editor, Pig Editor, File System Browser