GUI editor for Parquet format files
Big Data Stream Analytics Framework.
Unified metadata lake for data & AI assets.
World's first open source data quality & data preparation project
osDQ, dedicated to creating Apache Spark-based data pipelines using JSON
Performance and Productivity at Scale
DSTK - DataScience ToolKit for All of Us
Sparse and dense matrices, linear algebra, visualization, big data
giServer: the easy-to-use and extensible batch and integration server
Open Source Reporting & Data Visualization Platform
Workflow Designer, Hive Editor, Pig Editor, File System Browser
Big Sack: A lightweight Java Key/Value store with undo and disk cache.