Distributed Big Data Orchestration Service
Get random, realtime read/write access to your Big Data
The open big data serving engine
A graph database that supports more than 100+ billion data
Apache InLong - a one-stop integration framework for massive data
Upserts, Deletes And Incremental Processing on Big Data
Distributed scheduled job framework
First open-source data discovery and observability platform
Distributed messaging and streaming platform with low latency
Apache Polaris, the interoperable, open source catalog
Big Data Stream Analytics Framework.
Unified metadata lake for data & AI assets.
Parquet format file GUI editor
Pentaho offers comprehensive data integration and analytics platform.
Precision Trigonometry: Advanced Calculator for Complex Math
World's first open source data quality & data preparation project
Active, high-performance open source database middleware
The Esri Geometry API for Java enables developers to write apps
osDQ dedicated to create apache spark based data pipeline using JSON
MapReduce-based tool to remove duplicate DNA reads
Performance and Productivity at Scale
Hadoop spliced read aligner for RNA-seq data
A lightweight report creation Java library
DSTK - DataScience ToolKit for All of Us
sparse and dense matrix, linear algebra, visualization, big data