Distributed Big Data Orchestration Service
Get random, realtime read/write access to your Big Data
The open big data serving engine
First open-source data discovery and observability platform
A graph database that supports more than 100+ billion data
Apache InLong - a one-stop integration framework for massive data
Upserts, Deletes And Incremental Processing on Big Data
Distributed scheduled job framework
Distributed messaging and streaming platform with low latency
Apache Polaris, the interoperable, open source catalog
Apache Iceberg
An end-to-end, realtime and cloud native Lakehouse framework
TestNG testing framework
Flexible tool to build planet-scale vector tilesets
A Flexible and Powerful Parameter Server for large-scale ML
Big Data Stream Analytics Framework.
Unified metadata lake for data & AI assets.
Parquet format file GUI editor
Pentaho offers comprehensive data integration and analytics platform.
Precision Trigonometry: Advanced Calculator for Complex Math
MiRDeep*
The next generation of cloud-native big data management expert
Alink is the Machine Learning algorithm platform based on Flink
A collection of practical tips can be found at the bottom of this page
World's first open source data quality & data preparation project