The open big data serving engine
Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Metadata/data identification Java library
High-level, high-performance dynamic language for technical computing
Web-based SQL editor that runs in your own private cloud
RStudio is an integrated development environment (IDE) for R
A distributed and extensible workflow scheduler platform
Docker image used to run data processing workloads
Build concurrent, distributed, and resilient message-driven apps
Distributed Big Data Orchestration Service
Open Source Data Orchestration for the Cloud
High-performance, open source distributed transaction solution
Qualitis is a one-stop data quality management platform
An open source recommender system service written in Go
SeaTunnel is a distributed, high-performance data integration platform
Scalable and Flexible Gradient Boosting
Upserts, Deletes And Incremental Processing on Big Data
HStreamDB is an open-source, cloud-native streaming database
A graph database that supports more than 100 billion data items
First open-source data discovery and observability platform
Ridgepole is a tool to manage DB schema. It defines DB schema
Apache IoTDB
Yao: a low-code engine to create web services and dashboards
Interactive Online Platform that Visualizes Algorithms from Code