Deequ is a library built on top of Apache Spark
Apache OpenWhisk is an open-source serverless cloud platform
Simple and distributed Machine Learning
A unified analytics engine for large-scale data processing
A distributed, fault-tolerant graph database
High-performance, distributed, in-memory key/value store