lakeFS - Git-like capabilities for your object storage
Pythonic tool for running machine-learning/high performance workflows
Connect processes into powerful data pipelines
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON