Pentaho offers a comprehensive data integration and analytics platform.
Design, automate, operate and publish data pipelines at scale
osDQ is dedicated to creating Apache Spark-based data pipelines using JSON
This is a Pentaho Data Integration plugin for CiviCRM.
Open source Extract, Transform, Load (ETL) engine written in Java