Pentaho offers comprehensive data integration and analytics platform.
ETL engine based on Groovy
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON
This is a Pentaho Data Integration plugin for CiviCRM.
Open source Extract Transform Load engine written in Java