Pentaho offers comprehensive data integration and analytics platform.
ETL engine based on Groovy
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON
Java utility that reads the metadata from table(s)
A utility that uses Informatica Operations API
Open source Extract Transform Load engine written in Java