Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
Python scripts for ETL (extract, transform and load) jobs for Ethereum
Search replace files or pipe
Sync data between persistence engines, like ETL only not stodgy
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON
This is a Pentaho Data Integration plugin for CiviCRM.