Pandas on AWS, easy integration with Athena, Glue, Redshift, etc.
ETL engine based on Groovy
Sync data between persistence engines, like ETL only not stodgy
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON
Java tools for decoding and manipulating BER encoded ASN.1 Files
Open source Extract Transform Load engine written in Java
Extract, transform, and load DBF into MySQL