A ranked list of awesome Python open-source libraries
Kestra is an infinitely scalable orchestration and scheduling platform
A distributed and extensible workflow scheduler platform
SeaTunnel is a distributed, high-performance data integration platform
Pentaho offers comprehensive data integration and analytics platform.
Open source annotation and labeling tool for image and video assets
BitSail is a distributed high-performance data integration engine
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON
Mirror of Apache Kafka