Python module that helps you build complex pipelines of batch jobs
Making DAG construction easier
Build data pipelines, the easy way
Conduit streams data between data stores. A Kafka Connect replacement
Tool for visualizing and tracking your machine learning experiments
Kestra is an infinitely scalable orchestration and scheduling platform
The open standard for data logging
Build, run, and manage data pipelines for integrating data
Open source annotation and labeling tool for image and video assets
StarRocks is a next-gen sub-second MPP database for full analytics
A ranked list of awesome Python open-source libraries
End-to-end data integration and analytics platform
Deal with bad samples in your dataset dynamically
Design, automate, operate and publish data pipelines at scale
osDQ is dedicated to creating Apache Spark-based data pipelines using JSON