Real-time, incremental ETL library for ML with record-level depend
Open source annotation and labeling tool for image and video assets
Streaming reactive and dataflow graphs in Python
Build data pipelines, the easy way
BitSail is a distributed high-performance data integration engine
Use SQL to build ELT pipelines on a data lakehouse
Deal with bad samples in your dataset dynamically
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON
A FITS image data viewer & reducer, and UVIT Data Reduction Pipeline.
Mirror of Apache Kafka