Real-time, incremental ETL library for ML with record-level depend
Open source annotation and labeling tool for image and video assets
Streaming reactive and dataflow graphs in Python
Build data pipelines, the easy way
BitSail is a distributed high-performance data integration engine
Use SQL to build ELT pipelines on a data lakehouse
Deal with bad samples in your dataset dynamically
Design, automate, operate and publish data pipelines at scale
osDQ dedicated to create apache spark based data pipeline using JSON
Mirror of Apache Kafka