A lightweight stream processing library for Go
A distributed and extensible workflow scheduler platform
A ranked list of awesome Python open-source libraries
Kestra is an infinitely scalable orchestration and scheduling platform
Producer and consumer actors with back-pressure for Elixir
StarRocks is a next-gen sub-second MPP database for full analytics
Next-Generation Event Processing Platform
AutoGluon: AutoML for Image, Text, and Tabular Data
Real-time, incremental ETL library for ML with record-level depend
BitSail is a distributed high-performance data integration engine
Deal with bad samples in your dataset dynamically
osDQ is dedicated to creating Apache Spark-based data pipelines using JSON
A FITS image data viewer & reducer, and UVIT Data Reduction Pipeline.
Mirror of Apache Kafka