Conduit streams data between data stores; a Kafka Connect replacement
Python module that helps you build complex pipelines of batch jobs
Making DAG construction easier
Kestra is an infinitely scalable orchestration and scheduling platform
Backstage is an open platform for building developer portals
A fast script language for Go
Build, run, and manage data pipelines for integrating data
Privacy- and security-focused Segment alternative, written in Go
Automated Tool for Optimized Modelling
StarRocks is a next-gen sub-second MPP database for full analytics
A ranked list of awesome Python open-source libraries
AutoGluon: AutoML for Image, Text, and Tabular Data
Producer and consumer actors with back-pressure for Elixir
lakeFS - Git-like capabilities for your object storage
Code review for data in dbt
Open source annotation and labeling tool for image and video assets
Streaming reactive and dataflow graphs in Python
Use SQL to build ELT pipelines on a data lakehouse
Microsoft Integration, Azure, Power Platform, Office 365 and much more
Deal with bad samples in your dataset dynamically
Connect processes into powerful data pipelines
osDQ is dedicated to creating Apache Spark-based data pipelines using JSON
A FITS image data viewer & reducer, and UVIT Data Reduction Pipeline.