SeaTunnel is a distributed, high-performance data integration platform
Kestra is an infinitely scalable orchestration and scheduling platform
A distributed and extensible workflow scheduler platform
Backstage is an open platform for building developer portals
Conduit streams data between data stores. Kafka Connect replacement
Open Source Data Orchestration for the Cloud
The open standard for data logging
Build, run, and manage data pipelines for integrating data
lakeFS - Git-like capabilities for your object storage
AutoGluon: AutoML for Image, Text, and Tabular Data
Open-source data observability for analytics engineers
Python module that helps you build complex pipelines of batch jobs
Code review for data in dbt
Streaming reactive and dataflow graphs in Python
BitSail is a distributed high-performance data integration engine
Use SQL to build ELT pipelines on a data lakehouse
Microsoft Integration, Azure, Power Platform, Office 365 and much more
Connect processes into powerful data pipelines
osDQ dedicated to create apache spark based data pipeline using JSON
Mirror of Apache Kafka