A ranked list of awesome Python open-source libraries
Kestra is an infinitely scalable orchestration and scheduling platform
Backstage is an open platform for building developer portals
Open Source Data Orchestration for the Cloud
A distributed and extensible workflow scheduler platform
A lightweight stream processing library for Go
Open-source data observability for analytics engineers
Streaming reactive and dataflow graphs in Python
The open standard for data logging
A fast script language for Go
BitSail is a distributed high-performance data integration engine
Build, run, and manage data pipelines for integrating data
Lightweight, flexible, expressive statistical data testing library
Open-source annotation and labeling tool for image and video assets
lakeFS - Git-like capabilities for your object storage
Privacy- and security-focused Segment alternative, written in Go
Automated Tool for Optimized Modelling
Making DAG construction easier
Conduit streams data between data stores; a Kafka Connect replacement
Code review for data in dbt
Pythonic tool for running machine-learning and high-performance workflows
Next-Generation Event Processing Platform
SeaTunnel is a distributed, high-performance data integration platform
Build data pipelines, the easy way
Tool for visualizing and tracking your machine learning experiments