A ranked list of awesome Python open-source libraries
lakeFS - Git-like capabilities for your object storage
Producer and consumer actors with back-pressure for Elixir
Python module that helps you build complex pipelines of batch jobs
Open-source data observability for analytics engineers
StarRocks is a next-gen sub-second MPP database for full analytics
Streaming reactive and dataflow graphs in Python
Design, automate, operate and publish data pipelines at scale
osDQ - dedicated to creating Apache Spark-based data pipelines using JSON