Browse free open source Data Pipeline tools and projects for Mac below. Use the toggles on the left to filter open source Data Pipeline tools by OS, license, language, programming language, and project status.
Pentaho offers comprehensive data integration and analytics platform.
Conduit streams data between data stores. Kafka Connect replacement
lakeFS - Git-like capabilities for your object storage
AutoGluon: AutoML for Image, Text, and Tabular Data
Kestra is an infinitely scalable orchestration and scheduling platform
Privacy and Security focused Segment-alternative, in Golang
Real-time, incremental ETL library for ML with record-level depend
Mirror of Apache Kafka
A distributed and extensible workflow scheduler platform
A ranked list of awesome Python open-source libraries
Python module that helps you build complex pipelines of batch jobs
Build, run, and manage data pipelines for integrating data
Making DAG construction easier
Open Source Data Orchestration for the Cloud
Backstage is an open platform for building developer portals
StarRocks is a next-gen sub-second MPP database for full analytics
A fast script language for Go
A lightweight stream processing library for Go
The open standard for data logging
Design, automate, operate and publish data pipelines at scale
SeaTunnel is a distributed, high-performance data integration platform
Automated Tool for Optimized Modelling
BitSail is a distributed high-performance data integration engine
A FITS image data viewer & reducer, and UVIT Data Reduction Pipeline.
Pythonic tool for running machine-learning/high performance workflows