Browse free open source Data Pipeline tools and projects for Windows below. Use the toggles on the left to filter open source Data Pipeline tools by OS, license, language, programming language, and project status.
Pentaho offers comprehensive data integration and analytics platform.
A ranked list of awesome Python open-source libraries
StarRocks is a next-gen sub-second MPP database for full analytics
Privacy and Security focused Segment-alternative, in Golang
Backstage is an open platform for building developer portals
Microsoft Integration, Azure, Power Platform, Office 365 and much more
lakeFS - Git-like capabilities for your object storage
A distributed and extensible workflow scheduler platform
Kestra is an infinitely scalable orchestration and scheduling platform
Python module that helps you build complex pipelines of batch jobs
SeaTunnel is a distributed, high-performance data integration platform
AutoGluon: AutoML for Image, Text, and Tabular Data
Automated Tool for Optimized Modelling
BitSail is a distributed high-performance data integration engine
Conduit streams data between data stores. Kafka Connect replacement
Use SQL to build ELT pipelines on a data lakehouse
Open source annotation and labeling tool for image and video assets
Open-source data observability for analytics engineers
Build, run, and manage data pipelines for integrating data
Code review for data in dbt
Streaming reactive and dataflow graphs in Python
Light-weight, flexible, expressive statistical data testing library
A lightweight stream processing library for Go
Making DAG construction easier
Next-Generation Event Processing Platform