Drake
Data workflow tool, like a "Make for data"
Drake is a Clojure CLI tool designed for managing data workflows with declarative definitions of inputs, outputs, and processing steps. It resolves dependencies between data artifacts, detecting required rebuilds based on file timestamps. With built-in support for HDFS, Clojure-based plugins, logging, memoization, and automation, Drake is focused on reproducible, scalable data pipelines