Luigi is a Python (3.6, 3.7, 3.8, 3.9 tested) package that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization, handling failures, command line integration, and much more. The purpose of Luigi is to address all the plumbing typically associated with long-running batch processes. You want to chain many tasks, automate them, and failures will happen. These tasks can be anything, but are typically long running things like Hadoop jobs, dumping data to/from databases, running machine learning algorithms, or anything else. You can build pretty much any task you want, but Luigi also comes with a toolbox of several common task templates that you use. It includes support for running Python mapreduce jobs in Hadoop, as well as Hive, and Pig, jobs. It also comes with file system abstractions for HDFS, and local files that ensures all file system operations are atomic.

Features

  • Build complex pipelines of batch jobs
  • It’s easy to build up long-running pipelines that comprise thousands of tasks
  • Luigi helps you stitch many tasks together
  • Luigi takes care of a lot of the workflow management
  • Your data pipeline will not crash in a state containing partial data
  • Using Luigi’s visualiser, you get a nice visual overview of the dependency graph of the workflow

Project Samples

Project Activity

See All Activity >

License

Apache License V2.0

Follow Luigi

Luigi Web Site

Other Useful Business Software
Stop Storing Third-Party Tokens in Your Database Icon
Stop Storing Third-Party Tokens in Your Database

Auth0 Token Vault handles secure token storage, exchange, and refresh for external providers so you don't have to build it yourself.

Rolling your own OAuth token storage can be a security liability. Token Vault securely stores access and refresh tokens from federated providers and handles exchange and renewal automatically. Connected accounts, refresh exchange, and privileged worker flows included.
Try Auth0 for Free
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Luigi!

Additional Project Details

Programming Language

Python

Related Categories

Python Build Tools, Python Workflow Management Software, Python Data Pipeline Tool

Registered

2021-04-08