Cascalog is a powerful Clojure (and Java) data processing and querying library built atop Hadoop (via Cascading), providing a high-level, Datalog-inspired abstraction for both big data processing and local computation. Cascalog is hosted at Clojars, and some of its dependencies are hosted at Conjars. Both Clo/Con-jars are maven repos that's easy to use with maven or leiningen. The Cascalog website contains more information and links to Various articles and tutorials. The best way to get started with Cascalog is experiment with the toy datasets that ship with the project.

Features

  • Expressive, Datalog-like query language that runs on Hadoop or locally
  • Simplified abstraction over Cascading to avoid low-level Hadoop complexity
  • Seamless handling of distributed Big Data workflows
  • Pure Java API (JCascalog) available for Java integration and experimentation
  • Useful for prototyping data flows that scale from local tests to production clusters
  • Draws inspiration from existing tools like Pig, Hive, and Cascading while providing richer abstraction

Project Samples

Project Activity

See All Activity >

Categories

Data Management

License

MIT License

Follow Cascalog

Cascalog Web Site

Other Useful Business Software
Our Free Plans just got better! | Auth0 Icon
Our Free Plans just got better! | Auth0

With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
Try free now
Rate This Project
Login To Rate This Project

User Reviews

Be the first to post a review of Cascalog!

Additional Project Details

Operating Systems

Linux, Mac, Windows

Programming Language

Java

Related Categories

Java Data Management System

Registered

2025-08-20