Scalding is a Scala DSL built on Cascading that simplifies writing Hadoop MapReduce jobs. It lets users describe data transformations with Scala’s functional abstractions while hiding low-level Hadoop boilerplate. Pipeline definitions are expressive and testable, and Scalding integrates with a variety of input and output formats.
Features
- High-level Scala DSL for Hadoop MapReduce using Cascading
- Works with TextLine, Tsv, Json, JDBC, and custom sources
- Supports grouping, joins, aggregations, and typed case class pipelines
- Easily testable locally or on Hadoop clusters
- Compatible with sbt, Maven, Gradle, Mill, Ammonite, and Scala CLI
- Allows parameterized jobs and configurable reducers/settings
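To make the features above concrete, here is a minimal word-count sketch using Scalding's typed API. It assumes the Scalding library is on the classpath; the class name `WordCountJob` and the `input`/`output` argument names are illustrative choices, not something this listing specifies.

```scala
import com.twitter.scalding._

// Hypothetical job: counts how often each word occurs in a text file.
// "input" and "output" are argument names chosen for this sketch.
class WordCountJob(args: Args) extends Job(args) {
  TypedPipe.from(TextLine(args("input")))       // read the file line by line
    .flatMap { line =>
      // split each line into lowercase words, dropping empty tokens
      line.toLowerCase.split("\\s+").filter(_.nonEmpty)
    }
    .groupBy { word => word }                    // group identical words
    .size                                        // count the members of each group
    .write(TypedTsv[(String, Long)](args("output"))) // write (word, count) rows as TSV
}
```

The job is parameterized through `Args`, so the same class can read from and write to different paths supplied as `--input` and `--output` on the command line.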
Categories
Frameworks
License
Apache License V2.0