Python module that helps you build complex pipelines of batch jobs
Apache Drill is a distributed MPP query layer for self-describing data
Parser generator to read, process, or translate structured text
A Spark library for Amazon SageMaker
Scalable and Flexible Gradient Boosting
Generate Hive Scripts Automatically from CSV Files
This project holds the source code for the Aspose for Hadoop project.
Open Source Reporting & Data Visualization Platform
Download Free Associated R open source script files for big data analy
Validation of complex Apache Oozie Hadoop workflows
Distributed Index with Apache Hadoop, Apache Lucene and Apache Tika