GETL is a Groovy-based package that automates loading and transforming data. Its name is an acronym for «Groovy ETL».
GETL is a set of libraries of pre-built classes and objects for extracting, transforming and loading data. It can be used from programs written in Groovy or Java, as well as from any software that can work with Java classes.
GETL was developed with the following ideas and requirements in mind:
1. The simpler the class hierarchy, the easier the solution;
2. Data structures tend to change over time or may not be known in advance, and working with them must still be supported;
3. All routine ETL work should be automated wherever possible;
4. Compiling code on the fly ensures speed and leaves room for optimization;
5. A well-thought-out class hierarchy makes it easy to connect other open source solutions.
GETL provides the following features:
- Support for working with CSV, JSON, XML and Excel files
- Support for working with JDBC sources (tables, SQL queries, DDL, sequences)
- Support for copying data flows between sources
- Intelligent data processing (field mapping and type casting) in flows
- Support for working with temporal data
- Support for data transformation (aggregation and sorting)
- Support for parallel execution of tasks
- Support for configuration files
- Support for working with log files
- Gathering statistics on the execution speed of processes
- File management on file systems and FTP
- Library of functions for working with strings, dates, files, lists and maps, and for transforming values
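To make the "mapping and cast" feature more concrete, here is a minimal conceptual sketch in plain Groovy. It does not use the GETL API. All names in it (`destFields`, `sourceRows`, `casts`, `copyFlow`) are hypothetical and only illustrate the idea of copying rows between two structures while matching fields by name and converting values to the target types.

```groovy
// Conceptual sketch in plain Groovy. This is NOT the GETL API:
// every name below is hypothetical and used for illustration only.

// Target structure, known only at runtime: field name -> type name
def destFields = [id: 'INTEGER', name: 'STRING', price: 'NUMERIC']

// Source rows as they might arrive from a CSV file: every value is a string
def sourceRows = [
    [ID: '1', TITLE: 'apple', PRICE: '1.50'],
    [ID: '2', TITLE: 'pear',  PRICE: '0.75']
]

// Casting rule for each target type
def casts = [
    INTEGER: { String v -> v as Integer },
    STRING : { String v -> v },
    NUMERIC: { String v -> new BigDecimal(v) }
]

// Copy rows into the target structure.
// A field is taken from the explicit mapping if one is given,
// otherwise it is matched to a source field by name, ignoring case.
def copyFlow = { List<Map> rows, Map fields, Map mapping ->
    rows.collect { row ->
        fields.collectEntries { destName, type ->
            def sourceName = mapping[destName] ?:
                row.keySet().find { it.equalsIgnoreCase(destName) }
            def raw = sourceName != null ? row[sourceName] : null
            def value = raw != null ? casts[type].call(raw) : null
            [(destName): value]
        }
    }
}

// 'name' has no matching source field, so it is mapped to TITLE explicitly
def result = copyFlow(sourceRows, destFields, [name: 'TITLE'])
result.each { println it }
```

Because the target structure is an ordinary map, it can be built or changed at runtime, which is the situation described in requirement 2 above.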
Great project! A nice and flexible approach to ETL.