Data quality analysis, profiling, cleansing, duplicate detection +more
DataCleaner is a data quality analysis application and a platform for data quality (DQ) solutions. Its core is a strong, extensible data profiling engine, on top of which it adds data cleansing, transformation, enrichment, deduplication, matching and merging. Website: http://datacleaner.github.io
ETL engine based on Groovy
Note: the repository has moved to https://github.com/ascrus/getl. You can download the jar file from this site or from Maven. GETL is a Groovy-based package that automates loading and transforming data. Its name is an acronym for «Groovy ETL». GETL is a set of libraries of pre-built classes and objects that can be used to extract, transform and load data in programs written in Groovy or Java, as well as from any software that can work with Java classes. GETL was developed with the following ideas and requirements in mind: 1. The simpler the class hierarchy, the simpler the solution; 2. Data structures tend to change over time, or may not be known in advance, and working with them must still be supported; 3. All routine ETL work should be automated wherever possible; 4. Compiling code on the fly ensures speed and leaves room for optimization; 5. A well-designed class hierarchy guarantees easy integration of other open source solutions.
XIForge is a team of IT volunteers exploring new free and open source technology frameworks and platforms. We focus on Pentaho and OpenBravo ERP. Our currently hosted projects include the Pentaho Data Integration Parse JSON String plugin. The team founder is Reid Lai.
Data/Document Work Flow application
Data/Document Work Flow is a set of libraries for building simple cross-platform information systems: - Dwf.Common - collection, reflection, I/O and network helpers - Dwf.Data - cross-RDBMS ORM - Dwf.Gui - Xwt-based desktop UI - Dwf.Data.Gui - database desktop UI - Dwf.Module.Flow - document work flow module - Dwf.Mudule.FlowGui - configure, create, edit and send documents through the flow
A financial math library and financial market data database
This project aims to combine a financial mathematics library with an underlying financial market database (and a set of other tools). It could be used by financial institutions for their financial market data needs, as well as by students for research work.