Data quality analysis, profiling, cleansing, duplicate detection +more
DataCleaner is a data quality analysis application and a platform for data quality (DQ) solutions. Its core is a data profiling engine that is extensible, so it can also handle data cleansing, transformation, enrichment, deduplication, matching and merging.
Website: http://datacleaner.github.io
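To give a rough idea of what "data profiling" means in practice, here is a minimal sketch in plain Java. It is not DataCleaner's API: the `ColumnProfiler` class, its `Profile` type and the choice of metrics (row, null, blank and distinct counts) are assumptions made purely for illustration.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical illustration of column profiling; not part of DataCleaner.
class ColumnProfiler {

    // Summary statistics for a single column of string values.
    static final class Profile {
        final int rowCount;
        final int nullCount;
        final int blankCount;
        final int distinctCount;

        Profile(int rowCount, int nullCount, int blankCount, int distinctCount) {
            this.rowCount = rowCount;
            this.nullCount = nullCount;
            this.blankCount = blankCount;
            this.distinctCount = distinctCount;
        }
    }

    // Counts nulls, blank strings, and distinct values
    // (distinctness ignores case and surrounding whitespace).
    static Profile profile(List<String> values) {
        int nulls = 0;
        int blanks = 0;
        Set<String> distinct = new HashSet<>();
        for (String value : values) {
            if (value == null) {
                nulls++;
                continue;
            }
            String trimmed = value.trim();
            if (trimmed.isEmpty()) {
                blanks++;
                continue;
            }
            distinct.add(trimmed.toLowerCase(Locale.ROOT));
        }
        return new Profile(values.size(), nulls, blanks, distinct.size());
    }

    public static void main(String[] args) {
        Profile p = profile(Arrays.asList("Alice", "alice ", null, "", "Bob"));
        System.out.println("rows=" + p.rowCount + " nulls=" + p.nullCount
                + " blanks=" + p.blankCount + " distinct=" + p.distinctCount);
    }
}
```

A profile like this is typically the starting point for the cleansing and deduplication steps mentioned above: a distinct count lower than the number of filled-in rows hints at duplicates, and null or blank counts point to missing data.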
Open source Extract Transform Load engine written in Java
ETL Framework is a standalone Extract Transform Load engine written in Java. It includes executables for all major platforms and can be easily integrated into other applications.
Key Features:
* embeddable, open source and free
* fast and scalable
* uses target database features to do transformations and loads
* manual and automatic data mapping
* data streaming
* bulk data loads
* data quality features using SQL and JavaScript
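The streaming and bulk-load ideas in the list above can be sketched in a few lines of plain Java. This is only an illustration under stated assumptions, not ETL Framework's actual API: the `MiniEtl` class, its `run` method and the `batchSize` parameter are hypothetical names. Rows are pulled one at a time from a source, transformed, and handed to a loader in fixed-size batches.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;
import java.util.function.Consumer;
import java.util.function.Function;

// Hypothetical illustration of a streaming extract-transform-load loop;
// not part of ETL Framework.
class MiniEtl {

    // Reads rows lazily from the source, transforms each one, and passes
    // them to the loader in batches of at most batchSize rows.
    // Returns the total number of rows processed.
    static <S, T> int run(Iterator<S> source,
                          Function<S, T> transform,
                          Consumer<List<T>> loader,
                          int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<T> batch = new ArrayList<>(batchSize);
        int total = 0;
        while (source.hasNext()) {
            batch.add(transform.apply(source.next()));
            total++;
            if (batch.size() == batchSize) {
                loader.accept(new ArrayList<>(batch)); // hand over a copy
                batch.clear();
            }
        }
        if (!batch.isEmpty()) {
            loader.accept(new ArrayList<>(batch)); // flush the final partial batch
        }
        return total;
    }

    public static void main(String[] args) {
        List<List<String>> loaded = new ArrayList<>();
        int rows = run(Arrays.asList(" a ", "b", " c").iterator(),
                String::trim, loaded::add, 2);
        System.out.println(rows + " rows loaded in " + loaded.size() + " batches");
    }
}
```

In a real pipeline the loader would more likely write each batch to the target database (for example through a JDBC batch insert) rather than collect it in memory. Keeping only one batch in memory at a time is what lets this pattern scale to large inputs.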