GenerateData is a general purpose data generation engine. No plug-ins, no APIs, just data generation made easy. From single files, to referentially sound databases, point, click, tweak and generate.
An extension package to Pentaho Data Integration, providing plug-ins. Steps/job entries can be downloaded independently and each comes with source code in the .zip file. All are licensed as LGPL or GPL.