Febrl (Freely Extensible Biomedical Record Linkage) does data standardisation (segmentation and cleaning) and probabilistic record linkage ("fuzzy" matching) of one or more files or data sources which do not share a unique record key or identifier.
febrl-gen is a Java-based frontend to Febrl, an open-source data linkage system written in Python. Users can configure the parameters of a linkage project through the frontend, and febrl-gen will generate a Febrl-ready configuration file.