| Name | Modified | Size | Downloads / Week |
|---|---|---|---|
| Parent folder | |||
| metaqa.doc.v0.22.06.zip | 2010-11-17 | 260.0 kB | |
| meta_qa.readme | 2010-11-17 | 3.2 kB | |
| Totals: 2 Items | 263.2 kB | 0 | |
********************************************************************
* This code may be used and distributed under the GNU public licence.
********************************************************************
meta_qa: A generic program for auditing data quality by comparing a
delimited input file or SQL with a database table using
matching key columns.
Typical uses would be to detect errors and analyse in
interfaces (code, scheduling, timing) and data conversions.
Copyright (C) 2007 Mark Besaans
This program is free software; you can redistribute it and/or
modify it under the terms of the GNU General Public License
as published by the Free Software Foundation; either version 2
of the License, or (at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU General Public License for more details.
You should have received a copy of the GNU General Public License
along with this program; if not, write to the Free Software
Foundation, Inc., 51 Franklin Street, Fifth Floor, Boston, MA 02110-1301, USA.
Refer to gpl.txt distributed with this file.
Contact obscured Email address: wwmbes at Woolworths dot co dot za
---------------------------------------------------------------------------------
Instructions for using meta_qa are in a MS-word document which
is in the meta_qa tarball.
Input files should be pipe delimited and begin with
a HEADER| and pipe delimited target-table column-names.
Output is a spreadsheet of the errors detected at key/column level.
---------------------------------------------------------------------------------
Instructions to compile meta_qa (a generic program for detecting differences
between source data files and target data in tables).
1) Make sure you have a working Informix 4gl environment.
It needs to be the 64 bit version for the 4glaccess.c file
and in the "compile" script, both the interpretive and machine
code versions will be attempted.
2) Run the compile script.
3) If you want to see how it works with some meta-data, create the
meta data table using the .sch file and then load the .unl data file into it
for demonstration purposes.
4) Run the program with -h or -help to display the command line options and
parameters.
5) Read the documentation that is displayed by the -help.
6) Prepare a pipe delimited input file with the a header line beginning
with "HEADER|". Each field in the header should contain the name of
a column in the table that is to be audited. (Alternatively provide a file
containing SQL of which that final SQL uses column aliases matching the
target-table column-names exactly statement and can drive a database cursor.)
7) Run the program with the command-line options you require.
8) Check for warnings and error messages.
9) Analize the .xls tab delimited spreadsheet audit trail file of differences
that is produced.
---------------------------------------------------------------------------------
end of file