scrubcsv manual

scrubcsv.sh -p2 -p5 -c3 -c8 -s5 -s6 -a12 -n5 -n6 input.csv 1 2 3 4

Run <scrubcsv.sh input.csv> for a list of column names and their corresponding numbers.
<scrubbed.input.csv> is the final product.
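
For readers without the script at hand, a listing of this kind can be approximated with standard tools. This is only a sketch, not how scrubcsv.sh itself does it: list_columns is a hypothetical helper, and it assumes the first line of the file holds the column names and that no name contains a comma.

```shell
# Hypothetical helper, not part of scrubcsv.sh.
# Prints each header field of a comma-delimited file with its 1-based number.
# Assumes the first line is the header and no field contains a comma.
list_columns() {
    head -n 1 "$1" | tr ',' '\n' | awk '{ print NR, $0 }'
}
```

Calling <list_columns input.csv> prints one line per column, with the number first, which is the number to pass on the command line.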

input.csv refers to the file being scrubbed, AFTER ","'s HAVE BEEN
REMOVED and the file has THEN BEEN CONVERTED TO A COMMA-DELIMITED text file.

1 2 3 4 refers to 4 columns (4 is the maximum number of columns
that can be compared simultaneously). If all of them are exactly equal
between records, the record is determined to be a duplicate.
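
The duplicate test described here can be sketched with awk. This is an illustration under stated assumptions, not the code in scrubcsv.sh: dedupe_by_columns is a hypothetical helper, it keeps the first record seen for each combination of values, it assumes no field contains a comma, and it does not enforce the 4-column maximum.

```shell
# Hypothetical helper, not part of scrubcsv.sh.
# Usage: dedupe_by_columns FILE COL [COL ...]   (1-based column numbers)
# Prints FILE with later records dropped when all listed columns exactly
# match an earlier record. Assumes no field contains a comma.
dedupe_by_columns() {
    file=$1
    shift
    awk -F, -v cols="$*" '
        BEGIN { n = split(cols, idx, " ") }     # column numbers to compare
        {
            key = ""
            # Build a key from the chosen columns; SUBSEP keeps "a","bc"
            # distinct from "ab","c".
            for (i = 1; i <= n; i++) key = key $(idx[i]) SUBSEP
            if (!(key in seen)) { seen[key] = 1; print }
        }
    ' "$file"
}
```

For example, <dedupe_by_columns input.csv 1 2 3 4> compares columns 1 through 4 and writes the surviving records to standard output. The comparison is an exact string match, so values that differ only in case or surrounding spaces are not treated as equal.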

Posted by Aaron Prayther 2006-10-02