Create a cleanup report that finds ISBNs shared by pubs with significantly different titles, e.g.:
SELECT pub_isbn, MIN(pub_title), MAX(pub_title), COUNT(*)
FROM pubs
WHERE pub_isbn IS NOT NULL
  AND pub_isbn NOT LIKE '%Unknown%'
  AND pub_isbn NOT LIKE '%No ISBN%'
  AND pub_isbn NOT LIKE '%N/A%'
  AND pub_isbn NOT LIKE '%None%'
  AND pub_isbn NOT LIKE '#%'
  AND pub_isbn != ''
  AND pub_ctype != 'MAGAZINE'
GROUP BY pub_isbn
HAVING COUNT(DISTINCT pub_title) > 1
   AND INSTR(MIN(pub_title), MAX(pub_title)) = 0
   AND INSTR(MAX(pub_title), MIN(pub_title)) = 0;
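To illustrate what the two INSTR conditions do, here is a minimal sketch that runs a shortened form of the query against an in-memory SQLite database from Python. It is not the report's actual code. The three-column pubs table, the sample titles, and the ISBN values are made up for the example, and SQLite stands in for the production database, so behavior may differ in details such as collation. Most of the placeholder filters ('%Unknown%', '%N/A%', and so on) are omitted for brevity.

```python
import sqlite3

# Hypothetical, simplified stand-in for the pubs table: only the
# columns the query touches. All rows below are made-up sample data.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE pubs (pub_title TEXT, pub_isbn TEXT, pub_ctype TEXT)")
conn.executemany(
    "INSERT INTO pubs VALUES (?, ?, ?)",
    [
        # Same ISBN, unrelated titles: should be reported.
        ("Alpha Station", "0000000001", "NOVEL"),
        ("Zebra Crossing", "0000000001", "NOVEL"),
        # Same ISBN, but one title contains the other: should be skipped.
        ("Red Planet", "0000000002", "NOVEL"),
        ("Red Planet (Book Club Edition)", "0000000002", "NOVEL"),
        # Excluded by the WHERE clause (magazine, '#' prefix, blank ISBN).
        ("Some Magazine, May 1990", "#12345", "MAGAZINE"),
        ("Untitled", "", "NOVEL"),
    ],
)

# Shortened form of the proposed query. INSTR(a, b) returns 0 when b
# does not occur in a, so the HAVING clause keeps only groups where
# neither the MIN nor the MAX title is a substring of the other.
QUERY = """
SELECT pub_isbn, MIN(pub_title), MAX(pub_title), COUNT(*)
FROM pubs
WHERE pub_isbn IS NOT NULL
  AND pub_isbn NOT LIKE '#%'
  AND pub_isbn != ''
  AND pub_ctype != 'MAGAZINE'
GROUP BY pub_isbn
HAVING COUNT(DISTINCT pub_title) > 1
   AND INSTR(MIN(pub_title), MAX(pub_title)) = 0
   AND INSTR(MAX(pub_title), MIN(pub_title)) = 0
"""

rows = conn.execute(QUERY).fetchall()
for row in rows:
    print(row)  # ('0000000001', 'Alpha Station', 'Zebra Crossing', 2)
```

Note that only the alphabetically first and last titles in each group are compared, so a group of three or more pubs is judged by those two titles alone.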
Anonymous
Implemented (using somewhat different logic):
Installed in r2014-374. Closing.
Added solid cell borders around the Ignore link in mod/same_isbns_different_titles.py 1.2. Installed in r2014-375 on 2014-10-30.
Added a Publication date column in mod/same_isbns_different_titles.py 1.3. Installed in r2014-376 on 2014-10-30.
Fixed the display logic to ignore blank ISBNs in mod/same_isbns_different_titles.py 1.4. Installed in r2014-377 on 2014-10-31.
Part 5 - Fixed a search bug in nightly/nightly_update.py 1.68. Installed in r2014-397 on 2014-11-09.
Part 6 - Fixed a bug that only manifested itself when there were no duplicate ISBNs. Implemented in nightly/nightly_update.py 1.69. Installed in r2014-398 on 2014-11-09.