Menu

#725 Create a cleanup report to find duplicate ISBNs

Approved
closed
None
5
2014-11-10
2014-10-26
Ahasuerus
No

Create a cleanup report to find duplicate ISBNs for pubs with significantly different titles, e.g.:

select pub_isbn, MIN(pub_title), MAX(pub_title), count(*)
from pubs
where pub_isbn IS NOT NULL
and pub_isbn NOT LIKE '%Unknown%'
and pub_isbn NOT LIKE '%No ISBN%'
and pub_isbn NOT LIKE '%N/A%'
and pub_isbn NOT LIKE '%None%'
and pub_isbn NOT LIKE '#%'
and pub_isbn != ''
and pub_ctype != 'MAGAZINE'
group by pub_isbn
having count(distinct pub_title) > 1
AND INSTR(MIN(pub_title), MAX(pub_title)) = 0
AND INSTR(MAX(pub_title), MIN(pub_title)) = 0;

Discussion

  • Ahasuerus

    Ahasuerus - 2014-10-30

    Implemented (using somewhat different logic):

    mod/cleanup.py 1.71
    mod/same_isbns_different_titles.py 1.1
    mod/TARGETS 1.60
    nightly/nightly_update.py 1.60
    

    Installed in r2014-374. Closing.

     
  • Ahasuerus

    Ahasuerus - 2014-10-30
    • status: open --> closed
    • assigned_to: Ahasuerus
     
  • Ahasuerus

    Ahasuerus - 2014-10-30

    Added solid cell borders around the Ignore link in mod/same_isbns_different_titles.py 1.2. Installed in r2014-375 on 2014-10-30.

     
  • Anonymous

    Anonymous - 2014-10-31

    Added a Publication date column in mod/same_isbns_different_titles.py 1.3. Installed in r2014-376 on 2014-10-30.

     
  • Ahasuerus

    Ahasuerus - 2014-10-31

    Fixed the display logic to ignore blank ISBNs in mod/same_isbns_different_titles.py 1.4. Installed in r2014-377 on 2014-10-31.

     
  • Ahasuerus

    Ahasuerus - 2014-11-10

    Part 5 - Fixed a search bug in nightly/nightly_update.py 1.68. Installed in r2014-397 on 2014-11-09.

     
  • Ahasuerus

    Ahasuerus - 2014-11-10

    Part 6 - Fixed a bug that only manifested itself when there were no duplicate ISBNs. Implemented in nightly/nightly_update.py 1.69. Installed in r2014-398 on 2014-11-09.

     

Anonymous
Anonymous

Add attachments
Cancel





MongoDB Logo MongoDB