Menu

#1372 Create cleanup reports to find invalid capitalization

Approved
closed
None
5
2020-11-17
2020-10-07
Ahasuerus
No

Create cleanup reports to find invalid capitalization. As per http://www.isfdb.org/wiki/index.php?title=Template:PublicationFields:Title :

For English titles, the ISFDB case regularization rules are as follows: ...

  1. all later words are capitalized except for "and", "or", "the", "a", "an", "for", "of", "in", "on", "by", "at", "from", "with", and "to"

There should be two separate cleanup reports, one for publications and another one for titles. The logic can be similar to:

select title_id from titles where (binary title_title like '% To %' or binary title_title like '% At %') and title_language=17;

Discussion

  • Ahasuerus

    Ahasuerus - 2020-10-07
    • summary: Cleanup reports to find invalid capitalization of prepositions --> Create cleanup reports to find invalid capitalization
    • Description has changed:

    Diff:

    --- old
    +++ new
    @@ -1,4 +1,4 @@
    -Create cleanup reports to find invalid capitalization of prepositions. As per http://www.isfdb.org/wiki/index.php?title=Template:PublicationFields:Title :
    +Create cleanup reports to find invalid capitalization. As per http://www.isfdb.org/wiki/index.php?title=Template:PublicationFields:Title :
    
     For English titles, the ISFDB case regularization rules are as follows:  ...
    
     2. all later words are capitalized except for "and", "or", "the", "a", "an", "for", "of", "in", "on", "by", "at", "from", "with", and "to"
    
     
  • Ahasuerus

    Ahasuerus - 2020-10-07
    • Description has changed:

    Diff:

    --- old
    +++ new
    @@ -3,4 +3,6 @@
     For English titles, the ISFDB case regularization rules are as follows:  ...
    
     2. all later words are capitalized except for "and", "or", "the", "a", "an", "for", "of", "in", "on", "by", "at", "from", "with", and "to"
    
    -There should be two separate cleanup reports, one for publications and another one for titles.
    +There should be two separate cleanup reports, one for publications and another one for titles. The logic can be similar to:
    +
    +select title_id from titles where (binary title_title like '% To %' or binary title_title like '% At %') and title_language=17;
    
     
  • Ahasuerus

    Ahasuerus - 2020-10-12

    Part 1 - Title records:

    common/isfdb.py
    edit/cleanup_lib.py
    edit/cleanup_report.py
    nightly/nightly_job.py
    

    Installed in SVN 553 on 2020-10-12. Still need to create a separate cleanup report for publication records.

     
  • Ahasuerus

    Ahasuerus - 2020-10-28

    Part 2 - Fixed a bug with the 1000 record limit:

    edit/cleanup_report.py
    nightly/nightly_job.py
    

    Installed in SVN 555 on 2020-10-28.

     
  • Ahasuerus

    Ahasuerus - 2020-11-17
    • status: open --> closed
    • assigned_to: Ahasuerus
     
  • Ahasuerus

    Ahasuerus - 2020-11-17

    Part 3 - Created a report for publication records:

    edit/cleanup_lib.py
    edit/cleanup_report.py
    nightly/nightly_job.py
    

    Installed SVN 567 on 2020-11-17. Closing the FR.

     

Anonymous
Anonymous

Add attachments
Cancel





MongoDB Logo MongoDB