
#881 Split all Latin/Non-Latin cleanup reports into two

Approved
closed
None
5
2016-03-24
2016-03-23
Ahasuerus
No

Split all Latin/Non-Latin cleanup reports into two. The following requirements were copied from the Community Portal:

     This report -- and the other two "Latin/non-Latin" cleanup reports -- identifies two types of potential problems.

    First, it looks for publisher names which:

        use non-Latin characters, and
        do not have at least one transliterated name on file

    For example, C. B. Кульженко was an early 20th-century Russian publisher and アスキーメディアワークス is a Japanese publisher. Once we enter their transliterated names, the records will no longer appear on this report.
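
As an illustration of the first check, here is a minimal Python sketch. It is not the code in edit/cleanup_report.py. The function names, the (name, transliterations) pair shape, and the sample transliteration are hypothetical, and "non-Latin" is approximated by testing whether each letter's Unicode character name contains "LATIN".

```python
import unicodedata


def uses_non_latin(name):
    """Return True if any alphabetic character in name is not a Latin letter.

    Assumption: a letter counts as Latin when its Unicode character
    name contains the word "LATIN". Digits, spaces and punctuation
    are skipped.
    """
    for ch in name:
        if ch.isalpha() and "LATIN" not in unicodedata.name(ch, ""):
            return True
    return False


def missing_transliterations(publishers):
    """Return names that use non-Latin letters and have no transliteration.

    publishers is an iterable of (name, transliterated_names) pairs;
    this shape is hypothetical and stands in for the real database rows.
    """
    return [name for name, trans in publishers
            if uses_non_latin(name) and not trans]


# Illustrative sample data only.
sample = [
    ("アスキーメディアワークス", []),
    ("C. B. Кульженко", ["S. V. Kulzhenko"]),
    ("Oxford University Press", []),
]
report = missing_transliterations(sample)
```

With this sample, only the publisher that has no transliterated name on file stays on the report; the Cyrillic name drops off because one transliteration has been entered.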

    Second, it looks for publications whose titles use a language associated with a non-Latin alphabet/script. It then checks the publisher record to see if its name is spelled using Latin characters. If it is, then the publisher is considered suspect and is added to this report. Of course, it's possible for a Japanese, Russian, etc. publisher to use Latin characters in its name, e.g. "SF 2000", which is why moderators have the option to "ignore" publisher names.
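
The second check can be sketched in the same spirit. This is only an illustration under stated assumptions: the language set, the dictionary keys, and the function names are hypothetical, and the `ignored` set stands in for whatever the moderators' "ignore" option actually stores.

```python
import unicodedata

# Assumed sample of languages written in a non-Latin script.
NON_LATIN_LANGUAGES = {"Japanese", "Russian"}


def is_latin_name(name):
    """Return True if every alphabetic character in name is a Latin letter."""
    return all("LATIN" in unicodedata.name(ch, "")
               for ch in name if ch.isalpha())


def suspect_publishers(publications, ignored=frozenset()):
    """Return publishers with Latin names on non-Latin-language publications.

    publications is an iterable of dicts with the hypothetical keys
    "title_language" and "publisher". Names in ignored are skipped.
    """
    suspects = set()
    for pub in publications:
        name = pub["publisher"]
        if (pub["title_language"] in NON_LATIN_LANGUAGES
                and is_latin_name(name)
                and name not in ignored):
            suspects.add(name)
    return sorted(suspects)


# Illustrative sample data only.
pubs = [
    {"title_language": "Japanese", "publisher": "SF 2000"},
    {"title_language": "Russian", "publisher": "Oxford University Press"},
    {"title_language": "English", "publisher": "Oxford University Press"},
    {"title_language": "Japanese", "publisher": "アスキーメディアワークス"},
]
all_suspects = suspect_publishers(pubs)
after_ignore = suspect_publishers(pubs, ignored={"SF 2000"})
```

Keeping the two checks as separate functions mirrors the request in this ticket: each one can feed its own cleanup report.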

    The reason why international publishers like Oxford University Press appear on this report is that they have presumably published books in non-Latin languages. On the other hand, it's also possible that one or more of their books have had an incorrect language assigned when we entered the data. There is no way to view a publisher-specific list of exceptions at this time, but I am going to add this functionality shortly.

Discussion

  • Ahasuerus

    Ahasuerus - 2016-03-24

    Publishers and publication series done in:

    edit/cleanup_report.py 1.32
    edit/isfdblib.py 1.147
    edit/publisher_exceptions.py 1.1
    edit/TARGETS 1.34
    nightly/nightly_update.py 1.153
    

    Installed in r2016-065 on 2016-03-23. Keeping open until we determine what to do with the Legal Names report.

     
  • Ahasuerus

    Ahasuerus - 2016-03-24
    • status: open --> open-accepted
    • assigned_to: Ahasuerus
     
  • Ahasuerus

    Ahasuerus - 2016-03-24
    • status: open-accepted --> closed
     
  • Ahasuerus

    Ahasuerus - 2016-03-24

    Part 2 - Legal names implemented in:

    edit/cleanup_report.py 1.33
    edit/isfdblib.py 1.148
    nightly/nightly_update.py 1.154
    

    Installed in r2016-066 on 2016-03-24. Closing.

     
