#48 Interwiki bot removes valid links to pl and ru

Status: closed-fixed
Owner: nobody
Labels: None
Priority: 8
Updated: 2004-07-31
Created: 2004-07-21
Creator: Guanaco
Private: No

The interwiki bot removes valid links to pl and ru. I
have tried changing the encoding of the links, but no
matter how they are encoded, it removes them.

======Post-processing [[en:Bioinformatics]]======
==status==
Changes to be made: Removing:ru
- [[de:Bioinformatik]] [[es:Bioinformática]]
+
+
+
+
+
+
+
+ [[Category:Computer science]]

+ [[de:Bioinformatik]]
+ [[es:Bioinformática]]
- [[pl:Bioinformatyka]]
[[ru:Биоинформатика]]
+ [[pl:Bioinformatyka]]
- [[Category:Computer science]]
NOTE: Replace [[en:Bioinformatics]]
ERROR: removing: ru
NOTE: Performing a recursive query first to save time....
NOTE: Nothing left to do 2
NOTE: Updating live wikipedia...
Sleeping for 5.4 seconds
Changing page en:Bioinformatics

As you can see with the pl link, it only does this when
special characters are in the title.

Discussion

  • Guanaco

    Guanaco - 2004-07-21
    • priority: 5 --> 8
     
  • Daniel Herding

    Daniel Herding - 2004-07-21

    Logged In: YES
    user_id=880694

    First, the pl: link was not removed.
    Second, the ru: link is invalid.

     
  • Andre Engels

    Andre Engels - 2004-07-21
    • status: open --> pending-invalid
     
  • Andre Engels

    Andre Engels - 2004-07-21

    Logged In: YES
    user_id=843018

    The Russian link is removed because the page does not exist.
    Unless you find a better example, I intend to close this bug
    report.

     
  • Guanaco

    Guanaco - 2004-07-22
    • status: pending-invalid --> open
     
  • Andre Engels

    Andre Engels - 2004-07-22

    Logged In: YES
    user_id=843018

    Yes, I see... Those are rather problematic. If I see it
    correctly, what is going on is that (taking the example of
    Boleslaus III of Poland since the other page has already
    been corrected) the link to Polish contains "Å‚" to refer to
    "ł". Apparently taking the Latin-1 encoding of "Å‚" and
    reading that as Unicode (UTF-8) gives "ł".

    I would like to ask help from Brion or someone else familiar
    with Wikipedia's code - how is Wikipedia able to find these
    links, and how can we copy this in the bot without having to
    make two guesses each time this occurs?
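
The effect described in this comment can be reproduced in a few lines. This is only a sketch of the encoding round trip, not the bot's code; the function name `reinterpret_latin1_as_utf8` is made up for illustration.

```python
def reinterpret_latin1_as_utf8(text: str) -> str:
    """Encode text as Latin-1, then decode the same bytes as UTF-8."""
    return text.encode("latin-1").decode("utf-8")


# "ł" (U+0142) is the two bytes C5 82 in UTF-8.
# Read as Latin-1, those bytes become "Å" (U+00C5) and the control
# character U+0082, which is what ends up in the page text.
garbled = "ł".encode("utf-8").decode("latin-1")
assert garbled == "\u00c5\u0082"

# Going the other way recovers the intended character.
assert reinterpret_latin1_as_utf8(garbled) == "ł"
```

A title that is genuinely Latin-1 (for example one containing "á") would raise `UnicodeDecodeError` in this function, which is why a single fixed guess is not enough.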

     
  • Rob W.W. Hooft

    Rob W.W. Hooft - 2004-07-31

    Logged In: YES
    user_id=47476

    Should be fixed now; it was broken by a UTF-8-ification.

    This bug only affected interwiki links that were
    URL-encoded. There used to be a lot of these, but since the
    robot never uses % encoding in the pages, many links of this
    style have already been removed in the past, thereby
    making this bug less visible.
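
For illustration, one way to decode a URL-encoded interwiki title is to undo the % escapes at the byte level, try UTF-8 first, and fall back to Latin-1. This is a sketch under that assumption and is not the fix that was actually committed; `decode_interwiki_title` is a hypothetical helper name.

```python
from urllib.parse import unquote_to_bytes


def decode_interwiki_title(raw_title: str) -> str:
    """Decode a possibly %-encoded link title (hypothetical helper).

    Assumption: the escaped bytes are UTF-8 if they form valid UTF-8,
    otherwise they are treated as Latin-1.
    """
    data = unquote_to_bytes(raw_title)  # "%C5%82" -> b"\xc5\x82"
    try:
        return data.decode("utf-8")
    except UnicodeDecodeError:
        # Latin-1 maps every byte to a character, so this cannot fail.
        return data.decode("latin-1")


assert decode_interwiki_title("Boles%C5%82aw") == "Bolesław"
assert decode_interwiki_title("Bioinform%E1tica") == "Bioinformática"
```

Trying UTF-8 first works because Latin-1 text containing non-ASCII bytes is rarely valid UTF-8, though the heuristic can still guess wrong for unusual byte sequences.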

     
  • Rob W.W. Hooft

    Rob W.W. Hooft - 2004-07-31
    • status: open --> closed-fixed
     