#782 All pages soup problems

Status: closed-works-for-me

Owner: nobody

Labels: General (277)

Priority: 7

Updated: 2014-08-26

Created: 2008-08-21

Creator: Multichill

Private: No

While running python2.4 imageuncat.py -start:Image:Chironomidae

Working on Image:Cicada.ogg
Got category Category:Images transwikied by BetacommandBot
Working on Image:Cicada.png
Got category Category:Magicicada
Working on Image:Cicada0001.jpg
Got category Category:Cicadellidae
Traceback (most recent call last):
  File "/home/bot/pywikipedia/pagegenerators.py", line 755, in __iter__
    for page in self.wrapped_gen:
  File "/home/bot/pywikipedia/pagegenerators.py", line 688, in DuplicateFilterPageGenerator
    for page in generator:
  File "/home/bot/pywikipedia/pagegenerators.py", line 239, in AllpagesPageGenerator
    for page in site.allpages(start = start, namespace = namespace, includeredirects = includeredirects):
  File "/home/bot/pywikipedia/wikipedia.py", line 5169, in allpages
    for p in soup.api.query.allpages:
AttributeError: 'NoneType' object has no attribute 'query'
'NoneType' object has no attribute 'query'

Pywikipedia [http] trunk/pywikipedia (r5827, Aug 21 2008, 14:32:44)
Python 2.4.4 (#1, Jun 11 2007, 23:35:50)
[GCC 3.3.3 (NetBSD nb3 20040520)]

Why are we using BeautifulSoup anyway? We don't need to screen-scrape the API.

Discussion

Jitse Niesen - 2008-08-21

Logged In: YES
user_id=194734
Originator: NO

I found something strange in allpages() which might have caused the problem and fixed it a minute ago in r5829. However, I'm not sure that this did cause the problem, so I'm leaving the bug open.

BeautifulSoup is used to parse the XML that the API provides. Do you think it's the wrong tool (I honestly don't know)?
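
For comparison, here is a minimal sketch of reading an allpages-style XML response with the standard library's xml.etree.ElementTree instead of BeautifulSoup. It is not the code in wikipedia.py: the sample XML and the helper name allpages_titles are made up for illustration, and the element layout (api / query / allpages / p) is only inferred from the attribute chain in the traceback. The point is that a response without the expected elements yields an empty list instead of an AttributeError on None.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample shaped like an allpages response; not taken from the report.
SAMPLE_XML = """<api>
  <query>
    <allpages>
      <p pageid="1" ns="6" title="Image:Cicada.ogg" />
      <p pageid="2" ns="6" title="Image:Cicada.png" />
    </allpages>
  </query>
</api>"""


def allpages_titles(xml_text):
    """Return the page titles found in an allpages-style XML response.

    Returns an empty list when the text is not well-formed XML or when
    the root element is not <api> (for example, an HTML error page).
    """
    try:
        root = ET.fromstring(xml_text)
    except ET.ParseError:
        return []
    if root.tag != "api":
        return []
    # findall() returns an empty list when any step of the path is missing,
    # so no intermediate None check is needed.
    return [p.get("title") for p in root.findall("./query/allpages/p")]


print(allpages_titles(SAMPLE_XML))
```

Whether this is preferable to BeautifulSoup is a judgment call; it mainly removes a dependency and makes the "element missing" case explicit.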

Stig Meireles Johansen - 2008-08-21

Logged In: YES
user_id=2116333
Originator: NO

I did a quick hack myself before I saw this beautifulsoup-version. I did it with json and simplejson ... I don't know which method is better, but this beautifulsoup-version is prettier.. :)
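
A rough sketch of what a JSON-based variant could look like, for readers comparing the two approaches. This is not the quick hack mentioned above, which is not shown in this thread; the sample payload and the helper name allpages_titles_from_json are hypothetical. It uses the json module that has been in the standard library since Python 2.6; on Python 2.4, simplejson offers the same loads() call.

```python
import json

# Hypothetical payload shaped like an allpages response in JSON format.
SAMPLE_JSON = (
    '{"query": {"allpages": ['
    '{"pageid": 1, "ns": 6, "title": "Image:Cicada.ogg"}, '
    '{"pageid": 2, "ns": 6, "title": "Image:Cicada.png"}]}}'
)


def allpages_titles_from_json(text):
    """Return the page titles found in an allpages-style JSON response.

    Returns an empty list when the text is not valid JSON or when the
    "query" / "allpages" keys are absent.
    """
    try:
        data = json.loads(text)
    except ValueError:  # json.JSONDecodeError is a subclass of ValueError
        return []
    if not isinstance(data, dict):
        return []
    pages = data.get("query", {}).get("allpages", [])
    return [page["title"] for page in pages]


print(allpages_titles_from_json(SAMPLE_JSON))
```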

Russell Blau - 2009-01-30

status: open --> closed-works-for-me

Russell Blau - 2009-01-30

>pagegenerators.py -start:Image:Chironomidae
Checked for running processes. 1 processes currently running, including the current process.
File:Chiropotes aequatorialis map.png
File:Chiropotes chiropotes map.png
File:Chiropotes irrorata map.png
(etc.)
