Menu

#13 Duplicate check prevents article retrieval

open
nobody
5
2008-12-07
2008-12-07
dgrant9
No

One of the articles which I was trying to retrieve was unique in every respect, except that its title was the same as an older article which was present in the database. For some unknown reason, LeafRSS was refusing to pull this article into the database. I had LeafRSS pull the feed at least 5 times, while changing various parameters to try to get it to work, but it refused to accept that article. I decided to try turning off duplicate checking. With duplicate checking turned off, the article was pulled into the database with the next fetch of the feed, instantly and with no problems.

Looking at the code, I don't see how an article could be completely blocked from being entered into the database by the duplicate checking code. Duplicate checking is only supposed to mark an article as "ignored", and that is what the code appears to be designed to do. But in this instance, the duplicate checking code (or something affected by it) was blocking an article from being entered into the database.

Note: rssgenesis is useful for debugging problems like this, as it allows article and feed parameters to set and adjusted as needed for testing purposes.

Discussion


Log in to post a comment.

MongoDB Logo MongoDB