Re: [Htmlparser-developer] testStringBeanListener() consistently failing
From: Somik R. <so...@ya...> - 2003-02-09 02:42:49
Derrick Oswald wrote:
> ERROR: HTMLReader.readElement() : Error occurred while trying to
> decipher the tag using scanners
> at Line 686 : null
>
> ...and then it really starts to have problems. It seems the "xxxxxx's"
> pattern causes grief as it reads the > in what it thinks is a single
> quoted string and 'fixes' it.

I doubt that this is the problem. It was because of the TableScanner, Div Scanner, and Span Scanner. I had taken a gamble by putting them in the current set of registered scanners; there's a lot of dirty HTML out there that doesn't close its divs, spans, or tables.

Instead of fixing this issue by adding more code, I want to try refactoring this logic out of the link and form scanners and reusing it. For now, the three scanners mentioned above are not registered by default (the page gets parsed just fine after that).

Regards,
Somik