From: JEFFREY H. <has...@sb...> - 2010-08-19 20:05:23
|
I was trying to parse a NYTimes.com story and I am not seeing any of the nodes with the article in them. I noticed that they use some custom tags around the article such as: <NYT_TEXT > What would be the behavior of NekoHTML upon parsing a tag such as that? It seems like it is dropping everything within it, or else I am not seeing the expected nodes that live inside of that tag. Thanks, Jeffrey Haskovec |