[Htmlparser-user] Help using StringBean
Brought to you by:
derrickoswald
From: Ido B. <ido...@sy...> - 2012-01-04 15:45:14
|
I'm trying to use stringbean to extract text from a short html. I have the following problem: When looking at an html that starts with 1 letter in one paragraph, and then it ends and another paragraph starts, then a CR is not added. I think the carriagereturn adding function has a bug there (It should be an || instead of the second &&). My questions are: 1. Is the project still active? I've seen a 2.1 version hidden somewhere, but can't see any update on the sourceforge update. (I don't want to start installing patches and editing the code locally). 2. I actually wish to read an html and when encountering a text tag, extract the text from it, while using the text editing capabilities of StringBean. Is there any good way to do this? Thanks, Ido |