Eigeny,
In general, you probably want to look at the filter package.
Try running the filterbuilder application (startup script is in the bin
directory) and read the help and tutorial.
Using this application you can create a Java program that selects only
the 'sometext' you want.
Derrick
Eugeny N Dzhurinsky wrote:
>Hello!
>I need to search for HTML code in a page, for instance the code to search
>looks like this:
>
><div class="someclass"><a href="somelocation" ><img src="image/here"
>border="0"></a></div><span style="style2">sometext</span>
>
>This code could be placed as single line or formatted somehow, containing one
>or more linebreaks.
>
>I need also to track situation while this code is commented out, or placed
>outside <body> section.
>
>For now I created a Lexer instance for document and for this code, comparing
>them token by token, but may be there is some better way?
>
>
>
|