[Htmlparser-user] another bug looking problem
From: Navid H.L. <na...@ya...> - 2002-11-21 00:38:39
Hi Somik,
I tried code like the following to read the text of a file:
for (Enumeration e = parser.elements(); e.hasMoreElements();) {
    HTMLNode node = (HTMLNode) e.nextElement();
    if (node instanceof HTMLStringNode) {
        HTMLStringNode stringNode = (HTMLStringNode) node;
        System.out.println(stringNode.getText());
    } // if
} // for
The file has some JavaScript like the following:
<script>
function test(){
for(i=0; i<12; i++)
{alert(i);
}
</script>
Exactly at the i<12 part of the JavaScript, the program
quits. I guess it treats the "<" sign as the start of a tag.
Please tell me how I can report bugs; I lost my login
name.
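
To illustrate the guess above, here is a minimal, self-contained sketch. It does not use or reproduce the HTMLParser code; the class ScriptAwareScan and its method tags are hypothetical names made up for this example. It only shows how a scanner that treats every "<" as a tag start misreads i<12, and how skipping the script body up to </script> would avoid that.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical illustration only; this is NOT the HTMLParser implementation.
class ScriptAwareScan {

    // Collects the text between each '<' and the next '>' as a "tag".
    // When scriptAware is true, everything between <script...> and
    // </script is skipped, so a '<' inside JavaScript is not a tag start.
    static List<String> tags(String html, boolean scriptAware) {
        List<String> found = new ArrayList<>();
        String lower = html.toLowerCase(Locale.ROOT);
        int pos = 0;
        while (true) {
            int open = html.indexOf('<', pos);
            if (open < 0) break;
            int close = html.indexOf('>', open);
            if (close < 0) break;
            String tag = html.substring(open + 1, close).trim();
            found.add(tag);
            pos = close + 1;
            if (scriptAware && tag.toLowerCase(Locale.ROOT).startsWith("script")) {
                int end = lower.indexOf("</script", pos);
                if (end < 0) break; // unterminated script: stop scanning
                pos = end;          // resume at the closing tag
            }
        }
        return found;
    }

    public static void main(String[] args) {
        String html = "<script>function test(){ for(i=0; i<12; i++) {alert(i);} }"
                + "</script><p>hi</p>";
        System.out.println("naive:        " + tags(html, false));
        System.out.println("script-aware: " + tags(html, true));
    }
}
```

With the naive scan, the second "tag" starts at 12; i++) and runs up to the ">" of </script>, so the closing script tag is swallowed. With the script-aware scan, only script, /script, p and /p are reported.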
Thanks
Navid
--- Somik Raha <so...@ya...> wrote:
> Hi Navid,
> It will be at least a week before it's out, as this one's going to
> have a lot of docs...
> However, if you need the version ASAP, don't worry: this is an
> open source project.
> Simply check out the latest snapshot from CVS and run the build.xml
> file (using Ant). The latest build will be at your disposal.
> If this is too complex, let me know. I can make a build specifically
> for you, but only if you have CVS problems.
> Regards,
> Somik
> ----- Original Message -----
> From: "Navid H.Langaroudi" <na...@ya...>
> To: <htm...@li...>
> Sent: Tuesday, November 19, 2002 10:25 PM
> Subject: Re: [Htmlparser-user] How to use it.
>
>
> > Hi Somik,
> > It is good that the bug is fixed. But when and where can I get
> > the next version? I really need it.
> >
> > Thanks
> > Navid
> >
> > --- Somik Raha <so...@ya...> wrote:
> > > Hi Navid,
> > > Thanks for this bug report! I was able to reproduce it with
> > > test cases. It's fixed now.
> > > Folks: The next version will have this bug fix. Also, I am
> > > refactoring all the package names from com.kizna.* to
> > > org.htmlparser.*.
> > >
> > > This will require you to change your package names, and I must
> > > apologize for the inconvenience, but this will be the first and
> > > last such change. The next release is coming with nicer docs.
> > >
> > > Regards,
> > > Somik
> > > ----- Original Message -----
> > > From: "Somik Raha" <so...@ya...>
> > > To: <htm...@li...>
> > > Sent: Tuesday, November 19, 2002 1:54 PM
> > > Subject: Re: [Htmlparser-user] How to use it.
> > >
> > >
> > > > Hi Navid,
> > > > That does sound like a bug. I will look into it
> > > > tonight. Meanwhile, could you enter this bug report on
> > > > the htmlparser website?
> > > > (http://htmlparser.sourceforge.net)
> > > >
> > > > Regards,
> > > > Somik
> > > > --- "Navid H.Langaroudi" <na...@ya...> wrote:
> > > > > Hi Somik,
> > > > > I got some parts of my program working, and the result
> > > > > was great. Thanks for all your help. But yesterday I
> > > > > found a bug, or more probably it is my mistake.
> > > > > I used the sample in HTMLParser.java:
> > > > >
> > > > > * </pre>
> > > > > * The above snippet will print out only the text contents in the html document.<br>
> > > > > * Here's another snippet that will only print out the link urls in a document.
> > > > > * This is an example of adding a link scanner.
> > > > > * <pre>
> > > > > * HTMLParser parser = new HTMLParser("http://www.yahoo.com");
> > > > > * parser.addScanner(new HTMLLinkScanner("-l"));
> > > > > * for (Enumeration e = parser.elements();e.hasMoreElements();) {
> > > > > *    HTMLNode node = (HTMLNode)e.nextElement();
> > > > > *    if (node instanceof HTMLLinkTag) {
> > > > > *      HTMLLinkTag linkTag = (HTMLLinkTag)node;
> > > > > *      System.out.println(linkTag.getLink());
> > > > > *    }
> > > > > * }
> > > > > * </pre>
> > > > > * @see HTMLParser#elements()
> > > > >
> > > > > It extracts all the URLs of a page. But if the page URL is
> > > > > something like http://www.mysite.com/books/some.asp
> > > > > and the page has, for example, a link such as
> > > > > href="newpage.html"
> > > > >
> > > > > then the extracted URL will be:
> > > > >
> > > > > http://www.mysite.com/books/some.asp/newpage.html
> > > > > which is wrong!
> > > > > Only if I have some.html instead of some.asp will the
> > > > > URL be correct:
> > > > > http://www.mysite.com/books/newpage.html
> > > > >
> > > > > Would you please give me some guidelines?
> > > > > Thank you very much.
> > > > >
> > > > > Nav
> > > > >
> > > > > --- Somik Raha <so...@ya...> wrote:
> > > > > > Navid wrote:
> > > > > > > Thank you Somik, I could run a program that uses the
> > > > > > > HTMLParser classes. But my question is: does
> > > > > > > HTMLParser work as a standalone program?
> > > > > >
> > > > > > The standalone is just a demonstration. HTMLParser
> > > > > > is a library; you are expected to use it in your
> > > > > > application. Check HTMLParser.java for more details.
> > > > > >
> > > > > > Regards,
> > > > > > Somik
> > > > > > _______________________________________________
> > > > > > Htmlparser-user mailing list
> > > > > > Htm...@li...
> > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user
=== message truncated ===