Thread: [Htmlparser-user] How to use it.
Brought to you by:
derrickoswald
From: Navid H.L. <na...@ya...> - 2002-11-09 19:09:39
|
Hi I am very new here. How I can use htmlparser? How I should set up the library so the java import works? I can not even compile and run the sample programs. I have jdk on my computer. Can some one give me basic instructions to do these. I greatly appreciat your help. Nav __________________________________________________ Do you Yahoo!? U2 on LAUNCH - Exclusive greatest hits videos http://launch.yahoo.com/u2 |
From: Somik R. <so...@ya...> - 2002-11-09 19:11:43
|
Hi Navid, The text doc is outdated (it will be updated for the next release). You can refer to the javadoc - it is uptodate, and is intensive. Check the javadoc of HTMLParser.java to get started. You should keep htmlparser.jar in your classpath (this is in the lib directory), and import com.kizna.html.* in your program. Feel free to put your questions here. Regards, Somik ----- Original Message ----- From: "Navid H.Langaroudi" <na...@ya...> To: <htm...@li...> Sent: Saturday, November 09, 2002 11:09 AM Subject: [Htmlparser-user] How to use it. > Hi I am very new here. How I can use htmlparser? How I > should set up the library so the java import works? > I can not even compile and run the sample programs. I > have jdk on my computer. Can some one give me basic > instructions to do these. > > I greatly appreciat your help. > > Nav > > __________________________________________________ > Do you Yahoo!? > U2 on LAUNCH - Exclusive greatest hits videos > http://launch.yahoo.com/u2 > > > ------------------------------------------------------- > This sf.net email is sponsored by:ThinkGeek > Welcome to geek heaven. > http://thinkgeek.com/sf > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user |
From: Navid H.L. <na...@ya...> - 2002-11-09 19:42:51
|
Thanks Somik. It was a great help. Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > The text doc is outdated (it will be updated for > the next release). You > can refer to the javadoc - it is uptodate, and is > intensive. > Check the javadoc of HTMLParser.java to get > started. > > You should keep htmlparser.jar in your classpath > (this is in the lib > directory), and import com.kizna.html.* in your > program. > Feel free to put your questions here. > > Regards, > Somik > ----- Original Message ----- > From: "Navid H.Langaroudi" <na...@ya...> > To: <htm...@li...> > Sent: Saturday, November 09, 2002 11:09 AM > Subject: [Htmlparser-user] How to use it. > > > > Hi I am very new here. How I can use htmlparser? > How I > > should set up the library so the java import > works? > > I can not even compile and run the sample > programs. I > > have jdk on my computer. Can some one give me > basic > > instructions to do these. > > > > I greatly appreciat your help. > > > > Nav > > > > __________________________________________________ > > Do you Yahoo!? > > U2 on LAUNCH - Exclusive greatest hits videos > > http://launch.yahoo.com/u2 > > > > > > > ------------------------------------------------------- > > This sf.net email is sponsored by:ThinkGeek > > Welcome to geek heaven. > > http://thinkgeek.com/sf > > _______________________________________________ > > Htmlparser-user mailing list > > Htm...@li... > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > ------------------------------------------------------- > This sf.net email is sponsored by:ThinkGeek > Welcome to geek heaven. > http://thinkgeek.com/sf > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user __________________________________________________ Do you Yahoo!? U2 on LAUNCH - Exclusive greatest hits videos http://launch.yahoo.com/u2 |
From: Navid H.L. <na...@ya...> - 2002-11-11 20:16:54
|
Hi Somik, It seems I still need more hlep. I fixed the classpath, I am working on XP, so I set up an environment varible as classpath=c:\thefolder contains the htmlparser.jar Then I tried to compile the HTMLParser.java, but it is giving error, saying: package com.kizna.html.scanners does not exist import com.kizna.html.scanners.* and there are 24 more error. Am I completely on wrong track? If htmlparser.jar be perfectly in class path, then all provided classes will work in a new program? Do I have to compile HTMLParser.java at all? I really appreciate you help. Thanks in advance. Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > The text doc is outdated (it will be updated for > the next release). You > can refer to the javadoc - it is uptodate, and is > intensive. > Check the javadoc of HTMLParser.java to get > started. > > You should keep htmlparser.jar in your classpath > (this is in the lib > directory), and import com.kizna.html.* in your > program. > Feel free to put your questions here. > > Regards, > Somik > ----- Original Message ----- > From: "Navid H.Langaroudi" <na...@ya...> > To: <htm...@li...> > Sent: Saturday, November 09, 2002 11:09 AM > Subject: [Htmlparser-user] How to use it. > > > > Hi I am very new here. How I can use htmlparser? > How I > > should set up the library so the java import > works? > > I can not even compile and run the sample > programs. I > > have jdk on my computer. Can some one give me > basic > > instructions to do these. > > > > I greatly appreciat your help. > > > > Nav > > > > __________________________________________________ > > Do you Yahoo!? > > U2 on LAUNCH - Exclusive greatest hits videos > > http://launch.yahoo.com/u2 > > > > > > > ------------------------------------------------------- > > This sf.net email is sponsored by:ThinkGeek > > Welcome to geek heaven. > > http://thinkgeek.com/sf > > _______________________________________________ > > Htmlparser-user mailing list > > Htm...@li... > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > ------------------------------------------------------- > This sf.net email is sponsored by:ThinkGeek > Welcome to geek heaven. > http://thinkgeek.com/sf > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user __________________________________________________ Do you Yahoo!? U2 on LAUNCH - Exclusive greatest hits videos http://launch.yahoo.com/u2 |
From: Somik R. <so...@ya...> - 2002-11-12 05:02:44
|
Hi Navid, Looks like you're having basic java problems here. Whichever class you use, you should have corresponding packages imported. If you're using an intelligent IDE like Eclipse(http://www.eclipse.org), this is usually auto-detected (when you choose Arrange Imports). One way of cross-checking is - open up the jar file to see that the class or classes you're trying to import exists. In any case, post the program that you're trying to compile, with the entire errors that are thrown, and we might be able to help. Regards, Somik |
From: Navid H.L. <na...@ya...> - 2002-11-14 22:40:53
|
Thank you Somik, I could run some program to use HTMLparser classes. But my question is , Does HTMLparser works as a standalone program? Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > Looks like you're having basic java problems > here. Whichever class you > use, you should have corresponding packages > imported. If you're using an > intelligent IDE like > Eclipse(http://www.eclipse.org), this is usually > auto-detected (when you choose Arrange Imports). > One way of cross-checking is - open up the jar > file to see that the > class or classes you're trying to import exists. > In any case, post the program that you're trying > to compile, with the > entire errors that are thrown, and we might be able > to help. > > Regards, > Somik > > > > ------------------------------------------------------- > This sf.net email is sponsored by:ThinkGeek > Welcome to geek heaven. > http://thinkgeek.com/sf > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user __________________________________________________ Do you Yahoo!? Yahoo! Web Hosting - Let the expert host your site http://webhosting.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-16 20:49:46
|
Navid wrote : > Thank you Somik, I could run some program to use > HTMLparser classes. But my question is , Does > HTMLparser works as a standalone program? The standalone is just a demonstration. HTMLParser is a library - you are expected to use it in your application. Check HTMLParser.java for more details. Regards, Somik |
From: Navid H.L. <na...@ya...> - 2002-11-19 19:39:36
|
Hi Somik, I could do some parts of my program. And the result was great. Thanks for all your helps. But yesterday I found a bug, or most probably it is my mistake. I used the sample in HTMLParser.java * </pre> * The above snippet will print out only the text contents in the html document.<br> * Here's another snippet that will only print out the link urls in a document. * This is an example of adding a link scanner. * <pre> * HTMLParser parser = new HTMLParser("http://www.yahoo.com"); * parser.addScanner(new HTMLLinkScanner("-l")); * for (Enumeration e = parser.elements();e.hasMoreElements();) { * HTMLNode node = (HTMLNode)e.nextElement(); * if (node instanceof HTMLLinkTag) { * HTMLLinkTag linkTag = (HTMLLinkTag)node; * System.out.println(linkTag.getLink()); * } * } * </pre> * @see HTMLParser#elements() It extract all urls of a page. But if the page url be something like http://www.mysite.com/books/some.asp and for example page has a link as href="newpage.html" then the extracted url will be: http://www.mysite.com/books/some.asp/newpage.html which is wrong! only if the instead some.asp I have some.html then the url will be correct! http://www.mysite.com/books/newpage.html Would you please give me some guide line. Thank you very much. Nav --- Somik Raha <so...@ya...> wrote: > Navid wrote : > > Thank you Somik, I could run some program to use > > HTMLparser classes. But my question is , Does > > HTMLparser works as a standalone program? > > The standalone is just a demonstration. HTMLParser > is a library - you are > expected to use it in your application. Check > HTMLParser.java for more > details. > > Regards, > Somik > > > > ------------------------------------------------------- > This sf.net email is sponsored by: To learn the > basics of securing > your web site with SSL, click here to get a FREE > TRIAL of a Thawte > Server Certificate: > http://www.gothawte.com/rd524.html > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user __________________________________________________ Do you Yahoo!? Yahoo! Web Hosting - Let the expert host your site http://webhosting.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-19 21:54:00
|
Hi Navid, That does sound like a bug. I will look into it tonight. Meanwhile, could you enter this bug report in the htmlparser website ? (http://htmlparser.sourceforge.net). Regards, Somik --- "Navid H.Langaroudi" <na...@ya...> wrote: > Hi Somik, > I could do some parts of my program. And the result > was great. Thanks for all your helps. But yesterday > I > found a bug, or most probably it is my mistake. > I used the sample in HTMLParser.java > > * </pre> > * The above snippet will print out only the text > contents in the html document.<br> > * Here's another snippet that will only print out > the > link urls in a document. > * This is an example of adding a link scanner. > * <pre> > * HTMLParser parser = new > HTMLParser("http://www.yahoo.com"); > * parser.addScanner(new HTMLLinkScanner("-l")); > * for (Enumeration e = > parser.elements();e.hasMoreElements();) { > * HTMLNode node = (HTMLNode)e.nextElement(); > * if (node instanceof HTMLLinkTag) { > * HTMLLinkTag linkTag = (HTMLLinkTag)node; > * System.out.println(linkTag.getLink()); > * } > * } > * </pre> > * @see HTMLParser#elements() > > It extract all urls of a page. But if the page url > be > something like http://www.mysite.com/books/some.asp > and for example page has a link as > href="newpage.html" > > then the extracted url will be: > http://www.mysite.com/books/some.asp/newpage.html > which is wrong! > only if the instead some.asp I have some.html then > the > url will be correct! > http://www.mysite.com/books/newpage.html > > Would you please give me some guide line. > Thank you very much. > > Nav > > --- Somik Raha <so...@ya...> wrote: > > Navid wrote : > > > Thank you Somik, I could run some program to use > > > HTMLparser classes. But my question is , Does > > > HTMLparser works as a standalone program? > > > > The standalone is just a demonstration. HTMLParser > > is a library - you are > > expected to use it in your application. Check > > HTMLParser.java for more > > details. > > > > Regards, > > Somik > > > > > > > > > ------------------------------------------------------- > > This sf.net email is sponsored by: To learn the > > basics of securing > > your web site with SSL, click here to get a FREE > > TRIAL of a Thawte > > Server Certificate: > > http://www.gothawte.com/rd524.html > > _______________________________________________ > > Htmlparser-user mailing list > > Htm...@li... > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > __________________________________________________ > Do you Yahoo!? > Yahoo! Web Hosting - Let the expert host your site > http://webhosting.yahoo.com > > > ------------------------------------------------------- > This sf.net email is sponsored by: To learn the > basics of securing > your web site with SSL, click here to get a FREE > TRIAL of a Thawte > Server Certificate: > http://www.gothawte.com/rd524.html > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user __________________________________________________ Do you Yahoo!? Yahoo! Web Hosting - Let the expert host your site http://webhosting.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-20 06:06:05
|
Hi Navid, Thanks for this bug report! I was able to reproduce it with testcases. Its fixed now. Folks : The next version will be having this bug fix. Also, I am refactoring all the package names from com.kizna.* to org.htmlparser.*. This would cause you to change your package names, and I must apologize for the inconvenience, but this would be the first and last change. The next release is coming with nicer docs.. Regards, Somik ----- Original Message ----- From: "Somik Raha" <so...@ya...> To: <htm...@li...> Sent: Tuesday, November 19, 2002 1:54 PM Subject: Re: [Htmlparser-user] How to use it. > Hi Navid, > That does sound like a bug. I will look into it > tonight. Meanwhile, could you enter this bug report in > the htmlparser website ? > (http://htmlparser.sourceforge.net). > > Regards, > Somik > --- "Navid H.Langaroudi" <na...@ya...> wrote: > > Hi Somik, > > I could do some parts of my program. And the result > > was great. Thanks for all your helps. But yesterday > > I > > found a bug, or most probably it is my mistake. > > I used the sample in HTMLParser.java > > > > * </pre> > > * The above snippet will print out only the text > > contents in the html document.<br> > > * Here's another snippet that will only print out > > the > > link urls in a document. > > * This is an example of adding a link scanner. > > * <pre> > > * HTMLParser parser = new > > HTMLParser("http://www.yahoo.com"); > > * parser.addScanner(new HTMLLinkScanner("-l")); > > * for (Enumeration e = > > parser.elements();e.hasMoreElements();) { > > * HTMLNode node = (HTMLNode)e.nextElement(); > > * if (node instanceof HTMLLinkTag) { > > * HTMLLinkTag linkTag = (HTMLLinkTag)node; > > * System.out.println(linkTag.getLink()); > > * } > > * } > > * </pre> > > * @see HTMLParser#elements() > > > > It extract all urls of a page. But if the page url > > be > > something like http://www.mysite.com/books/some.asp > > and for example page has a link as > > href="newpage.html" > > > > then the extracted url will be: > > http://www.mysite.com/books/some.asp/newpage.html > > which is wrong! > > only if the instead some.asp I have some.html then > > the > > url will be correct! > > http://www.mysite.com/books/newpage.html > > > > Would you please give me some guide line. > > Thank you very much. > > > > Nav > > > > --- Somik Raha <so...@ya...> wrote: > > > Navid wrote : > > > > Thank you Somik, I could run some program to use > > > > HTMLparser classes. But my question is , Does > > > > HTMLparser works as a standalone program? > > > > > > The standalone is just a demonstration. HTMLParser > > > is a library - you are > > > expected to use it in your application. Check > > > HTMLParser.java for more > > > details. > > > > > > Regards, > > > Somik > > > > > > > > > > > > > > > ------------------------------------------------------- > > > This sf.net email is sponsored by: To learn the > > > basics of securing > > > your web site with SSL, click here to get a FREE > > > TRIAL of a Thawte > > > Server Certificate: > > > http://www.gothawte.com/rd524.html > > > _______________________________________________ > > > Htmlparser-user mailing list > > > Htm...@li... > > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > > > __________________________________________________ > > Do you Yahoo!? > > Yahoo! Web Hosting - Let the expert host your site > > http://webhosting.yahoo.com > > > > > > > ------------------------------------------------------- > > This sf.net email is sponsored by: To learn the > > basics of securing > > your web site with SSL, click here to get a FREE > > TRIAL of a Thawte > > Server Certificate: > > http://www.gothawte.com/rd524.html > > _______________________________________________ > > Htmlparser-user mailing list > > Htm...@li... > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > __________________________________________________ > Do you Yahoo!? > Yahoo! Web Hosting - Let the expert host your site > http://webhosting.yahoo.com > > > ------------------------------------------------------- > This sf.net email is sponsored by: To learn the basics of securing > your web site with SSL, click here to get a FREE TRIAL of a Thawte > Server Certificate: http://www.gothawte.com/rd524.html > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user |
From: Navid H.L. <na...@ya...> - 2002-11-20 06:25:36
|
Hi Somik, It is good the bug is fixed. But when and where I get the next version. I really need it. Thanks Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > Thanks for this bug report! I was able to > reproduce it with testcases. > Its fixed now. > Folks : The next version will be having this bug > fix. Also, I am > refactoring all the package names from com.kizna.* > to org.htmlparser.*. > > This would cause you to change your package > names, and I must apologize > for the inconvenience, but this would be the first > and last change. The next > release is coming with nicer docs.. > > Regards, > Somik > ----- Original Message ----- > From: "Somik Raha" <so...@ya...> > To: <htm...@li...> > Sent: Tuesday, November 19, 2002 1:54 PM > Subject: Re: [Htmlparser-user] How to use it. > > > > Hi Navid, > > That does sound like a bug. I will look into it > > tonight. Meanwhile, could you enter this bug > report in > > the htmlparser website ? > > (http://htmlparser.sourceforge.net). > > > > Regards, > > Somik > > --- "Navid H.Langaroudi" <na...@ya...> > wrote: > > > Hi Somik, > > > I could do some parts of my program. And the > result > > > was great. Thanks for all your helps. But > yesterday > > > I > > > found a bug, or most probably it is my mistake. > > > I used the sample in HTMLParser.java > > > > > > * </pre> > > > * The above snippet will print out only the > text > > > contents in the html document.<br> > > > * Here's another snippet that will only print > out > > > the > > > link urls in a document. > > > * This is an example of adding a link scanner. > > > * <pre> > > > * HTMLParser parser = new > > > HTMLParser("http://www.yahoo.com"); > > > * parser.addScanner(new HTMLLinkScanner("-l")); > > > * for (Enumeration e = > > > parser.elements();e.hasMoreElements();) { > > > * HTMLNode node = (HTMLNode)e.nextElement(); > > > * if (node instanceof HTMLLinkTag) { > > > * HTMLLinkTag linkTag = > (HTMLLinkTag)node; > > > * System.out.println(linkTag.getLink()); > > > * } > > > * } > > > * </pre> > > > * @see HTMLParser#elements() > > > > > > It extract all urls of a page. But if the page > url > > > be > > > something like > http://www.mysite.com/books/some.asp > > > and for example page has a link as > > > href="newpage.html" > > > > > > then the extracted url will be: > > > > http://www.mysite.com/books/some.asp/newpage.html > > > which is wrong! > > > only if the instead some.asp I have some.html > then > > > the > > > url will be correct! > > > http://www.mysite.com/books/newpage.html > > > > > > Would you please give me some guide line. > > > Thank you very much. > > > > > > Nav > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > Navid wrote : > > > > > Thank you Somik, I could run some program to > use > > > > > HTMLparser classes. But my question is , > Does > > > > > HTMLparser works as a standalone program? > > > > > > > > The standalone is just a demonstration. > HTMLParser > > > > is a library - you are > > > > expected to use it in your application. Check > > > > HTMLParser.java for more > > > > details. > > > > > > > > Regards, > > > > Somik > > > > > > > > > > > > > > > > > > > > > > ------------------------------------------------------- > > > > This sf.net email is sponsored by: To learn > the > > > > basics of securing > > > > your web site with SSL, click here to get a > FREE > > > > TRIAL of a Thawte > > > > Server Certificate: > > > > http://www.gothawte.com/rd524.html > > > > > _______________________________________________ > > > > Htmlparser-user mailing list > > > > Htm...@li... > > > > > > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > > > > > > > __________________________________________________ > > > Do you Yahoo!? > > > Yahoo! Web Hosting - Let the expert host your > site > > > http://webhosting.yahoo.com > > > > > > > > > > > > ------------------------------------------------------- > > > This sf.net email is sponsored by: To learn the > > > basics of securing > > > your web site with SSL, click here to get a FREE > > > TRIAL of a Thawte > > > Server Certificate: > > > http://www.gothawte.com/rd524.html > > > _______________________________________________ > > > Htmlparser-user mailing list > > > Htm...@li... > > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > > > __________________________________________________ > > Do you Yahoo!? > > Yahoo! Web Hosting - Let the expert host your site > > http://webhosting.yahoo.com > > > > > > > ------------------------------------------------------- > > This sf.net email is sponsored by: To learn the > basics of securing > > your web site with SSL, click here to get a FREE > TRIAL of a Thawte > > Server Certificate: > http://www.gothawte.com/rd524.html > > _______________________________________________ > > Htmlparser-user mailing list > > Htm...@li... > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > ------------------------------------------------------- > This sf.net email is sponsored by: To learn the > basics of securing > your web site with SSL, click here to get a FREE > TRIAL of a Thawte > Server Certificate: > http://www.gothawte.com/rd524.html > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user __________________________________________________ Do you Yahoo!? Yahoo! Web Hosting - Let the expert host your site http://webhosting.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-20 06:30:02
|
Hi Navid, It will be at least a week before its out, as this one's going to have a lot of docs... However, if you should need the version asap - dont worry - this is an open source project. Simply check out the latest snapshot from CVS, and run the build.xml file (using ant). The latest build will be at your disposal. If this is too complex, let me know.. I can make a build specifically for you, but only if u have cvs problems. Regards, Somik ----- Original Message ----- From: "Navid H.Langaroudi" <na...@ya...> To: <htm...@li...> Sent: Tuesday, November 19, 2002 10:25 PM Subject: Re: [Htmlparser-user] How to use it. > Hi Somik, > It is good the bug is fixed. But when and where I get > the next version. I really need it. > > Thanks > Navid > > --- Somik Raha <so...@ya...> wrote: > > Hi Navid, > > Thanks for this bug report! I was able to > > reproduce it with testcases. > > Its fixed now. > > Folks : The next version will be having this bug > > fix. Also, I am > > refactoring all the package names from com.kizna.* > > to org.htmlparser.*. > > > > This would cause you to change your package > > names, and I must apologize > > for the inconvenience, but this would be the first > > and last change. The next > > release is coming with nicer docs.. > > > > Regards, > > Somik > > ----- Original Message ----- > > From: "Somik Raha" <so...@ya...> > > To: <htm...@li...> > > Sent: Tuesday, November 19, 2002 1:54 PM > > Subject: Re: [Htmlparser-user] How to use it. > > > > > > > Hi Navid, > > > That does sound like a bug. I will look into it > > > tonight. Meanwhile, could you enter this bug > > report in > > > the htmlparser website ? > > > (http://htmlparser.sourceforge.net). > > > > > > Regards, > > > Somik > > > --- "Navid H.Langaroudi" <na...@ya...> > > wrote: > > > > Hi Somik, > > > > I could do some parts of my program. And the > > result > > > > was great. Thanks for all your helps. But > > yesterday > > > > I > > > > found a bug, or most probably it is my mistake. > > > > I used the sample in HTMLParser.java > > > > > > > > * </pre> > > > > * The above snippet will print out only the > > text > > > > contents in the html document.<br> > > > > * Here's another snippet that will only print > > out > > > > the > > > > link urls in a document. > > > > * This is an example of adding a link scanner. > > > > * <pre> > > > > * HTMLParser parser = new > > > > HTMLParser("http://www.yahoo.com"); > > > > * parser.addScanner(new HTMLLinkScanner("-l")); > > > > * for (Enumeration e = > > > > parser.elements();e.hasMoreElements();) { > > > > * HTMLNode node = (HTMLNode)e.nextElement(); > > > > * if (node instanceof HTMLLinkTag) { > > > > * HTMLLinkTag linkTag = > > (HTMLLinkTag)node; > > > > * System.out.println(linkTag.getLink()); > > > > * } > > > > * } > > > > * </pre> > > > > * @see HTMLParser#elements() > > > > > > > > It extract all urls of a page. But if the page > > url > > > > be > > > > something like > > http://www.mysite.com/books/some.asp > > > > and for example page has a link as > > > > href="newpage.html" > > > > > > > > then the extracted url will be: > > > > > > http://www.mysite.com/books/some.asp/newpage.html > > > > which is wrong! > > > > only if the instead some.asp I have some.html > > then > > > > the > > > > url will be correct! > > > > http://www.mysite.com/books/newpage.html > > > > > > > > Would you please give me some guide line. > > > > Thank you very much. > > > > > > > > Nav > > > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > > Navid wrote : > > > > > > Thank you Somik, I could run some program to > > use > > > > > > HTMLparser classes. But my question is , > > Does > > > > > > HTMLparser works as a standalone program? > > > > > > > > > > The standalone is just a demonstration. > > HTMLParser > > > > > is a library - you are > > > > > expected to use it in your application. Check > > > > > HTMLParser.java for more > > > > > details. > > > > > > > > > > Regards, > > > > > Somik > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > ------------------------------------------------------- > > > > > This sf.net email is sponsored by: To learn > > the > > > > > basics of securing > > > > > your web site with SSL, click here to get a > > FREE > > > > > TRIAL of a Thawte > > > > > Server Certificate: > > > > > http://www.gothawte.com/rd524.html > > > > > > > _______________________________________________ > > > > > Htmlparser-user mailing list > > > > > Htm...@li... > > > > > > > > > > > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > > > > > > > > > > > __________________________________________________ > > > > Do you Yahoo!? > > > > Yahoo! Web Hosting - Let the expert host your > > site > > > > http://webhosting.yahoo.com > > > > > > > > > > > > > > > > > > ------------------------------------------------------- > > > > This sf.net email is sponsored by: To learn the > > > > basics of securing > > > > your web site with SSL, click here to get a FREE > > > > TRIAL of a Thawte > > > > Server Certificate: > > > > http://www.gothawte.com/rd524.html > > > > _______________________________________________ > > > > Htmlparser-user mailing list > > > > Htm...@li... > > > > > > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > > > > > > __________________________________________________ > > > Do you Yahoo!? > > > Yahoo! Web Hosting - Let the expert host your site > > > http://webhosting.yahoo.com > > > > > > > > > > > > ------------------------------------------------------- > > > This sf.net email is sponsored by: To learn the > > basics of securing > > > your web site with SSL, click here to get a FREE > > TRIAL of a Thawte > > > Server Certificate: > > http://www.gothawte.com/rd524.html > > > _______________________________________________ > > > Htmlparser-user mailing list > > > Htm...@li... > > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > > > > > > ------------------------------------------------------- > > This sf.net email is sponsored by: To learn the > > basics of securing > > your web site with SSL, click here to get a FREE > > TRIAL of a Thawte > > Server Certificate: > > http://www.gothawte.com/rd524.html > > _______________________________________________ > > Htmlparser-user mailing list > > Htm...@li... > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > __________________________________________________ > Do you Yahoo!? > Yahoo! Web Hosting - Let the expert host your site > http://webhosting.yahoo.com > > > ------------------------------------------------------- > This sf.net email is sponsored by: To learn the basics of securing > your web site with SSL, click here to get a FREE TRIAL of a Thawte > Server Certificate: http://www.gothawte.com/rd524.html > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user |
From: Navid H.L. <na...@ya...> - 2002-11-20 19:11:48
|
Hi Somik, I really appreciate it if you could make the build for me. I have little knowledge of XML. I am not sure how this build.xml could compile java! Can you give me some lead on internet that I could read about it. Or if there is a book or source. And I haven't work with CVS before, I just read about it on internet. I guess it is a usful system. Any way, it would be great if you could provide me the new build. Thank you very much. Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > It will be at least a week before its out, as > this one's going to have a > lot of docs... > However, if you should need the version asap - > dont worry - this is an > open source project. > Simply check out the latest snapshot from CVS, > and run the build.xml > file (using ant). The latest build will be at your > disposal. > If this is too complex, let me know.. I can make > a build specifically > for you, but only if u have cvs problems. > Regards, > Somik > ----- Original Message ----- > From: "Navid H.Langaroudi" <na...@ya...> > To: <htm...@li...> > Sent: Tuesday, November 19, 2002 10:25 PM > Subject: Re: [Htmlparser-user] How to use it. > > > > Hi Somik, > > It is good the bug is fixed. But when and where I > get > > the next version. I really need it. > > > > Thanks > > Navid > > > > --- Somik Raha <so...@ya...> wrote: > > > Hi Navid, > > > Thanks for this bug report! I was able to > > > reproduce it with testcases. > > > Its fixed now. > > > Folks : The next version will be having this > bug > > > fix. Also, I am > > > refactoring all the package names from > com.kizna.* > > > to org.htmlparser.*. > > > > > > This would cause you to change your package > > > names, and I must apologize > > > for the inconvenience, but this would be the > first > > > and last change. The next > > > release is coming with nicer docs.. > > > > > > Regards, > > > Somik > > > ----- Original Message ----- > > > From: "Somik Raha" <so...@ya...> > > > To: <htm...@li...> > > > Sent: Tuesday, November 19, 2002 1:54 PM > > > Subject: Re: [Htmlparser-user] How to use it. > > > > > > > > > > Hi Navid, > > > > That does sound like a bug. I will look into > it > > > > tonight. Meanwhile, could you enter this bug > > > report in > > > > the htmlparser website ? > > > > (http://htmlparser.sourceforge.net). > > > > > > > > Regards, > > > > Somik > > > > --- "Navid H.Langaroudi" <na...@ya...> > > > wrote: > > > > > Hi Somik, > > > > > I could do some parts of my program. And the > > > result > > > > > was great. Thanks for all your helps. But > > > yesterday > > > > > I > > > > > found a bug, or most probably it is my > mistake. > > > > > I used the sample in HTMLParser.java > > > > > > > > > > * </pre> > > > > > * The above snippet will print out only the > > > text > > > > > contents in the html document.<br> > > > > > * Here's another snippet that will only > print > > > out > > > > > the > > > > > link urls in a document. > > > > > * This is an example of adding a link > scanner. > > > > > * <pre> > > > > > * HTMLParser parser = new > > > > > HTMLParser("http://www.yahoo.com"); > > > > > * parser.addScanner(new > HTMLLinkScanner("-l")); > > > > > * for (Enumeration e = > > > > > parser.elements();e.hasMoreElements();) { > > > > > * HTMLNode node = > (HTMLNode)e.nextElement(); > > > > > * if (node instanceof HTMLLinkTag) { > > > > > * HTMLLinkTag linkTag = > > > (HTMLLinkTag)node; > > > > > * > System.out.println(linkTag.getLink()); > > > > > * } > > > > > * } > > > > > * </pre> > > > > > * @see HTMLParser#elements() > > > > > > > > > > It extract all urls of a page. But if the > page > > > url > > > > > be > > > > > something like > > > http://www.mysite.com/books/some.asp > > > > > and for example page has a link as > > > > > href="newpage.html" > > > > > > > > > > then the extracted url will be: > > > > > > > > > http://www.mysite.com/books/some.asp/newpage.html > > > > > which is wrong! > > > > > only if the instead some.asp I have > some.html > > > then > > > > > the > > > > > url will be correct! > > > > > http://www.mysite.com/books/newpage.html > > > > > > > > > > Would you please give me some guide line. > > > > > Thank you very much. > > > > > > > > > > Nav > > > > > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > > > Navid wrote : > > > > > > > Thank you Somik, I could run some > program to > > > use > > > > > > > HTMLparser classes. But my question is , > > > Does > > > > > > > HTMLparser works as a standalone > program? > > > > > > > > > > > > The standalone is just a demonstration. > > > HTMLParser > > > > > > is a library - you are > > > > > > expected to use it in your application. > Check > > > > > > HTMLParser.java for more > > > > > > details. > > > > > > > > > > > > Regards, > > > > > > Somik > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > ------------------------------------------------------- > > > > > > This sf.net email is sponsored by: To > learn > > > the > > > > > > basics of securing > > > > > > your web site with SSL, click here to get > a > > > FREE > > > > > > TRIAL of a Thawte > > > > > > Server Certificate: > > > > > > http://www.gothawte.com/rd524.html > > > > > > > > > _______________________________________________ > > > > > > Htmlparser-user mailing list > > > > > > Htm...@li... > > > > > > > > > > > > > > > > > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > > > > > > > > > > > > > > > > __________________________________________________ > > > > > Do you Yahoo!? > > > > > Yahoo! Web Hosting - Let the expert host > your > > > site > > > > > http://webhosting.yahoo.com > > > > > > > > > > > > > > > > > > > > > > > === message truncated === __________________________________________________ Do you Yahoo!? Yahoo! Web Hosting - Let the expert host your site http://webhosting.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-21 07:13:10
|
Hi Navid, Boy, Ive had to stay up late making a release for you... :) Here's the url - http://htmlparser.sourceforge.net/htmlparser1_2_20021120.zip This is not an integration release per se- its got the bug fix that you want, but beware, the package names have changed - so change your programs accordingly - com.kizna.html.* is now org.htmlparser.* Let me know how it goes.. Regards, Somik ----- Original Message ----- From: "Navid H.Langaroudi" <na...@ya...> To: <htm...@li...> Sent: Wednesday, November 20, 2002 10:57 AM Subject: Re: [Htmlparser-user] How to use it. > Hi Somik, > I really appreciate it if you could make the build for > me. I have little knowledge of XML. I am not sure how > this build.xml could compile java! Can you give me > some lead on internet that I could read about it. > Or if there is a book or source. > > And I haven't work with CVS before, I just read about > it on internet. I guess it is a usful system. > > Any way, it would be great if you could provide me the > new build. > > Thank you very much. > Navid > > --- Somik Raha <so...@ya...> wrote: > > Hi Navid, > > It will be at least a week before its out, as > > this one's going to have a > > lot of docs... > > However, if you should need the version asap - > > dont worry - this is an > > open source project. > > Simply check out the latest snapshot from CVS, > > and run the build.xml > > file (using ant). The latest build will be at your > > disposal. > > If this is too complex, let me know.. I can make > > a build specifically > > for you, but only if u have cvs problems. > > Regards, > > Somik > > ----- Original Message ----- > > From: "Navid H.Langaroudi" <na...@ya...> > > To: <htm...@li...> > > Sent: Tuesday, November 19, 2002 10:25 PM > > Subject: Re: [Htmlparser-user] How to use it. > > > > > > > Hi Somik, > > > It is good the bug is fixed. But when and where I > > get > > > the next version. I really need it. > > > > > > Thanks > > > Navid > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > Hi Navid, > > > > Thanks for this bug report! I was able to > > > > reproduce it with testcases. > > > > Its fixed now. > > > > Folks : The next version will be having this > > bug > > > > fix. Also, I am > > > > refactoring all the package names from > > com.kizna.* > > > > to org.htmlparser.*. > > > > > > > > This would cause you to change your package > > > > names, and I must apologize > > > > for the inconvenience, but this would be the > > first > > > > and last change. The next > > > > release is coming with nicer docs.. > > > > > > > > Regards, > > > > Somik > > > > ----- Original Message ----- > > > > From: "Somik Raha" <so...@ya...> > > > > To: <htm...@li...> > > > > Sent: Tuesday, November 19, 2002 1:54 PM > > > > Subject: Re: [Htmlparser-user] How to use it. > > > > > > > > > > > > > Hi Navid, > > > > > That does sound like a bug. I will look into > > it > > > > > tonight. Meanwhile, could you enter this bug > > > > report in > > > > > the htmlparser website ? > > > > > (http://htmlparser.sourceforge.net). > > > > > > > > > > Regards, > > > > > Somik > > > > > --- "Navid H.Langaroudi" <na...@ya...> > > > > wrote: > > > > > > Hi Somik, > > > > > > I could do some parts of my program. And the > > > > result > > > > > > was great. Thanks for all your helps. But > > > > yesterday > > > > > > I > > > > > > found a bug, or most probably it is my > > mistake. > > > > > > I used the sample in HTMLParser.java > > > > > > > > > > > > * </pre> > > > > > > * The above snippet will print out only the > > > > text > > > > > > contents in the html document.<br> > > > > > > * Here's another snippet that will only > > print > > > > out > > > > > > the > > > > > > link urls in a document. > > > > > > * This is an example of adding a link > > scanner. > > > > > > * <pre> > > > > > > * HTMLParser parser = new > > > > > > HTMLParser("http://www.yahoo.com"); > > > > > > * parser.addScanner(new > > HTMLLinkScanner("-l")); > > > > > > * for (Enumeration e = > > > > > > parser.elements();e.hasMoreElements();) { > > > > > > * HTMLNode node = > > (HTMLNode)e.nextElement(); > > > > > > * if (node instanceof HTMLLinkTag) { > > > > > > * HTMLLinkTag linkTag = > > > > (HTMLLinkTag)node; > > > > > > * > > System.out.println(linkTag.getLink()); > > > > > > * } > > > > > > * } > > > > > > * </pre> > > > > > > * @see HTMLParser#elements() > > > > > > > > > > > > It extract all urls of a page. But if the > > page > > > > url > > > > > > be > > > > > > something like > > > > http://www.mysite.com/books/some.asp > > > > > > and for example page has a link as > > > > > > href="newpage.html" > > > > > > > > > > > > then the extracted url will be: > > > > > > > > > > > > http://www.mysite.com/books/some.asp/newpage.html > > > > > > which is wrong! > > > > > > only if the instead some.asp I have > > some.html > > > > then > > > > > > the > > > > > > url will be correct! > > > > > > http://www.mysite.com/books/newpage.html > > > > > > > > > > > > Would you please give me some guide line. > > > > > > Thank you very much. > > > > > > > > > > > > Nav > > > > > > > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > > > > Navid wrote : > > > > > > > > Thank you Somik, I could run some > > program to > > > > use > > > > > > > > HTMLparser classes. But my question is , > > > > Does > > > > > > > > HTMLparser works as a standalone > > program? > > > > > > > > > > > > > > The standalone is just a demonstration. > > > > HTMLParser > > > > > > > is a library - you are > > > > > > > expected to use it in your application. > > Check > > > > > > > HTMLParser.java for more > > > > > > > details. > > > > > > > > > > > > > > Regards, > > > > > > > Somik > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > ------------------------------------------------------- > > > > > > > This sf.net email is sponsored by: To > > learn > > > > the > > > > > > > basics of securing > > > > > > > your web site with SSL, click here to get > > a > > > > FREE > > > > > > > TRIAL of a Thawte > > > > > > > Server Certificate: > > > > > > > http://www.gothawte.com/rd524.html > > > > > > > > > > > _______________________________________________ > > > > > > > Htmlparser-user mailing list > > > > > > > Htm...@li... > > > > > > > > > > > > > > > > > > > > > > > > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > > > > > > > > > > > > > > > > > > > > > __________________________________________________ > > > > > > Do you Yahoo!? > > > > > > Yahoo! Web Hosting - Let the expert host > > your > > > > site > > > > > > http://webhosting.yahoo.com > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > === message truncated === > > > __________________________________________________ > Do you Yahoo!? > Yahoo! Web Hosting - Let the expert host your site > http://webhosting.yahoo.com > > > ------------------------------------------------------- > This sf.net email is sponsored by: > Battle your brains against the best in the Thawte Crypto > Challenge. Be the first to crack the code - register now: > http://www.gothawte.com/rd521.html > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user |
From: Navid H.L. <na...@ya...> - 2002-11-21 20:25:16
|
Hi Somik, Thanks a million times! I replaced the htmlparser.jar file and I made the changes in package names. Now my code runs, but I have a small error. in a line: for (Enumeration f = parserTitle.elements();f.hasMoreElements();) { it gave me an error for incompatible type. says, HTMLParserEnumeration type found, but Enumeration type required. So I just cast it. It disapeared, and the it says an HTMLParserException is not handled! Is there a solution for this? If my explanation is not clear let me know. And about the other problem, yes, it is fixed, I should have use the register(). Thank you very much, Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > Boy, Ive had to stay up late making a release > for you... :) > Here's the url - > http://htmlparser.sourceforge.net/htmlparser1_2_20021120.zip > This is not an integration release per se- its > got the bug fix that you > want, but beware, the package names have changed - > so change your programs > accordingly - com.kizna.html.* is now > org.htmlparser.* > > Let me know how it goes.. > > Regards, > Somik > > ----- Original Message ----- > From: "Navid H.Langaroudi" <na...@ya...> > To: <htm...@li...> > Sent: Wednesday, November 20, 2002 10:57 AM > Subject: Re: [Htmlparser-user] How to use it. > > > > Hi Somik, > > I really appreciate it if you could make the build > for > > me. I have little knowledge of XML. I am not sure > how > > this build.xml could compile java! Can you give me > > some lead on internet that I could read about it. > > Or if there is a book or source. > > > > And I haven't work with CVS before, I just read > about > > it on internet. I guess it is a usful system. > > > > Any way, it would be great if you could provide me > the > > new build. > > > > Thank you very much. > > Navid > > > > --- Somik Raha <so...@ya...> wrote: > > > Hi Navid, > > > It will be at least a week before its out, > as > > > this one's going to have a > > > lot of docs... > > > However, if you should need the version asap > - > > > dont worry - this is an > > > open source project. > > > Simply check out the latest snapshot from > CVS, > > > and run the build.xml > > > file (using ant). The latest build will be at > your > > > disposal. > > > If this is too complex, let me know.. I can > make > > > a build specifically > > > for you, but only if u have cvs problems. > > > Regards, > > > Somik > > > ----- Original Message ----- > > > From: "Navid H.Langaroudi" <na...@ya...> > > > To: <htm...@li...> > > > Sent: Tuesday, November 19, 2002 10:25 PM > > > Subject: Re: [Htmlparser-user] How to use it. > > > > > > > > > > Hi Somik, > > > > It is good the bug is fixed. But when and > where I > > > get > > > > the next version. I really need it. > > > > > > > > Thanks > > > > Navid > > > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > > Hi Navid, > > > > > Thanks for this bug report! I was able > to > > > > > reproduce it with testcases. > > > > > Its fixed now. > > > > > Folks : The next version will be having > this > > > bug > > > > > fix. Also, I am > > > > > refactoring all the package names from > > > com.kizna.* > > > > > to org.htmlparser.*. > > > > > > > > > > This would cause you to change your > package > > > > > names, and I must apologize > > > > > for the inconvenience, but this would be the > > > first > > > > > and last change. The next > > > > > release is coming with nicer docs.. > > > > > > > > > > Regards, > > > > > Somik > > > > > ----- Original Message ----- > > > > > From: "Somik Raha" <so...@ya...> > > > > > To: <htm...@li...> > > > > > Sent: Tuesday, November 19, 2002 1:54 PM > > > > > Subject: Re: [Htmlparser-user] How to use > it. > > > > > > > > > > > > > > > > Hi Navid, > > > > > > That does sound like a bug. I will look > into > > > it > > > > > > tonight. Meanwhile, could you enter this > bug > > > > > report in > > > > > > the htmlparser website ? > > > > > > (http://htmlparser.sourceforge.net). > > > > > > > > > > > > Regards, > > > > > > Somik > > > > > > --- "Navid H.Langaroudi" > <na...@ya...> > > > > > wrote: > > > > > > > Hi Somik, > > > > > > > I could do some parts of my program. And > the > > > > > result > > > > > > > was great. Thanks for all your helps. > But > > > > > yesterday > > > > > > > I > > > > > > > found a bug, or most probably it is my > > > mistake. > > > > > > > I used the sample in HTMLParser.java > > > > > > > > > > > > > > * </pre> > > > > > > > * The above snippet will print out only > the > > > > > text > > > > > > > contents in the html document.<br> > > > > > > > * Here's another snippet that will only > > > print > > > > > out > > > > > > > the > > > > > > > link urls in a document. > > > > > > > * This is an example of adding a link > > > scanner. > > > > > > > * <pre> > > > > > > > * HTMLParser parser = new > > > > > > > HTMLParser("http://www.yahoo.com"); > > > > > > > * parser.addScanner(new > > > HTMLLinkScanner("-l")); > > > > > > > * for (Enumeration e = > > > > > > > parser.elements();e.hasMoreElements();) > { > > > > > > > * HTMLNode node = > > > (HTMLNode)e.nextElement(); > > > > > > > * if (node instanceof HTMLLinkTag) { > > > > > > > * HTMLLinkTag linkTag = > > > > > (HTMLLinkTag)node; > > > > > > > * > > > System.out.println(linkTag.getLink()); > > > > > > > * } > > > > > > > * } > > > > > > > * </pre> > > > > > > > * @see HTMLParser#elements() > > > > > > > > > > > > > > It extract all urls of a page. But if > the > > > page > > > > > url > > > > > > > be > > > > > > > something like > > > > > http://www.mysite.com/books/some.asp > > > > > > > and for example page has a link as > > > > > > > href="newpage.html" > > > > > > > > > > > > > > then the extracted url will be: > > > > > > > > > > > > > > > > http://www.mysite.com/books/some.asp/newpage.html > > > > > > > which is wrong! > > > > > > > only if the instead some.asp I have > > > some.html > > > > > then > > > > > > > the > > > > > > > url will be correct! > > > > > > > http://www.mysite.com/books/newpage.html > > > > > > > > > > > > > > Would you please give me some guide > line. > > > > > > > Thank you very much. > > > > > > > > > > > > > > Nav > === message truncated === __________________________________________________ Do you Yahoo!? Yahoo! Mail Plus Powerful. Affordable. Sign up now. http://mailplus.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-21 21:23:39
|
Hi Navid, As always, you really need to look at the docs in HTMLParser.java. The loop should be : for (HTMLEnumeration e = parser.elements();e.hasMoreNodes();) { HTMLNode node = e.nextHTMLNode(); //... } Regards, Somik --- "Navid H.Langaroudi" <na...@ya...> wrote: > Hi Somik, > Thanks a million times! > I replaced the htmlparser.jar file and I made the > changes in package names. > Now my code runs, but I have a small error. > in a line: > for (Enumeration f = > parserTitle.elements();f.hasMoreElements();) { > > it gave me an error for incompatible type. > says, HTMLParserEnumeration type found, but > Enumeration type required. So I just cast it. It > disapeared, and the it says an HTMLParserException > is > not handled! > Is there a solution for this? > If my explanation is not clear let me know. > > And about the other problem, yes, it is fixed, I > should have use the register(). > > > Thank you very much, > Navid > > --- Somik Raha <so...@ya...> wrote: > > Hi Navid, > > Boy, Ive had to stay up late making a release > > for you... :) > > Here's the url - > > > http://htmlparser.sourceforge.net/htmlparser1_2_20021120.zip > > This is not an integration release per se- its > > got the bug fix that you > > want, but beware, the package names have changed - > > so change your programs > > accordingly - com.kizna.html.* is now > > org.htmlparser.* > > > > Let me know how it goes.. > > > > Regards, > > Somik > > > > ----- Original Message ----- > > From: "Navid H.Langaroudi" <na...@ya...> > > To: <htm...@li...> > > Sent: Wednesday, November 20, 2002 10:57 AM > > Subject: Re: [Htmlparser-user] How to use it. > > > > > > > Hi Somik, > > > I really appreciate it if you could make the > build > > for > > > me. I have little knowledge of XML. I am not > sure > > how > > > this build.xml could compile java! Can you give > me > > > some lead on internet that I could read about > it. > > > Or if there is a book or source. > > > > > > And I haven't work with CVS before, I just read > > about > > > it on internet. I guess it is a usful system. > > > > > > Any way, it would be great if you could provide > me > > the > > > new build. > > > > > > Thank you very much. > > > Navid > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > Hi Navid, > > > > It will be at least a week before its out, > > as > > > > this one's going to have a > > > > lot of docs... > > > > However, if you should need the version > asap > > - > > > > dont worry - this is an > > > > open source project. > > > > Simply check out the latest snapshot from > > CVS, > > > > and run the build.xml > > > > file (using ant). The latest build will be at > > your > > > > disposal. > > > > If this is too complex, let me know.. I > can > > make > > > > a build specifically > > > > for you, but only if u have cvs problems. > > > > Regards, > > > > Somik > > > > ----- Original Message ----- > > > > From: "Navid H.Langaroudi" <na...@ya...> > > > > To: <htm...@li...> > > > > Sent: Tuesday, November 19, 2002 10:25 PM > > > > Subject: Re: [Htmlparser-user] How to use it. > > > > > > > > > > > > > Hi Somik, > > > > > It is good the bug is fixed. But when and > > where I > > > > get > > > > > the next version. I really need it. > > > > > > > > > > Thanks > > > > > Navid > > > > > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > > > Hi Navid, > > > > > > Thanks for this bug report! I was able > > to > > > > > > reproduce it with testcases. > > > > > > Its fixed now. > > > > > > Folks : The next version will be > having > > this > > > > bug > > > > > > fix. Also, I am > > > > > > refactoring all the package names from > > > > com.kizna.* > > > > > > to org.htmlparser.*. > > > > > > > > > > > > This would cause you to change your > > package > > > > > > names, and I must apologize > > > > > > for the inconvenience, but this would be > the > > > > first > > > > > > and last change. The next > > > > > > release is coming with nicer docs.. > > > > > > > > > > > > Regards, > > > > > > Somik > > > > > > ----- Original Message ----- > > > > > > From: "Somik Raha" <so...@ya...> > > > > > > To: > <htm...@li...> > > > > > > Sent: Tuesday, November 19, 2002 1:54 PM > > > > > > Subject: Re: [Htmlparser-user] How to use > > it. > > > > > > > > > > > > > > > > > > > Hi Navid, > > > > > > > That does sound like a bug. I will > look > > into > > > > it > > > > > > > tonight. Meanwhile, could you enter this > > bug > > > > > > report in > > > > > > > the htmlparser website ? > > > > > > > (http://htmlparser.sourceforge.net). > > > > > > > > > > > > > > Regards, > > > > > > > Somik > > > > > > > --- "Navid H.Langaroudi" > > <na...@ya...> > > > > > > wrote: > > > > > > > > Hi Somik, > > > > > > > > I could do some parts of my program. > And > > the > > > > > > result > > > > > > > > was great. Thanks for all your helps. > > But > > > > > > yesterday > > > > > > > > I > > > > > > > > found a bug, or most probably it is my > > > > mistake. > > > > > > > > I used the sample in HTMLParser.java > > > > > > > > > > > > > > > > * </pre> > > > > > > > > * The above snippet will print out > only > > the > > > > > > text > > > > > > > > contents in the html document.<br> > > > > > > > > * Here's another snippet that will > only > > > > print > > > > > > out > > > > > > > > the > > > > > > > > link urls in a document. > > > > > > > > * This is an example of adding a link > > > > scanner. > > > > > > > > * <pre> > > > > > > > > * HTMLParser parser = new > > > > > > > > HTMLParser("http://www.yahoo.com"); > > > > > > > > * parser.addScanner(new > > > > HTMLLinkScanner("-l")); > > > > > > > > * for (Enumeration e = > > > > > > > > > parser.elements();e.hasMoreElements();) > === message truncated === __________________________________________________ Do you Yahoo!? Yahoo! Mail Plus Powerful. Affordable. Sign up now. http://mailplus.yahoo.com |
From: Navid H.L. <na...@ya...> - 2002-11-22 02:07:55
|
Thanks Somik, things are working. Lets see how I progress! Thanks Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > As always, you really need to look at the docs in > HTMLParser.java. > The loop should be : > for (HTMLEnumeration e = > parser.elements();e.hasMoreNodes();) { > HTMLNode node = e.nextHTMLNode(); > //... > } > > Regards, > Somik > --- "Navid H.Langaroudi" <na...@ya...> wrote: > > Hi Somik, > > Thanks a million times! > > I replaced the htmlparser.jar file and I made the > > changes in package names. > > Now my code runs, but I have a small error. > > in a line: > > for (Enumeration f = > > parserTitle.elements();f.hasMoreElements();) { > > > > it gave me an error for incompatible type. > > says, HTMLParserEnumeration type found, but > > Enumeration type required. So I just cast it. It > > disapeared, and the it says an HTMLParserException > > is > > not handled! > > Is there a solution for this? > > If my explanation is not clear let me know. > > > > And about the other problem, yes, it is fixed, I > > should have use the register(). > > > > > > Thank you very much, > > Navid > > > > --- Somik Raha <so...@ya...> wrote: > > > Hi Navid, > > > Boy, Ive had to stay up late making a > release > > > for you... :) > > > Here's the url - > > > > > > http://htmlparser.sourceforge.net/htmlparser1_2_20021120.zip > > > This is not an integration release per se- > its > > > got the bug fix that you > > > want, but beware, the package names have changed > - > > > so change your programs > > > accordingly - com.kizna.html.* is now > > > org.htmlparser.* > > > > > > Let me know how it goes.. > > > > > > Regards, > > > Somik > > > > > > ----- Original Message ----- > > > From: "Navid H.Langaroudi" <na...@ya...> > > > To: <htm...@li...> > > > Sent: Wednesday, November 20, 2002 10:57 AM > > > Subject: Re: [Htmlparser-user] How to use it. > > > > > > > > > > Hi Somik, > > > > I really appreciate it if you could make the > > build > > > for > > > > me. I have little knowledge of XML. I am not > > sure > > > how > > > > this build.xml could compile java! Can you > give > > me > > > > some lead on internet that I could read about > > it. > > > > Or if there is a book or source. > > > > > > > > And I haven't work with CVS before, I just > read > > > about > > > > it on internet. I guess it is a usful system. > > > > > > > > Any way, it would be great if you could > provide > > me > > > the > > > > new build. > > > > > > > > Thank you very much. > > > > Navid > > > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > > Hi Navid, > > > > > It will be at least a week before its > out, > > > as > > > > > this one's going to have a > > > > > lot of docs... > > > > > However, if you should need the version > > asap > > > - > > > > > dont worry - this is an > > > > > open source project. > > > > > Simply check out the latest snapshot > from > > > CVS, > > > > > and run the build.xml > > > > > file (using ant). The latest build will be > at > > > your > > > > > disposal. > > > > > If this is too complex, let me know.. I > > can > > > make > > > > > a build specifically > > > > > for you, but only if u have cvs problems. > > > > > Regards, > > > > > Somik > > > > > ----- Original Message ----- > > > > > From: "Navid H.Langaroudi" > <na...@ya...> > > > > > To: <htm...@li...> > > > > > Sent: Tuesday, November 19, 2002 10:25 PM > > > > > Subject: Re: [Htmlparser-user] How to use > it. > > > > > > > > > > > > > > > > Hi Somik, > > > > > > It is good the bug is fixed. But when and > > > where I > > > > > get > > > > > > the next version. I really need it. > > > > > > > > > > > > Thanks > > > > > > Navid > > > > > > > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > > > > Hi Navid, > > > > > > > Thanks for this bug report! I was > able > > > to > > > > > > > reproduce it with testcases. > > > > > > > Its fixed now. > > > > > > > Folks : The next version will be > > having > > > this > > > > > bug > > > > > > > fix. Also, I am > > > > > > > refactoring all the package names from > > > > > com.kizna.* > > > > > > > to org.htmlparser.*. > > > > > > > > > > > > > > This would cause you to change your > > > package > > > > > > > names, and I must apologize > > > > > > > for the inconvenience, but this would be > > the > > > > > first > > > > > > > and last change. The next > > > > > > > release is coming with nicer docs.. > > > > > > > > > > > > > > Regards, > > > > > > > Somik > > > > > > > ----- Original Message ----- > > > > > > > From: "Somik Raha" <so...@ya...> > > > > > > > To: > > <htm...@li...> > > > > > > > Sent: Tuesday, November 19, 2002 1:54 PM > > > > > > > Subject: Re: [Htmlparser-user] How to > use > > > it. > > > > > > > > > > > > > > > > > > > > > > Hi Navid, > > > > > > > > That does sound like a bug. I will > > look > > > into > > > > > it > > > > > > > > tonight. Meanwhile, could you enter > this > > > bug > > > > > > > report in > > > > > > > > the htmlparser website ? > > > > > > > > (http://htmlparser.sourceforge.net). > > > > > > > > > > > > > > > > Regards, > > > > > > > > Somik > > > > > > > > --- "Navid H.Langaroudi" > > > <na...@ya...> > > > > > > > wrote: > > > > > > > > > Hi Somik, > > > > > > > > > I could do some parts of my program. > > And > > > the > > > > > > > result > > > > > > > > > was great. Thanks for all your > helps. > === message truncated === __________________________________________________ Do you Yahoo!? Yahoo! Mail Plus Powerful. Affordable. Sign up now. http://mailplus.yahoo.com |
From: Navid H.L. <na...@ya...> - 2002-11-23 04:31:50
|
Hi Somik, How you doing? I still need your help. I am getting some exception errors in my program, it happens when it tried to open a url to a non existing page. I tried some try/catch, but still can't catch this one. It says Error: HTMLParser.oprnURLConnection(): Error in opening a URL connection to http://www.somenoneexistingurl.com ....... How can I skip this, my program reads all urls of a site and tries to go to next page, and if the url does not exist or is wrong, it terminats, I think I should control this in order to let the program carry on on correct links. Thanks for your help. Have a nice weekend. By the way, Are you living in San Francisco? I am in LosAngeles. Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > As always, you really need to look at the docs in > HTMLParser.java. > The loop should be : > for (HTMLEnumeration e = > parser.elements();e.hasMoreNodes();) { > HTMLNode node = e.nextHTMLNode(); > //... > } > > Regards, > Somik > --- "Navid H.Langaroudi" <na...@ya...> wrote: > > Hi Somik, > > Thanks a million times! > > I replaced the htmlparser.jar file and I made the > > changes in package names. > > Now my code runs, but I have a small error. > > in a line: > > for (Enumeration f = > > parserTitle.elements();f.hasMoreElements();) { > > > > it gave me an error for incompatible type. > > says, HTMLParserEnumeration type found, but > > Enumeration type required. So I just cast it. It > > disapeared, and the it says an HTMLParserException > > is > > not handled! > > Is there a solution for this? > > If my explanation is not clear let me know. > > > > And about the other problem, yes, it is fixed, I > > should have use the register(). > > > > > > Thank you very much, > > Navid > > > > --- Somik Raha <so...@ya...> wrote: > > > Hi Navid, > > > Boy, Ive had to stay up late making a > release > > > for you... :) > > > Here's the url - > > > > > > http://htmlparser.sourceforge.net/htmlparser1_2_20021120.zip > > > This is not an integration release per se- > its > > > got the bug fix that you > > > want, but beware, the package names have changed > - > > > so change your programs > > > accordingly - com.kizna.html.* is now > > > org.htmlparser.* > > > > > > Let me know how it goes.. > > > > > > Regards, > > > Somik > > > > > > ----- Original Message ----- > > > From: "Navid H.Langaroudi" <na...@ya...> > > > To: <htm...@li...> > > > Sent: Wednesday, November 20, 2002 10:57 AM > > > Subject: Re: [Htmlparser-user] How to use it. > > > > > > > > > > Hi Somik, > > > > I really appreciate it if you could make the > > build > > > for > > > > me. I have little knowledge of XML. I am not > > sure > > > how > > > > this build.xml could compile java! Can you > give > > me > > > > some lead on internet that I could read about > > it. > > > > Or if there is a book or source. > > > > > > > > And I haven't work with CVS before, I just > read > > > about > > > > it on internet. I guess it is a usful system. > > > > > > > > Any way, it would be great if you could > provide > > me > > > the > > > > new build. > > > > > > > > Thank you very much. > > > > Navid > > > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > > Hi Navid, > > > > > It will be at least a week before its > out, > > > as > > > > > this one's going to have a > > > > > lot of docs... > > > > > However, if you should need the version > > asap > > > - > > > > > dont worry - this is an > > > > > open source project. > > > > > Simply check out the latest snapshot > from > > > CVS, > > > > > and run the build.xml > > > > > file (using ant). The latest build will be > at > > > your > > > > > disposal. > > > > > If this is too complex, let me know.. I > > can > > > make > > > > > a build specifically > > > > > for you, but only if u have cvs problems. > > > > > Regards, > > > > > Somik > > > > > ----- Original Message ----- > > > > > From: "Navid H.Langaroudi" > <na...@ya...> > > > > > To: <htm...@li...> > > > > > Sent: Tuesday, November 19, 2002 10:25 PM > > > > > Subject: Re: [Htmlparser-user] How to use > it. > > > > > > > > > > > > > > > > Hi Somik, > > > > > > It is good the bug is fixed. But when and > > > where I > > > > > get > > > > > > the next version. I really need it. > > > > > > > > > > > > Thanks > > > > > > Navid > > > > > > > > > > > > --- Somik Raha <so...@ya...> wrote: > > > > > > > Hi Navid, > > > > > > > Thanks for this bug report! I was > able > > > to > > > > > > > reproduce it with testcases. > > > > > > > Its fixed now. > > > > > > > Folks : The next version will be > > having > > > this > > > > > bug > > > > > > > fix. Also, I am > > > > > > > refactoring all the package names from > > > > > com.kizna.* > > > > > > > to org.htmlparser.*. > > > > > > > > > > > > > > This would cause you to change your > > > package > > > > > > > names, and I must apologize > > > > > > > for the inconvenience, but this would be > > the > > > > > first > > > > > > > and last change. The next > > > > > > > release is coming with nicer docs.. > > > > > > > > > > > > > > Regards, > > > > > > > Somik > > > > > > > ----- Original Message ----- > > > > > > > From: "Somik Raha" <so...@ya...> > > > > > > > To: > > <htm...@li...> > > > > > > > Sent: Tuesday, November 19, 2002 1:54 PM > > > > > > > Subject: Re: [Htmlparser-user] How to > use > > > it. > > > > > > > > > > > > > > > > > > > > > > Hi Navid, > > > > > > > > That does sound like a bug. I will > > look > > > into > > > > > it > > > > > > > > tonight. Meanwhile, could you enter > this > > > bug > > > > > > > report in > > > > > > > > the htmlparser website ? > > > > > > > > (http://htmlparser.sourceforge.net). > > > > > > > > > > > > > > > > Regards, > > > > > > > > Somik > > > > > > > > --- "Navid H.Langaroudi" > > > <na...@ya...> > > > > > > > wrote: > > > > > > > > > Hi Somik, > > > > > > > > > I could do some parts of my program. > > And > > > the > > > > > > > result > > > > > > > > > was great. Thanks for all your > helps. > === message truncated === __________________________________________________ Do you Yahoo!? Yahoo! Mail Plus Powerful. Affordable. Sign up now. http://mailplus.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-23 07:43:05
|
Hi Navid, > How you doing? > I still need your help. > I am getting some exception errors in my program, it > happens when it tried to open a url to a non existing > page. > I tried some try/catch, but still can't catch this > one. > It says > Error: HTMLParser.oprnURLConnection(): Error in > opening a URL connection to > http://www.somenoneexistingurl.com ....... > > How can I skip this, my program reads all urls of a > site and tries to go to next page, and if the url does > not exist or is wrong, it terminats, I think I should > control this in order to let the program carry on on > correct links. Hmm, let me write a testcase to trap this.. > Thanks for your help. > Have a nice weekend. > By the way, Are you living in San Francisco? I am in > LosAngeles. Close enough. I am in Berkeley. Regards, Somik |
From: Somik R. <so...@ya...> - 2002-11-23 08:03:45
|
Hi Navid, > I still need your help. > I am getting some exception errors in my program, it > happens when it tried to open a url to a non existing > page. > I tried some try/catch, but still can't catch this > one. > It says > Error: HTMLParser.oprnURLConnection(): Error in > opening a URL connection to > http://www.somenoneexistingurl.com ....... > > How can I skip this, my program reads all urls of a > site and tries to go to next page, and if the url does > not exist or is wrong, it terminats, I think I should > control this in order to let the program carry on on > correct links. I wrote a testcase for this - in HTMLParserTest.java. This test proves that there is no bug in the parser. You can add this snippet and verify for yourself. public void testNullUrl() { HTMLParser parser; try { parser = new HTMLParser("http://someoneexisting.com"); assertTrue("Should have thrown an exception!",false); } catch (HTMLParserException e) { } } I can guess what you might be doing wrong though .. are you sure you are trying to trap HTMLParserException ? Regards, Somik |
From: Navid H.L. <na...@ya...> - 2002-11-25 19:28:52
Attachments:
t123.java
|
Hi Somik, I am still getting this error. Here I send a sample code, which if you run, you will get the error. Please see the attachment. Thanks Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > > > > I still need your help. > > I am getting some exception errors in my program, > it > > happens when it tried to open a url to a non > existing > > page. > > I tried some try/catch, but still can't catch this > > one. > > It says > > Error: HTMLParser.oprnURLConnection(): Error in > > opening a URL connection to > > http://www.somenoneexistingurl.com ....... > > > > How can I skip this, my program reads all urls of > a > > site and tries to go to next page, and if the url > does > > not exist or is wrong, it terminats, I think I > should > > control this in order to let the program carry on > on > > correct links. > > I wrote a testcase for this - in > HTMLParserTest.java. This test proves that > there is no bug in the parser. You can add this > snippet and verify for > yourself. > > public void testNullUrl() { > HTMLParser parser; > try { > parser = new > HTMLParser("http://someoneexisting.com"); > assertTrue("Should have thrown an > exception!",false); > } > catch (HTMLParserException e) { > } > } > > I can guess what you might be doing wrong though .. > are you sure you are > trying to trap HTMLParserException ? > > Regards, > Somik > > > > ------------------------------------------------------- > This sf.net email is sponsored by:ThinkGeek > Welcome to geek heaven. > http://thinkgeek.com/sf > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user __________________________________________________ Do you Yahoo!? Yahoo! Mail Plus Powerful. Affordable. Sign up now. http://mailplus.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-25 19:50:29
|
Hi Navid, I ran the program, and it does exactly what I expected. But I see your doubt now. You want to suppress the exception messages. These are happening because of DefaultHMLParserFeedback(). Pls write your own NullHTMLParserFeedback(), that does not print anything when it encounters an error, and use that to initialize the parser. Read the javadoc of HTMLParser.java carefully. > --- Somik Raha <so...@ya...> wrote: > > Hi Navid, > > > > > > > I still need your help. > > > I am getting some exception errors in my > program, > > it > > > happens when it tried to open a url to a non > > existing > > > page. > > > I tried some try/catch, but still can't catch > this > > > one. > > > It says > > > Error: HTMLParser.oprnURLConnection(): Error in > > > opening a URL connection to > > > http://www.somenoneexistingurl.com ....... > > > > > > How can I skip this, my program reads all urls > of > > a > > > site and tries to go to next page, and if the > url > > does > > > not exist or is wrong, it terminats, I think I > > should > > > control this in order to let the program carry > on > > on > > > correct links. > > > > I wrote a testcase for this - in > > HTMLParserTest.java. This test proves that > > there is no bug in the parser. You can add this > > snippet and verify for > > yourself. > > > > public void testNullUrl() { > > HTMLParser parser; > > try { > > parser = new > > HTMLParser("http://someoneexisting.com"); > > assertTrue("Should have thrown an > > exception!",false); > > } > > catch (HTMLParserException e) { > > } > > } > > > > I can guess what you might be doing wrong though > .. > > are you sure you are > > trying to trap HTMLParserException ? > > > > Regards, > > Somik > > > > > > > > > ------------------------------------------------------- > > This sf.net email is sponsored by:ThinkGeek > > Welcome to geek heaven. > > http://thinkgeek.com/sf > > _______________________________________________ > > Htmlparser-user mailing list > > Htm...@li... > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > __________________________________________________ > Do you Yahoo!? > Yahoo! Mail Plus Powerful. Affordable. Sign up > now. > http://mailplus.yahoo.com __________________________________________________ Do you Yahoo!? Yahoo! Mail Plus Powerful. Affordable. Sign up now. http://mailplus.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-25 19:51:14
|
Hi Navid, I ran the program, and it does exactly what I expected. But I see your doubt now. You want to suppress the exception messages. These are happening because of DefaultHMLParserFeedback(). Pls write your own NullHTMLParserFeedback(), that does not print anything when it encounters an error, and use that to initialize the parser. Read the javadoc of HTMLParser.java carefully. I have written the modified program for you : public void testNullUrl(){ try { parser = new HTMLParser("http://www.yahooeeeeee.com",new HTMLParserFeedback() { /** * @see org.htmlparser.util.HTMLParserFeedback#info(String) */ public void info(String message) { } /** * @see org.htmlparser.util.HTMLParserFeedback#warning(String) */ public void warning(String message) { } /** * @see org.htmlparser.util.HTMLParserFeedback#error(String, HTMLParserException) */ public void error(String message, HTMLParserException e) { } }); //assertTrue("Should have thrown an exception!",false); parser.registerScanners(); parser.addScanner(new HTMLLinkScanner("-l")); } catch (HTMLParserException e) { System.out.println("Can not connect the URL!"); } } Try this- it should give you what you want now. Regards Somik (The earlier mail went before I could complete it..) __________________________________________________ Do you Yahoo!? Yahoo! Mail Plus Powerful. Affordable. Sign up now. http://mailplus.yahoo.com |
From: Navid H.L. <na...@ya...> - 2002-11-27 01:49:57
|
Thank you Somik, It worked as I wanted. Also using the parser classes, I could extract each part of page data separatly. Well, now I reached to the real difficult part. I am trying to extract meaningful data from some sites, like a site products names, discription and keywords, which is not always in meta tags. Do have any suggestion? Navid --- Somik Raha <so...@ya...> wrote: > Hi Navid, > I ran the program, and it does exactly what I > expected. > But I see your doubt now. You want to suppress the > exception messages. These are happening because of > DefaultHMLParserFeedback(). Pls write your own > NullHTMLParserFeedback(), that does not print > anything > when it encounters an error, and use that to > initialize the parser. Read the javadoc of > HTMLParser.java carefully. > > I have written the modified program for you : > public void testNullUrl(){ > try { > parser = new > HTMLParser("http://www.yahooeeeeee.com",new > HTMLParserFeedback() { > /** > * @see > org.htmlparser.util.HTMLParserFeedback#info(String) > */ > public void info(String message) { > } > > /** > * @see > org.htmlparser.util.HTMLParserFeedback#warning(String) > */ > public void warning(String message) { > } > > /** > * @see > org.htmlparser.util.HTMLParserFeedback#error(String, > HTMLParserException) > */ > public void error(String message, > HTMLParserException e) { > } > }); > //assertTrue("Should have thrown an > exception!",false); > parser.registerScanners(); > parser.addScanner(new HTMLLinkScanner("-l")); > > } > catch (HTMLParserException e) { > System.out.println("Can not connect the URL!"); > } > } > > Try this- it should give you what you want now. > > Regards > Somik > > (The earlier mail went before I could complete it..) > > __________________________________________________ > Do you Yahoo!? > Yahoo! Mail Plus Powerful. Affordable. Sign up > now. > http://mailplus.yahoo.com > > > ------------------------------------------------------- > This SF.net email is sponsored by: Get the new Palm > Tungsten T > handheld. Power & Color in a compact size! > http://ads.sourceforge.net/cgi-bin/redirect.pl?palm0002en > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user __________________________________________________ Do you Yahoo!? Yahoo! Mail Plus - Powerful. Affordable. Sign up now. http://mailplus.yahoo.com |
From: Somik R. <so...@ya...> - 2002-11-27 03:39:00
|
> Well, now I reached to the real difficult part. I am > trying to extract meaningful data from some sites, > like a site products names, discription and keywords, > which is not always in meta tags. > Do have any suggestion? You might do well to use Artificial Intelligence (whatever that is). :) If you define your goal more clearly, it would be easier to help you. Regards, Somik |