Re: [Htmlparser-developer] Re: [Htmlparser-user] HTML parser 1.1
Brought to you by:
derrickoswald
From: Raghavender S. <kin...@ho...> - 2002-04-12 03:38:39
|
Thanks somik. I will work on it. Raghav >From: "Somik Raha" <so...@ya...> >To: <htm...@li...>, "Raghavender Srimantula" ><kin...@ho...> >Subject: Re: [Htmlparser-developer] Re: [Htmlparser-user] HTML parser 1.1 >Date: Fri, 12 Apr 2002 11:57:50 +0900 > >Hi Raghav > You are right. That is indeed a bug. I have written a test case for >it, >captured it, and fixed it. > Code is checked into CVS - it should work for you now. > >Regards, >Somik >----- Original Message ----- >From: "Raghavender Srimantula" <kin...@ho...> >To: <so...@ya...>; <htm...@li...> >Sent: Friday, April 12, 2002 6:12 AM >Subject: [Htmlparser-developer] Re: [Htmlparser-user] HTML parser 1.1 > > > > hi Somik, > > the code snippet you mailed me seems to have some problems. > > let me explain you. the method > > isXMLTagFound(node,"OPTION") > > would always return false. the reason: in the definition of the above >method > > we have > > > > if (node instanceof HTMLTag) { > > System.out.println("node instanceof HTMLTag in tagscanner "); > > HTMLTag tag = (HTMLTag)node; > > if (tag.getText().equals(tagName)) { > > xmlTagFound=true; > > } > > } > > > > tag.getText() would always give me > > OPTION value="#">Select a destination > > > > which is not equal to the tagName, in this case the tagName=OPTION. > > > > Raghav > > > > > > >From: "Somik Raha" <so...@ya...> > > >To: "Raghavender Srimantula" <kin...@ho...>, > > ><htm...@li...> > > >Subject: Re: [Htmlparser-user] HTML parser 1.1 > > >Date: Thu, 11 Apr 2002 11:14:51 +0900 > > > > > >Hi Raghav > > > I replied to your earlier query. Did you recieve the mail (I >forwarded > > >it again) ? > > > Regarding your current query, there are two ways to handle option > > >tags. > > > > > >[1] Like in the previous question, you will have to recognize a HTMLTag > > >(begin tag), followed by HTMLStringNode, and finally HTMLEndTag. > > >[2] To make life easier, since this tag is basic xml, you can use a >special > > >XML parsing method provided in the superclass HTMLTagScanner. > > > > > >The methods are : > > >(i) isXMLTagFound > > >(ii) extractXMLData > > > > > >both of them are static mehods. > > >You would use it like this : > > > > > >HTMLNode node = reader.readElement(); > > >if (isXMLTag(node,"OPTION")) { > > > String option = extractXMLData(node,"OPTION",reader); > > > // The string now contains the data within the option xml tag > > > // So given an input : <OPTION value="#">Select a >destination</OPTION> > > > // option will hold "Select a destination" > > >} > > > > > >But getting the value from the option tag itself would need to be >handled > > >seperately. > > > > > >Regards, > > >Somik > > >----- Original Message ----- > > >From: "Raghavender Srimantula" <kin...@ho...> > > >To: <so...@ya...>; <htm...@li...> > > >Sent: Thursday, April 11, 2002 9:22 AM > > >Subject: Re: [Htmlparser-user] HTML parser 1.1 > > > > > > > > > > hi Somik, > > > > any ideas about my previous mail. let us say if we have > > > > <OPTION value="#">Select a destination</OPTION> > > > > when I do a > > > > node = reader.readElement(); > > > > where "reader" is HTMLReader > > > > the node I get is of type neither HTMLStringNode, HTMLEndTag, > > > > HTMLRemarkNode. > > > > how do I classify this if I want to do some thing with them. > > > > Raghav > > > > > > > > >From: "Somik Raha" <so...@ya...> > > > > >To: "Raghavender Srimantula" <kin...@ho...> > > > > >CC: <htm...@li...> > > > > >Subject: Re: [Htmlparser-user] HTML parser 1.1 > > > > >Date: Mon, 8 Apr 2002 13:04:07 +0900 > > > > > > > > > >Hi Raghav > > > > > > when would be this HTMLparser 1.1 out? > > > > >As soon as I can wrap it up. Technically, the code is ready and >already > > > > >checked into CVS. I need to do the process of creating a release - >make > > > > >some > > > > >documentation, check everything is ok, .. > > > > >If I had some help I could wrap it up sooner. > > > > > > > > > > > I am not sure, but to me the way htmlparser parses is it gives >me > > >the > > > > >tag > > > > > > parameter of the first line in the above snippet of html code, >when > > >I > > >do > > > > > > Hashtable table = tag.parseParameters(); > > > > > > it is looking for parameters inside <FORM ..... >, but not <FORM > > > > > > .....</FORM> > > > > > > > > > >Yes - parseParameters() will give you the stuff inside the FORM >tag. > > >That > > > > >is > > > > >what I call "microscopic" parsing. But to get the remaining tags - >till > > >you > > > > >encounter </FORM> you need to do "macroscopic" parsing. This is not > > >hard- > > > > >check HTMLAppletScanner as an example. > > > > > > > > > >In a nutshell - concept is very simple. The scan method provides >you > > >with > > >a > > > > >reader. So you are to use that reader to read ahead and get the >next > > >tags. > > > > >This is simple bcos the reader will automatically identify the >correct > > > > >tags, > > > > >and the mechanism is very similar to using the parser to get the >tags > > >you > > > > >want. The HTMLLinkScanner among others, also works on the same > > >principle. > > > > > > > > > >Bytway - I think we should take this discussion to the Developer >list. > > > > > > > > > >Regards, > > > > >Somik > > > > >----- Original Message ----- > > > > >From: "Raghavender Srimantula" <kin...@ho...> > > > > >To: <htm...@li...> > > > > >Sent: Monday, April 08, 2002 6:39 AM > > > > >Subject: [Htmlparser-user] HTML parser 1.1 > > > > > > > > > > > > > > > > Hi Somik, > > > > > > when would be this HTMLparser 1.1 out? > > > > > > one more question. to parse the FORM tags, I have a small >question. > > > > > > let us say this is a form tag > > > > > > > > > > > > <FORM NAME="LoginForm" METHOD=POST ACTION="urltoInvoke"> > > > > > > <P>User name: > > > > > > <INPUT TYPE="text" NAME="userName" SIZE="10"> > > > > > > <P>Password: > > > > > > <INPUT TYPE="password" NAME="password" SIZE="12"> > > > > > > <P><INPUT TYPE="submit" VALUE="Log in"> > > > > > > <INPUT TYPE="button" VALUE="Cancel" onClick="window.close()"> > > > > > > </FORM> > > > > > > > > > > > > I am not sure, but to me the way htmlparser parses is it gives >me > > >the > > > > >tag > > > > > > parameter of the first line in the above snippet of html code, >when > > >I > > >do > > > > > > Hashtable table = tag.parseParameters(); > > > > > > it is looking for parameters inside <FORM ..... >, but not <FORM > > > > > > .....</FORM> > > > > > > > > > > > > could you suggest me how to go ahead with this. > > > > > > Raghav > > > > > > > > > > > > > > > > > > to extract the INPUT tag parameters > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > > >_________________________________________________________________ > > > > > > MSN Photos is the easiest way to share and print your photos: > > > > > > http://photos.msn.com/support/worldwide.aspx > > > > > > > > > > > > > > > > > > _______________________________________________ > > > > > > Htmlparser-user mailing list > > > > > > Htm...@li... > > > > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > > > > > > > > > > > > >_________________________________________________________ > > > > >Do You Yahoo!? > > > > >Get your free @yahoo.com address at http://mail.yahoo.com > > > > > > > > > > > > > > > > > > > > > > > > > _________________________________________________________________ > > > > Get your FREE download of MSN Explorer at > > >http://explorer.msn.com/intl.asp. > > > > > > > > >_________________________________________________________ > > >Do You Yahoo!? > > >Get your free @yahoo.com address at http://mail.yahoo.com > > > > > > > > > > > > > _________________________________________________________________ > > Chat with friends online, try MSN Messenger: http://messenger.msn.com > > > > > > _______________________________________________ > > Htmlparser-developer mailing list > > Htm...@li... > > https://lists.sourceforge.net/lists/listinfo/htmlparser-developer > > >_________________________________________________________ >Do You Yahoo!? >Get your free @yahoo.com address at http://mail.yahoo.com > _________________________________________________________________ Get your FREE download of MSN Explorer at http://explorer.msn.com/intl.asp. |