Re: [Htmlparser-user] Hints on how to change image tag locations and write out document
From: Raghavender S. <kin...@ho...> - 2002-05-06 01:44:05
Hi Somik,

This question is regarding "not all images are being retrieved", meaning the images inside an <a> tag. I tried to open the attachment you sent me, but I could not find anything in it. From the previous mails I gather that this is not a bug, but if I do want to retrieve all the images, how do I do it?

Thanks,
Raghav

>From: "Somik Raha" <so...@ya...>
>Reply-To: htm...@li...
>To: <htm...@li...>
>Subject: Re: [Htmlparser-user] Hints on how to change image tag locations and write out document
>Date: Tue, 30 Apr 2002 11:37:26 +0900
>
>Hi Raghav,
>  Ah - this was a question by Annette Doyle (titled "Not all image tags are returned"). I am attaching my reply.
>
>Regards
>Somik
>
>----- Original Message -----
>From: "Raghavender Srimantula" <kin...@ho...>
>To: <htm...@li...>
>Sent: Tuesday, April 30, 2002 11:16 AM
>Subject: Re: [Htmlparser-user] Hints on how to change image tag locations and write out document
>
> > Hi Somik,
> > I found one more interesting thing here. When I try to get all the images, the image scanner gives me images such as
> > <img src="http://us.i1.yimg.com/us.yimg.com/i/mntl/sh/mom02/title4.gif" width=296 height=27 border=0 usemap=#tm>
> > so if I call imagetag.getImageLocation(), I get
> > http://us.i1.yimg.com/us.yimg.com/i/mntl/sh/mom02/title4.gif
> >
> > But if the HTML content is like this
> > <a href=s/6006><img src=http://us.i1.yimg.com/us.yimg.com/i/us/hj/hjys.gif border=0 width=70 height=22></a>
> > which starts with <a and ends with </a>, then the image scanner will not give me http://us.i1.yimg.com/us.yimg.com/i/us/hj/hjys.gif when I call imagetag.getImageLocation(). This is not even classified as an ImageTag; it is classified as a LinkTag. How do I get this image?
> >
> > The above content is from www.yahoo.com. In the Netscape browser, if you go to View-->Page Info, you will see a bunch of images,
> > but when you run the htmlparser you get only one image.
> >
> > Thanks,
> > Raghav
> >
> > >From: "Somik Raha" <so...@ya...>
> > >Reply-To: htm...@li...
> > >To: <htm...@li...>
> > >Subject: Re: [Htmlparser-user] Hints on how to change image tag locations and write out document
> > >Date: Tue, 30 Apr 2002 09:15:38 +0900
> > >
> > >Can you describe your application? Was it parsing a single page when the problem occurred?
> > >
> > >Regards,
> > >Somik
> > >----- Original Message -----
> > >From: "Raghavender Srimantula" <kin...@ho...>
> > >To: <htm...@li...>
> > >Cc: <htm...@li...>
> > >Sent: Tuesday, April 30, 2002 8:36 AM
> > >Subject: Re: [Htmlparser-user] Hints on how to change image tag locations and write out document
> > >
> > > > Hi Somik,
> > > > I encountered a strange problem today. While I was running htmlparser I got a java.lang.OutOfMemoryError. It seems that a lot of objects are being allocated. Where exactly is this happening? Could you give me an idea of where, or in which file, the potential problem could be?
> > > > Raghav
> > > >
> > > > >From: "Somik Raha" <so...@ya...>
> > > > >Reply-To: htm...@li...
> > > > >To: <htm...@li...>
> > > > >CC: <htm...@li...>
> > > > >Subject: Re: [Htmlparser-user] Hints on how to change image tag locations and write out document
> > > > >Date: Sat, 27 Apr 2002 18:22:34 +0900
> > > > >
> > > > >Hi Annette,
> > > > >  Please find attached a program to get you started. This program will do what you want - you will need to modify the construct that checks for the image tag - and replace it with the location of your choice.
> > > > >  Also - I found one bug thanks to this requirement - image tag params were not being correctly put in. Though it needs a deeper look, I have done a quick fix for now, and all test cases are passing (with one test case in HTMLImageScannerTest trapping this bug).
> > > > >  Please check out the latest html parser source code from CVS.
> > > > >
> > > > >Regards,
> > > > >Somik
> > > > >
> > > > >  ----- Original Message -----
> > > > >  From: Doyle, Annette
> > > > >  To: htm...@li...
> > > > >  Sent: Friday, April 26, 2002 10:08 PM
> > > > >  Subject: [Htmlparser-user] Hints on how to change image tag locations and write out document
> > > > >
> > > > >  Could you please give me some hints as how to change only image tag locations and then, (or at the same time) write out the html document to file (with new image tag locations)?
> > > > >
> > > > >  Thanks-
> > > > >
> > > > >  Annette Doyle
> > > > >
> > > > ><< ImageTagRetriever.java >>
> > > >
> > > > _______________________________________________
> > > > Htmlparser-user mailing list
> > > > Htm...@li...
> > > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user
>
><< [Htmlparser-developer]Re_[Htmlparser-user]Notallimagetagsarereturned[NotaBug].eml >>
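
For the two questions in this thread (collecting every image location, including an <img> nested inside <a>...</a>, and rewriting image locations before writing the document back out), here is a minimal sketch that does not go through the scanner at all. It is not the HTMLParser API and not the attached ImageTagRetriever.java, whose contents are not shown in this thread. It uses only java.util.regex; the class name ImageSrcSketch, its method names, and the "local/" prefix are hypothetical, chosen for illustration. A regular expression is not a full HTML parser, so treat this as a workaround for simple markup like the Yahoo snippets quoted above. For example, it would also match an attribute such as data-src.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.UnaryOperator;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper for illustration only; not part of HTMLParser.
class ImageSrcSketch {
    // Group 1: everything from "<img" up to and including "src=".
    // Groups 2-4: the src value when double-quoted, single-quoted, or unquoted.
    private static final Pattern IMG_SRC = Pattern.compile(
        "(<img\\b[^>]*?\\bsrc\\s*=\\s*)(?:\"([^\"]*)\"|'([^']*)'|([^\\s>]+))",
        Pattern.CASE_INSENSITIVE);

    // Returns whichever of the three value groups matched.
    private static String srcOf(Matcher m) {
        if (m.group(2) != null) return m.group(2);
        if (m.group(3) != null) return m.group(3);
        return m.group(4);
    }

    // Collects the src of every <img>, whether or not it sits inside a link.
    static List<String> imageLocations(String html) {
        List<String> out = new ArrayList<>();
        Matcher m = IMG_SRC.matcher(html);
        while (m.find()) {
            out.add(srcOf(m));
        }
        return out;
    }

    // Returns a copy of the document with each src replaced by mapper(src),
    // always written back double-quoted; all other text is left untouched.
    static String rewriteImageLocations(String html, UnaryOperator<String> mapper) {
        Matcher m = IMG_SRC.matcher(html);
        StringBuffer sb = new StringBuffer();
        while (m.find()) {
            String replacement = m.group(1) + '"' + mapper.apply(srcOf(m)) + '"';
            m.appendReplacement(sb, Matcher.quoteReplacement(replacement));
        }
        m.appendTail(sb);
        return sb.toString();
    }

    public static void main(String[] args) {
        // The two snippets quoted earlier in the thread.
        String html =
            "<img src=\"http://us.i1.yimg.com/us.yimg.com/i/mntl/sh/mom02/title4.gif\""
            + " width=296 height=27 border=0 usemap=#tm>"
            + "<a href=s/6006><img src=http://us.i1.yimg.com/us.yimg.com/i/us/hj/hjys.gif"
            + " border=0 width=70 height=22></a>";

        // Lists both locations, including the one nested inside the link.
        for (String src : imageLocations(html)) {
            System.out.println(src);
        }

        // Points every image at a hypothetical local directory, keeping the file name.
        System.out.println(rewriteImageLocations(
            html, s -> "local/" + s.substring(s.lastIndexOf('/') + 1)));
    }
}
```

The string returned by rewriteImageLocations can then be written to a file with any standard writer. Because only the matched src value is replaced, the rest of the markup, including the enclosing <a> tag, is preserved as-is.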