Re: [Htmlparser-user] set URL for relative links
Brought to you by:
derrickoswald
From: Jeffrey B. <jb...@cs...> - 2006-09-11 18:21:05
|
On 9/11/06, Garry Huang <ga...@gm...> wrote: > Did you try my_parser.setURL("http://www.bar.com/"); ? Yeah, I tried that. If it's inserted before I call extractAllNodesThatMatch(img_filter); then http://www.bar.com is downloaded. If it's called after then the relative links aren't fixed. It's possible that there's something subtle with the ordering that I could change, but I couldn't get it to work and it seems like it would be a hack... Thanks for the suggestion though. -Jeff > Just a thought. > > Cheers, > Garry > > On Sep 12, 2006, at 12:58 AM, jpdogg wrote: > > > Hello, > > > > I've cached some HTML pages in local files and would like to tell the > > Parser object what the original URLs were so that it can correctly > > interpret relative links. > > > > As a simple example, say I do this: > > > > Parser my_parser = new Parser("<html><img src='foo.jpg'></html>"); > > > > If I construct a filter to give me all of the ImageTags in this simple > > document, I get one. Unfortunately, it has the URL foo.jpg. If I > > know that this file was originally located at > > http://www.bar.com/foo.html, how do I inform the parser module? I > > want it to be able to report that the above image is located at > > http://www.bar.com/foo.jpg. > > > > Thanks! > > Jeff > > > > ---------------------------------------------------------------------- > > --- > > Using Tomcat but need to do more? Need to support web services, > > security? > > Get stuff done quickly with pre-integrated technology to make your > > job easier > > Download IBM WebSphere Application Server v.1.0.1 based on Apache > > Geronimo > > http://sel.as-us.falkag.net/sel? > > cmd=lnk&kid=120709&bid=263057&dat=121642 > > _______________________________________________ > > Htmlparser-user mailing list > > Htm...@li... > > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > > > ------------------------------------------------------------------------- > Using Tomcat but need to do more? Need to support web services, security? > Get stuff done quickly with pre-integrated technology to make your job easier > Download IBM WebSphere Application Server v.1.0.1 based on Apache Geronimo > http://sel.as-us.falkag.net/sel?cmd=lnk&kid=120709&bid=263057&dat=121642 > _______________________________________________ > Htmlparser-user mailing list > Htm...@li... > https://lists.sourceforge.net/lists/listinfo/htmlparser-user > |