Re: [Htmlparser-user] Site Capturer
From: Ian M. <ian...@gm...> - 2006-04-11 21:09:09
If you aren't running a web server (it's not clear from your email), then
you will need to use file:/// URLs instead, or read the documents in as
Strings using Java's File and FileReader classes and then pass them to
Parser.setInputHTML.

Ian

On 11/04/06, Derrick Oswald <Der...@ro...> wrote:
> Jay,
>
> By 'a directory is handled as a text Node', do you mean the web server
> is replying to the URL with a directory listing as a plain text node in
> the HTML page?
> Usually HTTP servers either don't reply with anything or, if configured
> more loosely, will provide a hyperlinked listing of the directory so the
> user can navigate through the directories.
> As far as I know there isn't any existing code to handle a plain-text
> listing of a directory and extract the tree structure, if that's what
> you're asking.
>
> Derrick
>
> HATTAT Jérémie wrote:
>
> > Hi everybody,
> >
> > I've been working with htmlparser for a few days.
> > I want to mirror an entire site. My starting point was the SiteCapturer
> > example, but I can't handle a site with a URL like
> > "http://localhost/mysite", because a directory is handled as a text Node.
> >
> > Is there a solution already implemented in the API, or should I develop
> > the file handling engine myself?
> >
> > Thanks in advance,
> >
> > Jay
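
The two options Ian suggests (a file: URL, or reading the document into a String first) could be sketched roughly as follows. This is only an illustration using the standard library: the class name LocalHtmlSource, its helper methods, and the default path index.html are made up for the example, and the htmlparser calls appear only in comments, on the assumption that the htmlparser jar is on the classpath.

```java
import java.io.BufferedReader;
import java.io.File;
import java.io.FileReader;
import java.io.IOException;

// Hypothetical helper class; not part of htmlparser.
public class LocalHtmlSource {

    // Option 1: turn a local path into a file: URL string.
    // Note that File.toURI() produces the "file:/path" form rather than
    // "file:///path"; both denote a local file.
    public static String toFileUrl(File file) {
        return file.toURI().toString();
    }

    // Option 2: read the whole document into a String with File and FileReader.
    // FileReader decodes with the platform default charset, which may not
    // match the charset the page was written in.
    public static String readAll(File file) throws IOException {
        StringBuilder html = new StringBuilder();
        try (BufferedReader reader = new BufferedReader(new FileReader(file))) {
            char[] buffer = new char[4096];
            int count;
            while ((count = reader.read(buffer)) != -1) {
                html.append(buffer, 0, count);
            }
        }
        return html.toString();
    }

    public static void main(String[] args) throws IOException {
        // "index.html" is a placeholder path for this example.
        File page = new File(args.length > 0 ? args[0] : "index.html");
        System.out.println("URL form: " + toFileUrl(page));

        if (page.isFile()) {
            String html = readAll(page);
            System.out.println("Read " + html.length() + " characters");
            // With htmlparser available, the String would then be handed over
            // along the lines of (assumed usage, not compiled here):
            //   org.htmlparser.Parser parser = new org.htmlparser.Parser();
            //   parser.setInputHTML(html);
        }
    }
}
```

A directory such as the mysite folder would still have to be walked separately (for example with File.listFiles()), since neither option gives the parser a directory listing to follow.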