Re: [Htmlparser-user] Site Capturer

SourceForge Headquarters 1320 Columbia Street Suite 310 San Diego, CA 92101 +1 (858) 422-6466

If you aren't running a web server (it's not clear from your email),
then you will need to use file:/// 'URLs' instead, or read the
documents in as Strings using Java's File and FileReader classes, and
then do Parser.setInputHTML.

Ian

On 11/04/06, Derrick Oswald <Der...@ro...> wrote:
> Jay,
>
> By the 'directory is handle as a text Node', do you mean the web server
> is replying to the URL with a directory listing as a plain text node in
> the HTML page?.
> Usually HTTP servers either don't reply with anything, or if configured
> more loosely will provide a hyper-linked listing of the directory so the
> user can navigate through the directories.
> As far as I know there isn't any existing code to handle a plain-text
> listing of a directory and extract the tree structure if that's what
> you're asking.
>
> Derrick
>
> HATTAT J=E9r=E9mie wrote:
>
> > Hi everbody,
> >
> > I'm working with htmlparser for few days,
> > I want to miror an entire site, my start point was SiteCapturer example=
,
> > but i can't handle a site with an url like "http://localhost/mysite",
> > because a
> > directory is handle as a text Node.
> >
> > Is there a API implemented solution or should i  developp the file
> > handler engine?
> >
> > Thanks in advance,
> >
> > Jay
> >
>
>
>
> -------------------------------------------------------
> This SF.Net email is sponsored by xPML, a groundbreaking scripting langua=
ge
> that extends applications into web and mobile media. Attend the live webc=
ast
> and join the prime developer group breaking into this new coding territor=
y!
> http://sel.as-us.falkag.net/sel?cmd=3Dlnk&kid=3D110944&bid=3D241720&dat=
=3D121642
> _______________________________________________
> Htmlparser-user mailing list
> Htm...@li...
> https://lists.sourceforge.net/lists/listinfo/htmlparser-user
>