Re: [Htmlparser-user] How if source is in byte form
From: Derrick O. <der...@ro...> - 2008-01-06 17:40:21
The Page class has a constructor that takes an InputStream and the name of a character encoding.
You can make an InputStream from a byte array, for example with java.io.ByteArrayInputStream.
You need to have stored each page's encoding somewhere so that the bytes can be decoded correctly.
The Parser constructor that takes a Lexer, with the Lexer constructed from that Page, would be what you want.
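To illustrate the first two steps, here is a minimal sketch that uses only the JDK, so it runs without HTMLParser on the classpath. The class name StoredPageBytes, the helpers asStream and decode, and the sample HTML are made up for the example; the point is only that the same bytes decode correctly with the stored charset name and incorrectly with a different one. The closing comment shows where the stream and charset would be handed to Page, Lexer and Parser as described above.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;

// Hypothetical helper class, not part of HTMLParser.
class StoredPageBytes {

    // Wrap the crawler's stored bytes in an InputStream (the array is not copied).
    static InputStream asStream(byte[] storedBytes) {
        return new ByteArrayInputStream(storedBytes);
    }

    // Decode the stream using the charset name that was saved with the bytes.
    static String decode(InputStream stream, String charsetName) throws IOException {
        StringBuilder text = new StringBuilder();
        try (Reader reader = new InputStreamReader(stream, charsetName)) {
            char[] buffer = new char[1024];
            int read;
            while ((read = reader.read(buffer)) != -1) {
                text.append(buffer, 0, read);
            }
        }
        return text.toString();
    }

    public static void main(String[] args) throws IOException {
        // Stand-in for a crawled page that was stored as UTF-8 bytes.
        String html = "<html><body><p>caf\u00e9</p></body></html>";
        byte[] storedBytes = html.getBytes(StandardCharsets.UTF_8);

        String right = decode(asStream(storedBytes), "UTF-8");
        String wrong = decode(asStream(storedBytes), "ISO-8859-1");

        System.out.println("stored charset round-trips: " + right.equals(html));
        System.out.println("wrong charset round-trips:  " + wrong.equals(html));

        // With HTMLParser available, the same stream and charset name would go to
        // the constructors named in the reply, roughly:
        //   new Parser(new Lexer(new Page(asStream(storedBytes), "UTF-8")))
    }
}
```

If the crawler did not record the encoding, it has to be recovered some other way (the HTTP Content-Type header or a meta tag, for instance) before the bytes can be decoded reliably.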
----- Original Message ----
From: cash cash <ca...@ya...>
To: htmlparser user list <htm...@li...>
Sent: Sunday, January 6, 2008 2:19:23 AM
Subject: [Htmlparser-user] How if source is in byte form
Dear HTMLParser community,
We adapted a web crawler which stores crawled web
pages in byte form. Can HTMLParser take bytes as input
and filter on HTML tags?
Thank you.