Re: [Htmlparser-user] How if source is in byte form
From: Derrick O. <der...@ro...> - 2008-01-06 17:40:21
The Page class has a constructor taking an InputStream and an encoding. You can make an InputStream from a byte array, for example. You need to have stored the encoding somewhere to reconstitute the bytes correctly.

The Parser constructor taking a Lexer constructed from a Page would be what you want.

----- Original Message ----
From: cash cash <ca...@ya...>
To: htmlparser user list <htm...@li...>
Sent: Sunday, January 6, 2008 2:19:23 AM
Subject: [Htmlparser-user] How if source is in byte form

Dear HTMLParser community,

We adapted a web crawler which stores crawled web pages in byte form. Can HTMLParser take byte as input and do filtering on HTML tags?

Thank you.
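
For illustration, here is a minimal sketch of the byte-array step described in the reply, using only the Java standard library. The class name StoredPage and its methods are hypothetical and not part of HTMLParser; the sketch assumes the crawler saved the charset name alongside the bytes. It is meant to show why the stored encoding matters: the same bytes decode to different text under a different charset. The stream returned by open() and the charset name are what the reply suggests handing to the Page constructor.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.InputStreamReader;
import java.io.Reader;
import java.nio.charset.StandardCharsets;

// Hypothetical helper for illustration only; not part of HTMLParser.
class StoredPage {

    // Wrap the crawler's stored bytes in an InputStream.
    static InputStream open(byte[] raw) {
        return new ByteArrayInputStream(raw);
    }

    // Decode the bytes using the charset name that was saved at crawl time.
    static String decode(byte[] raw, String charsetName) throws IOException {
        StringBuilder text = new StringBuilder();
        try (Reader reader = new InputStreamReader(open(raw), charsetName)) {
            char[] buffer = new char[1024];
            int n;
            while ((n = reader.read(buffer)) != -1) {
                text.append(buffer, 0, n);
            }
        }
        return text.toString();
    }

    public static void main(String[] args) throws IOException {
        // Sample page bytes, assumed to have been stored as UTF-8.
        byte[] raw = "<p>caf\u00e9</p>".getBytes(StandardCharsets.UTF_8);

        // Correct charset: the original text comes back.
        System.out.println(decode(raw, "UTF-8"));

        // Wrong charset: the accented character is garbled.
        System.out.println(decode(raw, "ISO-8859-1"));

        // With HTMLParser on the classpath, the reply's suggestion would
        // correspond to passing open(raw) and the charset name to Page,
        // then building a Lexer from the Page and a Parser from the Lexer.
    }
}
```

If the crawler did not record the encoding, it would have to be recovered some other way (for example from the HTTP headers or a meta tag) before the bytes can be decoded reliably.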