Make it handle streams
Status: Beta
Brought to you by:
jhsolorz
It seems odd for a SAX-based parser not to be able to
read from a stream. Your parser should be able to take
a chunk of text, process it, then grab some more.
Having to load the whole HTML document into memory
seems a waste. There are many examples of XML parsers
that do this just fine (off the top of my head, the IBM
Xerces Java XML library does this).
Logged In: NO
You can do this yourself: send the document to htmlparser chunk by chunk.
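
To illustrate the chunk-by-chunk idea, here is a rough sketch. It does not use this project's API, which is not shown in this thread; it uses Python's standard-library html.parser instead, whose feed() method accepts partial input and buffers incomplete tags between calls. The names TagCollector and parse_stream are made up for the example.

```python
import io
from html.parser import HTMLParser


class TagCollector(HTMLParser):
    """Hypothetical handler: records every start tag it sees."""

    def __init__(self):
        super().__init__()
        self.tags = []

    def handle_starttag(self, tag, attrs):
        self.tags.append(tag)


def parse_stream(stream, parser, chunk_size=4096):
    """Read the stream in fixed-size chunks and feed each one to the parser.

    Only one chunk (plus whatever incomplete markup the parser is
    buffering) is held in memory at a time.
    """
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:  # empty string means end of stream
            break
        parser.feed(chunk)
    parser.close()  # flush anything still buffered
    return parser


# A tiny chunk size is used here so that tags get split across chunks.
document = io.StringIO("<html><body><p>Hello</p><p>World</p></body></html>")
collector = parse_stream(document, TagCollector(), chunk_size=7)
print(collector.tags)
```

The same loop should work with any parser that keeps its state between calls; whether htmlparser does so for tags split across a chunk boundary would need to be checked against its own documentation.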