After years of silence I've returned to hacking on httpf. Currently the sources are polished. Basically there's code of at least 4 people that has to be cleaned up. While I am at it I'll rewrite ugly parts and eat bugs along the way.
Submitted new sources. Major cleanup (again), some bugs removed, new features. We're stepping towards are stable release...
The new parser seems to run stable. I'm currently adding more automatic tests to verify this.
DocBook Documentation will be there, soon.
BTW: the executable's size (stripped) is 41K ;)
The latest news told that the CVS contents were alpha, well... they were right. Things had to be done differently. The HTML parser is simpler, easier to understand (and maintain ;)) and we have a basic test suite now.
Things are getting better for the next release.
Currently the repository holds a version that processes HTML documents in a different manner.
Previous versions looked for a tag and it's end, then nulled out the closing angle. Then each bit of the tag was analyzed as handled as configured.
The current version does *not* look for the end of a tag before processing it, especially the '>' won't be substitued by a null character. This new processing scheme was implemented to be able to handle "the unmatched-quotes problem".... read more
I made a new version that fixes some errors:
Skipping of SCRIPT contents now works better.
Fixed some rarely occuring segfaults.
Removing of entities with javascript works now
under (hopefully) all circumstances.
Ok, finally we got it. The httpf project is up and running now. Feel free to download, ask questions etc...