NekoHTML is a simple HTML scanner and tag balancer that enables application programmers to parse HTML documents and access the information using standard XML interfaces.
Checks given webpages for backlinks and scans for image links and keywords.
Uses DOM-based methods to scan for backlinks, which are more sophisticated than simple text scanning (one example: they ignore commented out source code).