Menu

#14 Spider follows commented out links

v1.0 (example)
closed-fixed
EJP
None
5
2021-07-14
2004-11-10
Phil Shaw
No

When link elements are commented out, the spider/parser
identifies them as legitimate links for checking.

I don't know if this markup will come through as intended,
but it is typical of the cases I have seen.

<!-- link rel="glossary" href="/Glossary.htm"
title="Example" type="text/html" / -->

Where links are commented out like this, I suggest they
should not be followed.

Discussion

  • EJP

    EJP - 2021-07-14
    • status: open --> closed-fixed
    • assigned_to: EJP
    • Group: --> v1.0 (example)
     
  • EJP

    EJP - 2021-07-14

    Fixed in upcoming 1.0 release. A proper HTML parser is now used and links are identifed via XPath expressions. Comments are therefore ignored, as are element/attribute pairs that can't be links, even if they are.

     

Log in to post a comment.

MongoDB Logo MongoDB