Google indexing a phpwiki site

Help
Daz
2004-05-05
2012-10-11
  • Daz

    Daz - 2004-05-05

    Google only indexes the front page of my phpwiki site. It doesn't follow the links to the other pages. The static pages at the same site are indexed normally.

    It does index the "action=PageHistory", "action=Diff" etc links from the home page, but none of the actual page links.

    Is this a phpwiki thing, a Google thing, or is it just me? Thanks.

     
    • Reini Urban

      Reini Urban - 2004-05-05

      Do you use USE_PATH_INFO = true?
      This should help.

      Every page should have this header:
      <meta name="robots" content="index,follow" />

      and every action page (like edit or diff) should have this header:
      <meta name="robots" content="noindex,nofollow" />

      It should work regardless of the USE_PATH_INFO setting, but I haven't really kept up with the latest Google robot policies.
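
      One way to confirm which robots header a given page actually serves is to parse its HTML. The following is a minimal sketch using only the Python standard library; the helper names (RobotsMetaParser, robots_directives, is_indexable) are made up for illustration and are not part of phpwiki.

```python
from html.parser import HTMLParser


class RobotsMetaParser(HTMLParser):
    """Collect the directives of every <meta name="robots"> tag in a page."""

    def __init__(self):
        super().__init__()
        self.directives = []

    def handle_starttag(self, tag, attrs):
        # Self-closing tags such as <meta ... /> are also routed through here.
        if tag != "meta":
            return
        attr = dict(attrs)
        if (attr.get("name") or "").lower() == "robots":
            content = attr.get("content") or ""
            self.directives.extend(
                part.strip().lower() for part in content.split(",") if part.strip()
            )


def robots_directives(html):
    """Return the robots directives found in the HTML source, in order."""
    parser = RobotsMetaParser()
    parser.feed(html)
    return parser.directives


def is_indexable(html):
    # Assumption: a page without a robots meta tag is treated as indexable.
    return "noindex" not in robots_directives(html)
```

      Feeding it the source of a normal page and of an action page should give different answers; if both report noindex, the template is the likely culprit rather than the crawler.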

       
      • Daz

        Daz - 2004-05-06

        The USE_PATH_INFO config line is commented out. So paths look like http://.../phpwiki/index.php/ReplaceRedHat.

        In the source of 'normal' pages, there is:
        <meta name="robots" content="index,follow" />

        And in the source of a PageHistory page, I have:
        <meta name="robots" content="noindex,nofollow" />

        But when I search for:
        "site:www.snakegully.nu inurl:phpwiki", I only get these pages:

        .../phpwiki/
        .../phpwiki/HomePage?action=diff
        .../phpwiki/HomePage?action=PageInfo
        .../phpwiki/HomePage?action=BackLinks
        .../phpwiki/HomePage?action=LikePages
        .../phpwiki/HomePage?action=PageHistory
        .../phpwiki/HomePage?action=DebugInfo
        .../phpwiki/HomePage?action=viewsource

        It looks like it may be a Google problem? I might query them about it. Hmm. I just realised that the URLs that ARE indexed are in the format that would be used for all pages if USE_PATH_INFO were set to false.
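
        The pattern in that result list can be checked mechanically. This is a rough sketch that splits the URLs above into plain page views and action URLs; the list is copied from the post with its elided prefix left as-is, and the function names are hypothetical.

```python
from urllib.parse import parse_qs, urlsplit

# Search results as listed in the post (the "..." prefix is kept as written).
INDEXED = [
    ".../phpwiki/",
    ".../phpwiki/HomePage?action=diff",
    ".../phpwiki/HomePage?action=PageInfo",
    ".../phpwiki/HomePage?action=BackLinks",
    ".../phpwiki/HomePage?action=LikePages",
    ".../phpwiki/HomePage?action=PageHistory",
    ".../phpwiki/HomePage?action=DebugInfo",
    ".../phpwiki/HomePage?action=viewsource",
]


def action_of(url):
    """Return the value of the action= parameter, or None for a plain page view."""
    values = parse_qs(urlsplit(url).query).get("action")
    return values[0] if values else None


def split_by_kind(urls):
    """Separate plain page URLs from action URLs."""
    pages = [u for u in urls if action_of(u) is None]
    actions = [u for u in urls if action_of(u) is not None]
    return pages, actions
```

        Run against the list, only the front page comes back as a plain page view; every other hit is a HomePage action link.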

         
        • Reini Urban

          Reini Urban - 2004-05-06

          > The USE_PATH_INFO config line is commented out.
          > So paths look like http://.../phpwiki/index.php/ReplaceRedHat.

          That means that USE_PATH_INFO is automatically enabled. Without PATH_INFO urls would look like
          http://.../phpwiki/index.php?pagename=ReplaceRedHat

          > And in the source of a PageHistory page, I have:
          > <meta name="robots" content="noindex,nofollow" />

          That's also an action page so that's ok.

          It looks like Google doesn't like the non-PrettyWiki setup with index.php/PageName and prefers USE_PATH_INFO=false.
          Try a better index.php setup: rename it to wiki (or use our wiki), associate it with the PHP handler, and make it the DirectoryIndex.

          So that urls will look like
          http://.../phpwiki/wiki/ReplaceRedHat.
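
          To keep the three URL styles from this thread apart, here is a small sketch that builds each one for a page name. The function names and the base URL in the usage note are placeholders for illustration; phpwiki generates these URLs itself.

```python
from urllib.parse import quote, urlencode


def query_style(base, page):
    """URL form without PATH_INFO: the page name travels in the query string."""
    return f"{base}/index.php?{urlencode({'pagename': page})}"


def path_info_style(base, page):
    """URL form with PATH_INFO: the page name follows index.php as a path."""
    return f"{base}/index.php/{quote(page)}"


def pretty_style(base, page, script="wiki"):
    """URL form after renaming index.php to a script such as 'wiki'."""
    return f"{base}/{script}/{quote(page)}"
```

          With a hypothetical base of http://wiki.example/phpwiki, pretty_style(base, "ReplaceRedHat") gives http://wiki.example/phpwiki/wiki/ReplaceRedHat, which matches the form suggested above.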

           
      • Christopher Robbins

        I am trying to get Google to index my wiki, but although I have changed "top.tmpl" to read <meta name="robots" content="index,follow" />, all new pages still have the old noindex,nofollow.

        Any ideas? I am running version 1.3.2.

         
