Menu

HTTPS crawl fails to create socket

Help
Anonymous
2015-04-02
2020-03-19
  • Anonymous

    Anonymous - 2015-04-02

    When attempting to crawl a HTTPS site (facebook) - no content was received. I found the issue to be the socket failing to be created - is this a known issue? Is there any workaround?

     
  • Uwe Hunfeld

    Uwe Hunfeld - 2015-04-02

    Hi!

    What php version are you using?
    5.6x?

    If so, there is a known bug: https://sourceforge.net/p/phpcrawl/bugs/86/

    Before php 5.6 it should work without any (known) problems.

    Did you install OpenSSH (it's required for ssl-connections).

     
  • Anonymous

    Anonymous - 2015-04-03

    Hey there!

    Thank you for the quick response. I just checked, and I am indeed using PHP 5.6 (5.6.2). I'll try rolling back to an earlier version now and report my results back, but it sounds like that should resolve the issue.

    Thank you!

     
  • Anonymous

    Anonymous - 2015-04-07

    For anyone looking at this, I can confirm that PHP 5.6.x was the problem (5.6.2 for me). Dropping back a version everything seems to work fine!

     
  • Uwe Hunfeld

    Uwe Hunfeld - 2015-04-07

    Hi,

    thanks for replying back and the confirmation!

    This bug (https and php 5.6) will get fixed in the next release.

    If anybody needs a temporary workaround for phpcrawl version 0.83, please let me know
    (here or in the bugreport).

    Thanks!

     
  • Anonymous

    Anonymous - 2016-05-24

    I was having massive Problems crawling SSL-Sites after a PHP-version update. After a bit of hacking I found the Error: The socket was returning 'Host unreachable'
    Solution:
    in PHPCrawlerHTTPRequest.class.php Row 551:
    Replace 'SNI_server_name' with 'peer_name'
    SNI_server_name is not supported by PHP > 5.6.0.

    Works like a charm now, hope this helps someone else. Thank you, Uwe, for your wonderful tool..

    Peter

     
    • Anonymous

      Anonymous - 2020-03-19

      Ohhh my god!!!! More than 10 hours searching and more than a dozen of parameters trying to find a workaround and the most simple way was the answer. Thank you!!!

       
  • Anonymous

    Anonymous - 2016-08-19

    Thank you so much Peter!!!
    Just saved my butt with the above suggestion. I expect your post will be worth gold when people start to upgrade to php7!
    -Val

     
  • Anonymous

    Anonymous - 2019-10-18
     
  • Benjamin Oruta

    Benjamin Oruta - 2020-04-06
    Post awaiting moderation.

Anonymous
Anonymous

Add attachments
Cancel