Menu

HTTPS crawl fails to create socket

Help
Anonymous
2015-04-02
2020-03-19
  • Anonymous

    Anonymous - 2015-04-02

    When attempting to crawl a HTTPS site (facebook) - no content was received. I found the issue to be the socket failing to be created - is this a known issue? Is there any workaround?

     
  • Uwe Hunfeld

    Uwe Hunfeld - 2015-04-02

    Hi!

    What php version are you using?
    5.6x?

    If so, there is a known bug: https://sourceforge.net/p/phpcrawl/bugs/86/

    Before php 5.6 it should work without any (known) problems.

    Did you install OpenSSH (it's required for ssl-connections).

     
  • Anonymous

    Anonymous - 2015-04-03

    Hey there!

    Thank you for the quick response. I just checked, and I am indeed using PHP 5.6 (5.6.2). I'll try rolling back to an earlier version now and report my results back, but it sounds like that should resolve the issue.

    Thank you!

     
  • Anonymous

    Anonymous - 2015-04-07

    For anyone looking at this, I can confirm that PHP 5.6.x was the problem (5.6.2 for me). Dropping back a version everything seems to work fine!

     
  • Uwe Hunfeld

    Uwe Hunfeld - 2015-04-07

    Hi,

    thanks for replying back and the confirmation!

    This bug (https and php 5.6) will get fixed in the next release.

    If anybody needs a temporary workaround for phpcrawl version 0.83, please let me know
    (here or in the bugreport).

    Thanks!

     
  • Anonymous

    Anonymous - 2016-05-24

    I was having massive Problems crawling SSL-Sites after a PHP-version update. After a bit of hacking I found the Error: The socket was returning 'Host unreachable'
    Solution:
    in PHPCrawlerHTTPRequest.class.php Row 551:
    Replace 'SNI_server_name' with 'peer_name'
    SNI_server_name is not supported by PHP > 5.6.0.

    Works like a charm now, hope this helps someone else. Thank you, Uwe, for your wonderful tool..

    Peter

     
    • Anonymous

      Anonymous - 2020-03-19

      Ohhh my god!!!! More than 10 hours searching and more than a dozen of parameters trying to find a workaround and the most simple way was the answer. Thank you!!!

       
  • Anonymous

    Anonymous - 2016-08-19

    Thank you so much Peter!!!
    Just saved my butt with the above suggestion. I expect your post will be worth gold when people start to upgrade to php7!
    -Val

     
  • Anonymous

    Anonymous - 2019-10-18
     
  • Benjamin Oruta

    Benjamin Oruta - 2020-04-06
    Post awaiting moderation.

Anonymous
Anonymous

Add attachments
Cancel





Want the latest updates on software, tech news, and AI?
Get latest updates about software, tech news, and AI from SourceForge directly in your inbox once a month.