web-harvest can't download the file

Help
2010-11-24
2012-09-04
  • nworm nworm

    nworm nworm - 2010-11-24

    web-harvest can't download this file:
    http://www.oaomoek.ru/upload/File/2011/KD_2798.rar

    I use this code:

    <?xml version="1.0" encoding="UTF-8"?>
    
    <config scriptlang="java">
          <file action="write" path="c:\temp\1.rar" type="binary">
              <http method="get" url="http://www.oaomoek.ru/upload/File/2011/KD_2798.rar">
              </http>
          </file> 
    </config>
    

    And I get a bad file. I cannot open it. :(

     
  • Alex Wajda

    Alex Wajda - 2010-11-24

    First of all, this is a server-side problem: the server returns the
    inappropriate content type "text/plain" where a binary content type is expected:

    telnet> open www.oaomoek.ru 80
    Trying 87.245.190.123...
    Connected to www.oaomoek.ru.
    Escape character is '^]'.
    HEAD /upload/File/2011/KD_2798.rar HTTP/1.1
    Host: www.oaomoek.ru
    
    HTTP/1.1 200 OK
    Date: Wed, 24 Nov 2010 09:32:11 GMT
    Server: Apache/1.3.37 (Unix) PHP/4.4.7 rus/PL30.22
    Last-Modified: Wed, 20 Oct 2010 06:16:59 GMT
    ETag: "60ca-15e14-4cbe895b"
    Accept-Ranges: bytes
    Content-Length: 89620
    Content-Type: text/plain
    

    General-purpose download managers and browsers guess the correct content
    type and forgive the server. WH could do the same, but currently it does
    not; it relies heavily on the returned Content-Type header.
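
    As a rough illustration of the kind of guessing that browsers do, here is a
    minimal Java sketch that inspects the first bytes of a response body instead
    of trusting the Content-Type header. It is not Web-Harvest code: the class
    ContentSniffer and its methods are hypothetical names, and
    "application/x-rar-compressed" is just one commonly used label for RAR
    archives. The only hard fact it relies on is that RAR archives start with
    the bytes "Rar!" followed by 0x1A 0x07.

```java
import java.util.Arrays;

// Hypothetical helper, not part of Web-Harvest: guesses whether a body is a
// RAR archive from its leading bytes, regardless of the declared Content-Type.
final class ContentSniffer {

    // Prefix shared by RAR 4.x and RAR 5.x signatures: 'R' 'a' 'r' '!' 0x1A 0x07
    private static final byte[] RAR_PREFIX = {0x52, 0x61, 0x72, 0x21, 0x1A, 0x07};

    // Returns true if the body starts with the RAR signature prefix.
    static boolean looksLikeRar(byte[] body) {
        if (body == null || body.length < RAR_PREFIX.length) {
            return false;
        }
        return Arrays.equals(Arrays.copyOf(body, RAR_PREFIX.length), RAR_PREFIX);
    }

    // Overrides the declared type only when the body is recognisably a RAR
    // archive; otherwise the server's header is kept as-is.
    static String effectiveContentType(String declared, byte[] body) {
        if (looksLikeRar(body)) {
            return "application/x-rar-compressed";
        }
        return declared;
    }
}
```

    With a check like this, a response declared as "text/plain" whose body
    begins with the RAR signature would be treated as binary and written out
    byte for byte, without any character-set conversion.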

    This might be improved in a future version. Could you please file a
    request in the bug tracker?

     
  • jacob

    jacob - 2012-05-30

    I would like to know whether this problem has been fixed in a newer version.

     
