Menu

Problem with Chinese in nextXPath URL

2008-01-07
2012-09-04
  • sunrunner777

    sunrunner777 - 2008-01-07

    Hi,

    I am enjoying using Web-Harvest so far, it is a great tool! Great work!

    I have problems with download-multipage-list, when the nextXPath URL has Chinese characters such as

    http://www.somesite.com/test?a=美国&b=英国

    This causes it fail to find the next page. However, when I use url encoding to encode those chinese cahracters, it then works:

    http://www.somesite.com/test?a=%BC%56%12%CA&b=%23%BD%F2%CB

    How can I force the program to do a url encoding before visiting the link?

    Thanks in advance!

    Sunrunner777

     
    • fangbin yee

      fangbin yee - 2008-03-06

      you may write a encode function in bean shell to encode it, I believe such function could be seached from web

       
  • Mark V

    Mark V - 2011-02-24

    Google directed me here, answer from another thread

    <http url="${org.apache.commons.httpclient.util.URIUtil.encodeQuery(myUrl.toString())}" />
    
     

Log in to post a comment.