How to schedule webharvest

Help
JC.
2010-04-28
2013-08-16
  • JC.

    JC. - 2010-04-28

    Hello there,

    So my boss asked me to play around with scraping data.

    I managed to build a config file to get the data we wanted (consider this a
    miracle, I studied English and History -- shout out to all the nice people on
    this forum by the way).

    Now, since we're interested in how this data evolves, we would like to run the
    same config file several times a day, and doing it by hand is not really an
    option.

    I'm sure there's a way to set up some scheduling for this, but as I said, I'm
    no expert and I just don't know where to start. Could it be built into the file
    itself? Do I need to create a whole new Java application with webharvest in
    it? Can the OS (Windows Vista) do it? A cron job maybe?...

    I'll be satisfied with just a starting point.

    Thanks!

    JC.

     
  • JC.

    JC. - 2010-05-04

    So I figured this one out myself, using Task Scheduler in Windows, with a
    command line that has the config file in it, like this:

    "C:\Program Files\Java\jre6\bin\java" -Xms1520m -Xmx1520m -jar
    webharvest_all_2.jar config="C:\Program Files\Java\jre6\bin\config.xml"

    ...In case someone needs it.
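
    For anyone who would rather not rely on Task Scheduler (or is on a machine
    without it), here is a rough sketch in Python of a loop that runs the same
    command at a fixed interval. It is only an illustration under a few
    assumptions: Python is installed, the script is started from the folder that
    contains webharvest_all_2.jar, and the names build_command and run_every are
    made up for this example. The paths are the ones from the command above.

```python
import subprocess
import time

# Paths copied from the command above; adjust them to your own install.
JAVA = r"C:\Program Files\Java\jre6\bin\java"
JAR = "webharvest_all_2.jar"
CONFIG = r"C:\Program Files\Java\jre6\bin\config.xml"


def build_command(java=JAVA, jar=JAR, config=CONFIG, heap_mb=1520):
    """Return the command as an argument list.

    Passing a list to subprocess means paths containing spaces
    do not need any extra quoting.
    """
    return [
        java,
        f"-Xms{heap_mb}m",
        f"-Xmx{heap_mb}m",
        "-jar",
        jar,
        f"config={config}",
    ]


def run_every(interval_seconds, runs, command=None,
              runner=subprocess.call, sleep=time.sleep):
    """Run the command `runs` times, pausing `interval_seconds` between runs.

    Returns the list of exit codes. `runner` and `sleep` are parameters only
    so the loop can be tried out without launching Java or actually waiting.
    """
    if command is None:
        command = build_command()
    exit_codes = []
    for i in range(runs):
        exit_codes.append(runner(command))
        # No need to wait after the final run.
        if i < runs - 1:
            sleep(interval_seconds)
    return exit_codes
```

    For example, run_every(6 * 60 * 60, 4) would run the config four times, six
    hours apart. Unlike a scheduled task, this only keeps going while the script
    itself is running, so Task Scheduler is still the sturdier option.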

     
