[Taskforest-discuss] Task Forest Daemon Problem
Brought to you by:
enoor
From: Rosco R. <Ros...@sa...> - 2009-06-11 18:45:25
|
We're currently running TF 1.26; but I've been noticing this problem since we first began experimenting back at around 1.10 or so. Until recently, I was not convinced that it truly was a bug...thinking it probably something incorrect about our local configuration. Here's the rub. Things will be humming along nicely; our nightly processing will finish with everything OK. Then for some application reason, we'll need to rerun a job. In the olden days, I'd do this with the batch "./bin/rerun..." command. Nowadays, I'm clicking the web client "Rerun" button. All seems well as you issue the command. Then after several minutes the job doesn't appear to finish when I think it should and I begin snooping around. I then discover that the ./bin/taskforest process itself has terminated. Sometimes a single, isolated rerun command will cause this...but not very often. Sometimes, I find it necessary to fire-off several rerun commands from a ksh...or lately, with the web client I'll click several jobs as quick as the web page can be rebuilt. The more rerun commands I fire at the daemon in a short time, the greater the chance that it'll go down. Generally, we have very little trouble with the daemon. It performs reliably in all but this circumstance. Rosco |