Re: [Osgmm-discuss] Condor Negotiator Crashing
From: Mats R. <ry...@re...> - 2009-06-23 21:54:10
Peter Doherty wrote:
>
> It turns out this version breaks the verification runs for me. Are
> there updated scripts to go along with it? For example the
> fork.condor file in ~osgmm/var/verification-runs/SITE-NAME listed the
> executable as "fork.script.123591332490" but that executable didn't
> exist anywhere.

fork.script.$ts is just a copy of the libexec/fork.script. I don't think
the location changed from 0.5, but I might be wrong.

> But I'm having trouble figuring out why the verification tests aren't
> working right anymore. The Ranks for all the sites are low (1 or 3)
> although the Success score is 100%. And several sites aren't even
> being tested. It would really be helpful to me to get more logging
> information showing why a site was dropped from the list, and why a
> test can complete with TEST SUCCESSFUL, but the site Rank is still 1.
> Like our site SBGrid-Harvard-East is no longer in my list from
> condor_grid_overview, and since it doesn't have a directory under
> verification-runs, I can't see the output from the tests.
> Restarting the MatchMaker seems to clear out the osgmm.log file
> without rolling it over. So after a few restarts this afternoon I now
> have a huge gap in the log files, and perhaps that's where the answer
> is why the East site was dropped.

I'm trying to re-create your problem on a test machine here. It is
running Condor 7.2.1 and is configured for the sbgrid VO. So far I have
not seen the negotiator crash.

I think it would be useful for me to poke around in your OSGMM install
instead of trying to figure things out over email. I think I used to
have an account on abitibi. If the account is still there, can I have
the password reset (and sent in a private email of course)?

-- 
Mats Rynge
Renaissance Computing Institute <http://www.renci.org>