Re: [Osgmm-discuss] Condor Negotiator Crashing
Brought to you by:
mats_rynge
From: Peter D. <do...@cr...> - 2009-06-22 20:11:45
|
On Jun 22, 2009, at 3:57 PM, Mats Rynge wrote: > > What version of Condor do you have? Can you provide the var/log/ > osgmm.log? > # condor_version $CondorVersion: 7.2.1 Feb 18 2009 BuildID: 133382 $ $CondorPlatform: X86_64-LINUX_RHEL5 $ The osgmm.log file is entirely filled with the permission change attempts. 4000 jobs in the queue, and it checks every 2 seconds on a different job. the osgmm.log.1.gz is a day old, and the osgmm.log.1 file has binary data in it. Is that normal? Hmm... suddenly I'm wondering if that's a clue. I used 'strings' on the osgmm.log.1 file, and the last entry is at 10:42 this morning. That's about when things started to go wrong. I'll have to check if I did something that would have caused something like that. I've restarted the match maker a couple times with no success. I'm attaching the log file anyhow. It's 5MB. It got rejected by the mailing list... so it's available here: http://abitibi.sbgrid.org/osgmm.log.1 Thanks for your help Mats. I've got meetings that will probably fill the rest of my day, so I'll have to wait until tomorrow to do much else. > >> I thought we have already removed the sudo/chmod feature (it is not >> a great way to do it - I will remove the code for 0.7). The >> preferred way to do this is to have a pre script that fixes the >> permissions. See local-pre-job in http://osgmm.sourceforge.net/ar01s03.html#job Okay, I'll look into that. --Peter |