Re: [Osgmm-discuss] Condor Negotiator Crashing
From: Mats R. <ry...@re...> - 2009-06-22 20:02:25
Peter Doherty wrote:
> Okay, that eliminated the errors on the console when I launch the
> match maker. Thanks.
> For the moment the negotiator stopped crashing, but it stopped before
> I put the new jar file in, so I don't know what to make of that.
> At the moment there are no valid sites in the matchmaker, I'm going to
> have to look into things further to see what's going on. It seems the
> verification runs didn't run this afternoon.
> The matchmaker related processes don't look right to me.

What version of Condor do you have? Can you provide the var/log/osgmm.log?

> Anyhow, looking in the osgmm.log file I noticed something
> interesting. It tries to track jobs by their job log files, and if it
> can't access the file, it tries to chmod 644 the log file.
> a.) I don't know that I like the idea of the matchmaker trying to
> change permissions on files in people's home directories.
> b.) if it can't read the file, the odds are pretty slim it's going to
> be able to change permissions on the file.
>
> But I guess this is why I have so many 0's and empty columns in the
> various fields of condor_grid_overview. If it can't access the log
> files, it can't display what jobs are running where and what their
> status is in the condor_grid_overview output. Is that correct?

I thought we have already removed the sudo/chmod feature (it is not a
great way to do it - I will remove the code for 0.7). The preferred way
to do this is to have a pre script that fixes the permissions. See
local-pre-job in http://osgmm.sourceforge.net/ar01s03.html#job

--
Mats Rynge
Renaissance Computing Institute <http://www.renci.org>
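
For illustration only, a pre script along the lines suggested above might look roughly like the sketch below. It is run by the job owner, so it has the rights the matchmaker lacks, and it only adds the group/other read bits that "chmod 644" would grant. The function name make_log_readable is hypothetical, and how OSGMM invokes local-pre-job (and what arguments it passes) is not stated in this thread, so the caller is assumed to supply the job log path itself.

```python
import os
import stat


def make_log_readable(path):
    """Add group/other read bits to `path`, leaving other mode bits as they are.

    Hypothetical helper, not part of OSGMM or Condor. Intended to be run
    as the job owner so that another account can read the job log.
    """
    mode = stat.S_IMODE(os.stat(path).st_mode)
    new_mode = mode | stat.S_IRGRP | stat.S_IROTH
    if new_mode != mode:
        # Only touch the file when the bits actually need to change.
        os.chmod(path, new_mode)
    return new_mode
```

A pre script could call make_log_readable() on the job log path before the job is submitted. Note that the directories leading to the log file must also be searchable by the matchmaker account for the file to be readable.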