From: Nicholas H. <he...@se...> - 2003-04-10 16:55:56
|
On Wed, 9 Apr 2003 16:04:19 -0600 er...@he... wrote: Ok -- here is another node. I tried to find the process information on the head node incase that helps. ps -zxf for node25 on the head node: 28460 ? S 0:00 bpsh -n 25 subtaskInvoker /scratch/user/sfischer/slot_1/result /genomics/binf/scratch/dotsBuilds/nicTest/mus/similarity/fin 28462 ? SW 0:00 \_ [subtaskInvoker] 28463 ? SW 0:00 \_ [blastSimilarity] 11803 ? SW 0:00 \_ [sh] 11804 ? SW 0:00 \_ [blastx] 11823 ? SW 0:00 \_ [blastx] 11825 ? SW 0:00 \_ [blastx] On the node before the kill: 10509 ? S 0:00 \_ /bin/sh /proc/self/fd/3 /scratch/user/sfischer/slot_1/result /genomics/binf/scratch/dotsBuilds/nicTest/mus/similar 10510 ? S 0:00 \_ /usr/bin/perl /home/sfischer/gushome/bin/blastSimilarity --blastBinDir /genomics/share/pkg/bio/wu-blast/curren 11825 ? S 0:00 \_ sh -c /genomics/share/pkg/bio/wu-blast/current/blastx /scratch/user/sfischer/prodom.fsa seqTmp -wordmask=s 11826 ? S 0:00 \_ /genomics/share/pkg/bio/wu-blast/current/blastx /scratch/user/sfischer/prodom.fsa seqTmp -wordmask seg 11839 ? S 0:00 \_ /genomics/share/pkg/bio/wu-blast/current/blastx /scratch/user/sfischer/prodom.fsa seqTmp -wordmask 11840 ? S 0:00 \_ /genomics/share/pkg/bio/wu-blast/current/blastx /scratch/user/sfischer/prodom.fsa seqTmp -word Hrm -- This one seemed to have killed them all. With the head node information I managed to trim the trace to where the bpsh <...> started to the end. Trace is at :http://www.liniac.upenn.edu/~henken/bproc/node25trace One more after this -- just for fun :) Nic -- Nicholas Henke Penguin Herder & Linux Cluster System Programmer Liniac Project - Univ. of Pennsylvania |