bproc-users Mailing List for BProc: Beowulf Distributed Process Space (Page 11)

SourceForge Headquarters 225 Broadway Suite 1600 San Diego, CA 92101 +1 (858) 422-6466

HI Ted,

I attach my node_up script, which sources the nfs.init script (you call
it nfs_node.conf, I guess).

You can ignore the lines for sensors.init and pathscale.init.

In addition to this, I have the following in my /etc/clustermatic/config:

librariesfrombinary /sbin/rpc.statd /sbin/portmap

My nodes now work properly, and mount my master:/home instantaneously.

Daniel

On Tue, Oct 19, 2004 at 12:01:05PM -0400, Ted Sariyski wrote:
> I  need some more  help. The script Michal wrote for NFS node support 
> ends with:
> 
> # bpsh $node rpc.statd
> 
> There is no rpc.statd in my distribution but there is rpc.rstatd (I use 
> SuSe Server 9), so I changed it corespondingly.
> Than I add
> 
> /etc/clustermatic/nfs_node.conf $*
> 
> at the end of node_up (nfs_node.conf if the Michal's script, attached).
> 
> It mounts but is slow.  I'm not sure that I understand how to use:
> # If you want to put more setup stuff here, make sure do replace the
> # "exec" above with the following:
> # /usr/lib/beoboot/bin/node_up $* || exit 1
> 
> What I'm doing wrong ?
> Thanks,
> Ted
> 
>  #!/bin/bash -x
>  #
>  # A sample how to get NFS modules on a node.
>  # Make sure that /etc/modules.conf.dist for a node does not
>  # define any "install" actions for these
>  #
>  #  Michal Jaegermann, 2004/Aug/19, michal@ha...
>  #
>  #  2004/Oct/15, michal@ha...
>  #   - start portmap and rpc.statd on nodes
>  #   - fix "case m" typo and do not use "-N" option to bpsh
>  
>  node=$1
>  mod=nfs
>  modules=$( grep $mod.ko /lib/modules/$(uname -r)/modules.dep)
>  modules=${modules/:/}
>  modules=$(
>  for m in $modules ; do
>      echo $m
>  done | tac )
>  ( cd /
>      for m in $modules ; do
>         echo $m
>      done
>  ) | ( cd / ; cpio -o -c --quiet ) | bpsh $node cpio -imd --quiet
>  bpsh $node depmod -a
>  for m in $modules ; do
>      m=$(basename $m .ko)
>      m=${m/_/-}
>      case $m in
>         sunrpc)
>             bpsh $node modprobe -i sunrpc
>             bpsh $node mkdir -p /var/lib/nfs/rpc_pipefs
>             bpsh $node mount | grep -q rpc_pipefs || \
>                 bpsh $node mount -t rpc_pipefs sunrpc 
> /var/lib/nfs/rpc_pipefs
>             ;;
>         *)  bpsh $node modprobe -i $m
>      esac
>  done
>  # these are for a benfit of rpc.statd
>  bpsh $node mkdir -p /var/lib/nfs/statd/
>  bpsh $node mkdir -p /var/run
>  bpsh $node portmap
>  bpsh $node rpc.rstatd
> 
> #mount -t nfs MASTER:/public/home /u    -o 
> nfsvers=3,tcp,bg,rw,rsize=16384,wsize=16384,hard,intr
> #mount -t nfs MASTER:/scratch /scratch1 -o 
> nfsvers=3,tcp,bg,rw,rsize=16384,wsize=16384,hard,intr
> #mount -t nfs MASTER:/public/code /code -o 
> nfsvers=3,tcp,bg,rw,rsize=16384,wsize=16384,hard,intr
> 
> 
> Daniel Gruner wrote:
> 
> >Ted,
> >
> >See the posting from Michal Jagermann on Oct 16.  You need to run
> >both portmap and rpc.statd on the nodes, and then mounting and umounting
> >work fine.
> >
> >Daniel
> >
> >
> >On Mon, Oct 18, 2004 at 11:01:59AM -0400, Ted Sariyski wrote:
> >  
> >
> >>Finally I was able to build a customized version of clustermatic with
> >>kernel 2.6.7 for AMD64. All nodes use Tian B2882 Transport GX28
> >>mainboard, the head node have two SATA hard disks running in RAID1 mode
> >>and I use PXE to boot the diskless nodes (only 16 nodes). 
> >>
> >>I have a couple of questions concerning mounting remote file systems, it
> >>takes really long. Besides some nodes come up fast while for other it
> >>takes 5-10 minutes. For example node1 boots in 2-3 minutes while node0
> >>issued errors on the console (there are not records on the log file):
> >> 
> >>mmap failed: /lib64/ld-2.3.3.so
> >>vmadump: mmap failed: /lib64/ld-2.3.3.so
> >>portmap: server localhost not responding, time out
> >>RPC: failed to contact portmap
> >>Lockd_up: no pid, 2 users??
> >>
> >>before somehow come up:
> >>
> >>[root@xtreme101 root]# bpsh 0 mount
> >>rootfs on / type rootfs (rw)
> >>none on /proc type proc (rw,nodiratime)
> >>none on /bpfs type bpfs (rw)
> >>192.168.0.101:/home on /home type nfs
> >>(rw,v3,rsize=32768,wsize=32768,hard,udp,nolock,addr=192.168.0.101)
> >>192.168.0.200:/public/home on /u type nfs
> >>(rw,v3,rsize=16384,wsize=16384,hard,intr,tcp,lock,addr=192.168.0.200)
> >>192.168.0.200:/scratch on /scratch1 type nfs
> >>(rw,v3,rsize=16384,wsize=16384,hard,intr,tcp,lock,addr=192.168.0.200)
> >>192.168.0.200:/public/code on /code type nfs
> >>(rw,v3,rsize=16384,wsize=16384,hard,intr,tcp,lock,addr=192.168.0.200)
> >>
> >>Currently I work only with three nodes and I believe that it's not a PXE
> >>issue. What is the meaning of mmap and portmap errors issued by node0?
> >>Is it normal for mount to take so long or I miss something in config?
> >>
> >>Thanks,
> >>Ted
> >>
> >>
> >>
> >>
> >>
> >>-------------------------------------------------------
> >>This SF.net email is sponsored by: IT Product Guide on ITManagersJournal
> >>Use IT products in your business? Tell us what you think of them. Give us
> >>Your Opinions, Get Free ThinkGeek Gift Certificates! Click to find out more
> >>http://productguide.itmanagersjournal.com/guidepromo.tmpl
> >>_______________________________________________
> >>BProc-users mailing list
> >>BPr...@li...
> >>https://lists.sourceforge.net/lists/listinfo/bproc-users
> >>    
> >>
> >
> >  
> >
> 

-- 

Dr. Daniel Gruner                        dg...@ti...
Dept. of Chemistry                       dan...@ut...
University of Toronto                    phone:  (416)-978-8689
80 St. George Street                     fax:    (416)-978-5325
Toronto, ON  M5S 3H6, Canada             finger for PGP public key

2001	Jan	Feb	Mar	Apr	May	Jun	Jul	Aug	Sep	Oct (25)	Nov	Dec (22)
2002	Jan (13)	Feb (22)	Mar (39)	Apr (10)	May (26)	Jun (23)	Jul (38)	Aug (20)	Sep (27)	Oct (76)	Nov (32)	Dec (11)
2003	Jan (8)	Feb (23)	Mar (12)	Apr (39)	May (1)	Jun (48)	Jul (35)	Aug (15)	Sep (60)	Oct (27)	Nov (9)	Dec (32)
2004	Jan (8)	Feb (16)	Mar (40)	Apr (25)	May (12)	Jun (33)	Jul (49)	Aug (39)	Sep (26)	Oct (47)	Nov (26)	Dec (36)
2005	Jan (29)	Feb (15)	Mar (22)	Apr (1)	May (8)	Jun (32)	Jul (11)	Aug (17)	Sep (9)	Oct (7)	Nov (15)	Dec

bproc-users Mailing List for BProc: Beowulf Distributed Process Space (Page 11)

bproc-users — General discussion about BProc.