From: Cristi M. <Cri...@ne...> - 2011-01-14 11:15:40
|
Hello again, Unfortunately it doesn't work like this - either with: # /opt/vizstack/sbin/vs-configure-system -s slurm node[1-4] or # /opt/vizstack/sbin/vs-configure-standalone - on the master node(with GPU) I received the same result: > Processing Node 'node5'... > Errors happened while trying to get the configuration of node 'node5'. > Reason: > > Traceback (most recent call last): > File "/opt/vizstack/sbin/vs-generate-node-config", line 304, in <module> > gpuInfo = parse_gpuInfo(temp_xconfig) > File "/opt/vizstack/sbin/vs-generate-node-config", line 168, in > parse_gpuInfo > busId = "PCI:%d:%02d:%01d"%(int(busIdParts[0], 16), int(busIdParts[1], > 16), int(busIdParts[2], 16)) > IndexError: list index out of range > srun: error: node1: task 0: Exited with exit code 1 > > > Please fix the above errors & run this tool again Do you have other solutions ? I am using vistack_1.1-3_ubuntu9.10.amb64.deb Thank you in advance. Cristi --------------------------------------------------------------------------- From: Manjunath Sripadarao - 2011-01-14 03:30 Hello, Can you try running the script excluding the Master Node without GPU. For example if node5 is the master node, can you try with # /opt/vizstack/sbin/vs-configure-system -s slurm node[1-4] Regards, - Manju --------------------------------------------------------------------------- On Thu, 2011-01-13 at 16:33 +0000, Cristi Mocan wrote: > Hello, > > After I followed the steps from the admin guide - in order to use Vizstack > in my cluster: ( 1 Master Node -without GPU and 5 clients - nodes with > GPUs: NVIDIA GTS 8800), install Munge, Slurm, Vizstack. > > Slurm works fine and the connection is good: > > 5 idle node[1-5] > > Next step, I want to configure Vizstack with SLURM: > > I run this command on the master node: > > # /opt/vizstack/sbin/vs-configure-system -s slurm node[1-5] > > > > Result: > > > Processing Node 'node1'... > Errors happened while trying to get the configuration of node 'node1'. > Reason: > > Traceback (most recent call last): > File "/opt/vizstack/sbin/vs-generate-node-config", line 304, in <module> > gpuInfo = parse_gpuInfo(temp_xconfig) > File "/opt/vizstack/sbin/vs-generate-node-config", line 168, in > parse_gpuInfo > busId = "PCI:%d:%02d:%01d"%(int(busIdParts[0], 16), int(busIdParts[1], > 16), int(busIdParts[2], 16)) > IndexError: list index out of range > srun: error: node1: task 0: Exited with exit code 1 > > > Please fix the above errors & run this tool again > > > > I have to mention that I have used nvidia-settings: version 260.19.21 and > the latest Vizstack 1.1-3 released. On all the machines I have the OS: > Ubuntu 10.04. I have used ssh in order to connect to each machine in my > cluster - running in separated terminals. > > I also tried to run: > # /opt/vizstack/sbin/vs-configure-standalone > on one of the client nodes, but I received the same result. > > > Can you help me in order to resolve this problem ? > > Thank you in advance ! |