For quite some time (maybe even on the old 2.4 based kernel) we've occasionally seen a problem where the /proc filesystem seems to be deadlocked - any attempt to read /proc hangs.
When this happens rebooting one the nodes (not any node, it has to be the "right" one) will free up the system and things will continue as normal.
Today I just noticed that when I rebooted the node that was "causing" the problem I had the following messages on the init node:
Node 6 has gone down!!!
Assertion failed! origin_lock != ((void *)0), cluster/ssi/vproc/dvp_pvpsops.c, pvpsop_get_execnode, line=376
nm_add_node: Node 6 added
Is this a clue?
Log in to post a comment.