Messages in the archive, by month:

| Year | Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec |
|------|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|-----|
| 2001 |     |     |     |     |     |     |     |     |     | 25  |     | 22  |
| 2002 | 13  | 22  | 39  | 10  | 26  | 23  | 38  | 20  | 27  | 76  | 32  | 11  |
| 2003 | 8   | 23  | 12  | 39  | 1   | 48  | 35  | 15  | 60  | 27  | 9   | 32  |
| 2004 | 8   | 16  | 40  | 25  | 12  | 33  | 49  | 39  | 26  | 47  | 26  | 36  |
| 2005 | 29  | 15  | 22  | 1   | 8   | 32  | 11  | 17  | 9   | 7   | 15  |     |
From: <er...@he...> - 2002-10-10 19:50:05
On Wed, Oct 09, 2002 at 07:38:10PM -0600, Wilton Wong wrote:
> Well after working a bit more it appears I may have a genuine bug in
> _bproc_vrfork_io(), I can seem to only fork off one process at a time, see the
> attached patch to beoboot../node_up/node_up.c and see what I mean.
>
> Anyways we have no ideas here.. and really have no clue as to what is supposed
> to happen or what is not happening ;) this patch was sort of a "we know what
> works" now let's just do that type patch.. not for use in real production
> environments. All I know is that with this patch I am able to boot more than 1
> node at the same time.

It works fine for me. I've seen node_up do 50+ nodes at a time on our cluster here.

The only hidden gotcha that I can think of with vrfork is that the nodes need to be able to reach one another w/ IP. So, if they're on different subnets, etc. you're going to have some trouble. The default route that the boot code puts in is bogus.

If it's working for only one process at once, you *should* be able to force it to always do that w/o hacking anything by sticking "nodeupmaxclients 1" in /etc/beowulf/config.

- Erik
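A minimal sketch of the workaround Erik describes, assuming only the directive name given in his message; the rest of /etc/beowulf/config is left exactly as it already is:

    # /etc/beowulf/config (excerpt)
    # Force node_up to serve one client per worker instead of batching nodes.
    nodeupmaxclients 1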
From: <er...@he...> - 2002-10-10 17:22:40
On Wed, Oct 09, 2002 at 07:47:30PM -0600, Wilton Wong wrote:
> We have been trying to integrate this for sometime without much success, there
> seems to be a deadlock in the kernel, somewhere someone is locking using the
> wrong lock or in the wrong context or something.. when we run more than one
> process per node, in this case "bpsh <node> yes".. eventually (within a matter
> of seconds the kernel is too busy to handle and requests such as responding to
> the bproc heartbeat)
>
> A forced kernel stack dump using lcrash reveals:
>
> ....
> dc61c000 0 1462 1418 0x01 0x00000000 0:0 yes
> dc64c000 0 1463 1415 0x00 0x00000040 402:127 yes
> dc5e6000 0 1464 1418 0x02 0x00000000 25:6 yes
> dc5c2000 0 1465 1415 0x00 0x00000040 402:127 yes
> >> trace dc5e6000
> ================================================================
> STACK TRACE FOR TASK: 0xdc5e6000(yes)
>
> 0 schedule+901 [0xc01197d5]
> 1 schedule_timeout+18 [0xc0126582]
> 2 [bproc]bproc_response_wait+115 [0xe08c3697]
> 3 [bproc]send_process+163 [0xe08c20e3]
> 4 [bproc]do_execmove+126 [0xe08c6eee]
> 5 [bproc]do_bproc+980 [0xe08c7744]
> 6 system_call+44 [0xc0108f94]
> ebx: 00000000 ecx: 00000000 edx: 00000000 esi: 00000000
> edi: 00000000 ebp: 00000000 eax: 00000000 ds: 002b
> es: 002b eip: 40000b50 cs: 0023 eflags: 00000216
> esp: bffffb50 ss: 002b
> ================================================================
>
> And of course if we remove the O(1) scheduler everything works fine.. any help
> in where to look for this problem would be appreciated.
If you're eventually falling down on a ping timeout, it sounds like
something is making a bad scheduling decision. Try commenting out
this snippet from the slave daemon and see if things get better.
p.sched_priority = 1;
if (sched_setscheduler(0, SCHED_FIFO, &p))
syslog(LOG_NOTICE, "Failed to set real-time scheduling for"
" slave daemon.\n");
That's the only even vaguely odd scheduling thing BProc does. For the
rest it's just very uninteresting wait queue and task status (running,
interruptible, etc.) stuff.
- Erik
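For context, a self-contained sketch of what that snippet does; the headers, the wrapper function, and the struct declaration are filled in here and are not part of the quoted slave-daemon source:

    #include <sched.h>
    #include <syslog.h>

    /* Sketch of the real-time scheduling setup quoted above.  Commenting out
     * the sched_setscheduler() call is the experiment Erik suggests. */
    static void slave_set_realtime(void)
    {
        struct sched_param p;

        p.sched_priority = 1;                       /* lowest SCHED_FIFO priority */
        if (sched_setscheduler(0, SCHED_FIFO, &p))  /* pid 0 = the calling process */
            syslog(LOG_NOTICE, "Failed to set real-time scheduling for"
                   " slave daemon.\n");
    }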
From: Wilton W. <ww...@ha...> - 2002-10-10 01:47:45
We have been trying to integrate this for some time without much success; there seems to be a deadlock in the kernel, somewhere someone is locking using the wrong lock or in the wrong context or something.. When we run more than one process per node, in this case "bpsh <node> yes", eventually (within a matter of seconds) the kernel is too busy to handle any requests, such as responding to the bproc heartbeat.

A forced kernel stack dump using lcrash reveals:

....
dc61c000  0  1462  1418  0x01  0x00000000    0:0    yes
dc64c000  0  1463  1415  0x00  0x00000040  402:127  yes
dc5e6000  0  1464  1418  0x02  0x00000000   25:6    yes
dc5c2000  0  1465  1415  0x00  0x00000040  402:127  yes
>> trace dc5e6000
================================================================
STACK TRACE FOR TASK: 0xdc5e6000(yes)

 0 schedule+901 [0xc01197d5]
 1 schedule_timeout+18 [0xc0126582]
 2 [bproc]bproc_response_wait+115 [0xe08c3697]
 3 [bproc]send_process+163 [0xe08c20e3]
 4 [bproc]do_execmove+126 [0xe08c6eee]
 5 [bproc]do_bproc+980 [0xe08c7744]
 6 system_call+44 [0xc0108f94]
   ebx: 00000000  ecx: 00000000  edx: 00000000  esi: 00000000
   edi: 00000000  ebp: 00000000  eax: 00000000  ds: 002b
   es: 002b  eip: 40000b50  cs: 0023  eflags: 00000216
   esp: bffffb50  ss: 002b
================================================================

And of course if we remove the O(1) scheduler everything works fine.. Any help in where to look for this problem would be appreciated.

Thanks
- Wilton

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----
From: Wilton W. <ww...@ha...> - 2002-10-10 01:38:26
Well, after working a bit more it appears I may have a genuine bug in _bproc_vrfork_io(); I seem to be able to fork off only one process at a time. See the attached patch to beoboot../node_up/node_up.c and see what I mean.

Anyways, we have no ideas here.. and really have no clue as to what is supposed to happen or what is not happening ;) This patch was sort of a "we know what works, now let's just do that" type patch.. not for use in real production environments. All I know is that with this patch I am able to boot more than 1 node at the same time. Our local linux hacker had this to say when he "fixed" this: "I don't know what is going on.. this patch is one big hack" ;)

I am currently running:

beoboot-lanl.1.3
bproc-3.2.1
linux-2.4.19 + bproc patches

- Wilton

Original Message Follows:

> I am having a bit of difficulty booting more than one node at the same time,
> (booting works if I stagger the booting) something seems to hang up when I
> reach the point in boeoboot where it starts the node_up worker processes for
> more than 1 node..
>
> In /var/beowulf/node.2 .. node.3 .. etc... I see it hangs here:
> <SNIP>
> ...
> nodeup : Plugin vmadlib returned status 0 (ok)
> nodeup : No premove function for nodeinfo
> nodeup : Starting 2 child processes.
> </SNIP>
>
> In /var/log/messages
> <SNIP>
> Oct 8 17:43:52 srv001 beoserv: Starting node_up worker for 2 clients.
> </SNIP>
>
> The cluster boots fine if nodeup only starts 1 child process at a time.
>
> <SNIP>
> ...
> nodeup : Plugin vmadlib returned status 0 (ok)
> nodeup : No premove function for nodeinfo
> nodeup : Starting 1 child processes.
> nodeup : Running postmove functions
> nodeup : Calling postmove for kmod
> nodeup : Plugin kmod returned status 0 (ok)
> ...
> </SNIP>
>
> <SNIP>
> Oct 8 17:50:49 srv001 beoserv: Starting node_up worker for 1 clients.
> </SNIP>

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----
From: Wilton W. <ww...@ha...> - 2002-10-09 02:40:38
Silly me.. error on my part.. disregard my silliness..

- Wilton

On Tue, 08 Oct 2002, Wilton Wong wrote:
> I am having a bit of difficulty booting more than one node at the same time,
> (booting works if I stagger the booting) something seems to hang up when I
> reach the point in boeoboot where it starts the node_up worker processes for
> more than 1 node..
>
> In /var/beowulf/node.2 .. node.3 .. etc... I see it hangs here:
> <SNIP>
> ...
> nodeup : Plugin vmadlib returned status 0 (ok)
> nodeup : No premove function for nodeinfo
> nodeup : Starting 2 child processes.
> </SNIP>
>
> In /var/log/messages
> <SNIP>
> Oct 8 17:43:52 srv001 beoserv: Starting node_up worker for 2 clients.
> </SNIP>
>
> The cluster boots fine if nodeup only starts 1 child process at a time.
>
> <SNIP>
> ...
> nodeup : Plugin vmadlib returned status 0 (ok)
> nodeup : No premove function for nodeinfo
> nodeup : Starting 1 child processes.
> nodeup : Running postmove functions
> nodeup : Calling postmove for kmod
> nodeup : Plugin kmod returned status 0 (ok)
> ...
> </SNIP>
>
> <SNIP>
> Oct 8 17:50:49 srv001 beoserv: Starting node_up worker for 1 clients.
> </SNIP>
>
> Any clues on where to look for the hang up ?
>
> - Wilton
>
> ----[ Wilton William Wong ]---------------------------------------------
> 11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
> Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
> T5X 1Y3, Canada URL: http://www.harddata.com
> -------------------------------------------------------[ Hard Data Ltd. ]----
>
>
> -------------------------------------------------------
> This sf.net email is sponsored by:ThinkGeek
> Welcome to geek heaven.
> http://thinkgeek.com/sf
> _______________________________________________
> BProc-users mailing list
> BPr...@li...
> https://lists.sourceforge.net/lists/listinfo/bproc-users

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----
From: Wilton W. <ww...@ha...> - 2002-10-08 23:51:30
I am having a bit of difficulty booting more than one node at the same time (booting works if I stagger the booting); something seems to hang up when I reach the point in beoboot where it starts the node_up worker processes for more than 1 node..

In /var/beowulf/node.2 .. node.3 .. etc... I see it hangs here:

<SNIP>
...
nodeup : Plugin vmadlib returned status 0 (ok)
nodeup : No premove function for nodeinfo
nodeup : Starting 2 child processes.
</SNIP>

In /var/log/messages

<SNIP>
Oct 8 17:43:52 srv001 beoserv: Starting node_up worker for 2 clients.
</SNIP>

The cluster boots fine if nodeup only starts 1 child process at a time.

<SNIP>
...
nodeup : Plugin vmadlib returned status 0 (ok)
nodeup : No premove function for nodeinfo
nodeup : Starting 1 child processes.
nodeup : Running postmove functions
nodeup : Calling postmove for kmod
nodeup : Plugin kmod returned status 0 (ok)
...
</SNIP>

<SNIP>
Oct 8 17:50:49 srv001 beoserv: Starting node_up worker for 1 clients.
</SNIP>

Any clues on where to look for the hang up ?

- Wilton

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----
From: steven j. <py...@li...> - 2002-10-08 11:25:23
Greetings,

There are a good many sensors out there. I focus on the ones used in motherboards I deal with (thus, the three that I have). There is a natural performance advantage to reading one file over several for sensor data. It might be best to use the scan option as a compatibility mode where a specialized driver is not available.

The Tyan does take a while. That's a function of the way the chip itself samples data. I avoid the hit by having the driver understand that the granularity of sensor data is 5 seconds (actually it's more like 1, but 5 is 'good enough' and improves performance). I place that in the driver since it's specific to the chip. The SiS950, for example, takes negligible time to read.

In cases where the sensors are available on the LPC as well as i2c, I prefer a standalone driver so that it won't need to track lm_sensors at all. Since sensor information is fairly simple, a standalone driver matures quickly and need not be touched again until the next kernel branch comes out.

G'day,
sjames

On Mon, 7 Oct 2002, Wilton Wong wrote:
> It seems like alot of source to maintain especially since new chips come out
> everyday and lm_sensors is always under active development.. wouldn't it be
> better to have "mon" scan the /proc/sys/dev/sensors/chips file and readout the
> information in each chip ? or have a seperate daemon do this and plug the info
> into "mon" using monhole ?
>
> Also on the Tyan-2466 boards it takes a LONG time to read the sensor data (up
> to a second, maybe even 2 to read all of the sensors) I think that period will
> really adversly affect supermon performance, would "mon" block while sampling the
> temperature data ?
>
> - Wilton
>
> On Mon, 07 Oct 2002, steven james wrote:
>
> > I have drivers for sis950, Winbond w83781d, and adm1021 if they would
> > help.
>
> ----[ Wilton William Wong ]---------------------------------------------
> 11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
> Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
> T5X 1Y3, Canada URL: http://www.harddata.com
> -------------------------------------------------------[ Hard Data Ltd. ]----

--
-------------------------steven james, director of research, linux labs
... ........ ..... .... 230 peachtree st nw ste 701
the original linux labs atlanta.ga.us 30303
-since 1995 http://www.linuxlabs.com
office 404.577.7747 fax 404.577.7743
-----------------------------------------------------------------------
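A rough sketch of the caching idea described above, namely treating a sensor sample as valid for a fixed granularity so a slow chip is not re-read on every request. The names and layout here are hypothetical and do not come from sjames's drivers:

    #include <time.h>

    #define SENSOR_GRANULARITY 5        /* seconds; the assumed "good enough" sample age */

    struct sensor_cache {
        time_t last_read;               /* when the chip was last actually read */
        int    temp;                    /* cached reading */
    };

    /* Placeholder for the slow hardware access (the expensive part on the Tyan). */
    static int read_chip(void) { return 42; }

    int sensor_get_temp(struct sensor_cache *c)
    {
        time_t now = time(NULL);

        /* Only touch the hardware if the cached sample is older than the granularity. */
        if (now - c->last_read >= SENSOR_GRANULARITY) {
            c->temp = read_chip();
            c->last_read = now;
        }
        return c->temp;
    }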
From: Wilton W. <ww...@ha...> - 2002-10-08 03:55:13
It seems like a lot of source to maintain, especially since new chips come out every day and lm_sensors is always under active development.. Wouldn't it be better to have "mon" scan the /proc/sys/dev/sensors/chips file and read out the information in each chip ? Or have a separate daemon do this and plug the info into "mon" using monhole ?

Also, on the Tyan-2466 boards it takes a LONG time to read the sensor data (up to a second, maybe even 2, to read all of the sensors). I think that period will really adversely affect supermon performance; would "mon" block while sampling the temperature data ?

- Wilton

On Mon, 07 Oct 2002, steven james wrote:

> I have drivers for sis950, Winbond w83781d, and adm1021 if they would
> help.

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----
From: steven j. <py...@li...> - 2002-10-07 21:18:28
Greetings,

It's not really very hard. Most of the sensor drivers will keep an array of client structures and have an update function that expects a pointer to that struct. The struct itself has a pointer to a data struct with the actual sensor data.

I generally just add a line in the probe function to store the client structs in a different array just for supermon. The convert_value function calls the update function, then just formats the data for proc.

I have drivers for sis950, Winbond w83781d, and adm1021 if they would help.

G'day,
sjames

On Mon, 7 Oct 2002, Wilton Wong wrote:
>
> On Mon, 07 Oct 2002, steven james wrote:
>
> > Currently, I'm using method number 3. But I need to update to the latest
> > release and evaluate if that is still how I want to go.
>
> I see some of your code inside the newest supermon packages.. how hard is it to
> add new sensor chips ?
>
> - Wilton
>
> ----[ Wilton William Wong ]---------------------------------------------
> 11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
> Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
> T5X 1Y3, Canada URL: http://www.harddata.com
> -------------------------------------------------------[ Hard Data Ltd. ]----

--
-------------------------steven james, director of research, linux labs
... ........ ..... .... 230 peachtree st nw ste 701
the original linux labs atlanta.ga.us 30303
-since 1995 http://www.linuxlabs.com
office 404.577.7747 fax 404.577.7743
-----------------------------------------------------------------------
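The pattern is easier to see in code. The sketch below is purely illustrative: every name in it (my_chip_client, my_chip_update, supermon_clients, the convert_value signature) is hypothetical and is not taken from the real supermon or lm_sensors sources:

    #include <stdio.h>

    /* Per-chip data as the driver keeps it (hypothetical layout). */
    struct my_chip_data   { int temp; int fan; };
    struct my_chip_client { struct my_chip_data *data; };

    #define MAX_CLIENTS 8
    static struct my_chip_client *supermon_clients[MAX_CLIENTS]; /* extra array kept for supermon */
    static int supermon_nclients;

    /* Driver's existing update routine: refreshes client->data from the hardware. */
    static void my_chip_update(struct my_chip_client *client) { (void)client; }

    /* The "one line in the probe function": remember each detected client. */
    static void my_chip_probe_hook(struct my_chip_client *client)
    {
        if (supermon_nclients < MAX_CLIENTS)
            supermon_clients[supermon_nclients++] = client;
    }

    /* convert_value-style helper: refresh the chip, then format the readings for proc. */
    static int convert_value(char *buf, int n)
    {
        struct my_chip_client *c = supermon_clients[n];
        my_chip_update(c);
        return sprintf(buf, "%d %d", c->data->temp, c->data->fan);
    }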
From: Wilton W. <ww...@ha...> - 2002-10-07 20:01:01
On Mon, 07 Oct 2002, steven james wrote:

> Currently, I'm using method number 3. But I need to update to the latest
> release and evaluate if that is still how I want to go.

I see some of your code inside the newest supermon packages.. how hard is it to add new sensor chips ?

- Wilton

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----
From: steven j. <py...@li...> - 2002-10-07 19:31:46
Greetings,

Currently, I'm using method number 3. But I need to update to the latest release and evaluate if that is still how I want to go.

G'day,
sjames

On Mon, 7 Oct 2002, Wilton Wong wrote:

> We are just experimenting with supermon-1.3 and I was wondering what is the
> best way to add lm_sensors data into supermon, I figure there is 3 feasable
> ways:
>
> 1. Hack the lm_sensors modules to add data to /proc/sys/supermon (eww too much
> work)
> 2. Hack the lm_sensors sensord to work with mon using the monhole
> 3. Hack mon to add lm_sensors data read from /proc/dev/...
>
> Comments ?
>
> - Wilton
>
> ----[ Wilton William Wong ]---------------------------------------------
> 11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
> Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
> T5X 1Y3, Canada URL: http://www.harddata.com
> -------------------------------------------------------[ Hard Data Ltd. ]----
>
>
> -------------------------------------------------------
> This sf.net email is sponsored by:ThinkGeek
> Welcome to geek heaven.
> http://thinkgeek.com/sf
> _______________________________________________
> BProc-users mailing list
> BPr...@li...
> https://lists.sourceforge.net/lists/listinfo/bproc-users
>

--
-------------------------steven james, director of research, linux labs
... ........ ..... .... 230 peachtree st nw ste 701
the original linux labs atlanta.ga.us 30303
-since 1995 http://www.linuxlabs.com
office 404.577.7747 fax 404.577.7743
-----------------------------------------------------------------------
From: Wilton W. <ww...@ha...> - 2002-10-07 19:17:00
We are just experimenting with supermon-1.3 and I was wondering what is the best way to add lm_sensors data into supermon. I figure there are 3 feasible ways:

1. Hack the lm_sensors modules to add data to /proc/sys/supermon (eww, too much work)
2. Hack the lm_sensors sensord to work with mon using the monhole
3. Hack mon to add lm_sensors data read from /proc/dev/...

Comments ?

- Wilton

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----
From: Nicholas H. <he...@se...> - 2002-09-30 20:26:53
Thanks for the patch -- we will test that in the next few days as we get a chance to get it onto machines.

Thanks!
Nic

--
Nicholas Henke
Linux cluster system programmer
University of Pennsylvania
he...@se... - 215.573.8149
From: Erik A. H. <hen...@la...> - 2002-09-30 19:58:31
On Mon, Sep 30, 2002 at 11:32:11AM -0400, Thomas Clausen wrote:
> Dear Erik,
>
> I recently installed a new cluster with the newest cm. We need a scheduler
> system and I have been looking at bjs. I would like to work on it. Is there a
> newer version I should be working on?

Yup. I've rewritten most of it. It should be a lot cleaner and safer now - I hope :). I just put another tarball on sourceforge. You will also need "cmtools", a library of random utility stuff that it uses.

- Erik
From: <er...@he...> - 2002-09-27 17:42:08
On Fri, Sep 27, 2002 at 10:55:15AM -0400, Nicholas Henke wrote:
> After upgrading to 2.4.18, the machines are staying alive after the oops, so I
> have gotten a decent ksymoops of the problem. It looks to be a definate bproc
> issue --a bproc_hook_proc_ppid is where it traced the error to. If there is
> anything I can do to get more info on this oops, the machine is still up and
> working.
...
> >>EIP; f898cf27 <[bproc]bproc_hook_proc_ppid+57/68> <=====
> Trace; c015c965 <proc_pid_stat+275/2f0>
> Trace; c011d054 <do_exit+254/270>
> Trace; c015a593 <proc_info_read+63/120>
> Trace; c013b1f6 <sys_read+96/120>
> Trace; c010734b <system_call+33/38>

I think this back trace is at least partly bogus. do_exit doesn't call proc_pid_stat. In any case there does appear to be a locking goof in proc_pid_stat w/ the proc_ppid hook.

Unfortunately, the quickest way to fix this seems to be by modifying procfs just a bit. Try this patch for fs/proc/array.c in the kernel and let me know if it fixes your problem:

--- linux-2.4.18/fs/proc/array.c.orig   Fri Sep 27 11:08:37 2002
+++ linux-2.4.18/fs/proc/array.c        Fri Sep 27 11:12:01 2002
@@ -305,7 +305,7 @@
     sigset_t sigign, sigcatch;
     char state;
     int res;
-    pid_t ppid;
+    pid_t pid, ppid;
     struct mm_struct *mm;
@@ -343,15 +343,17 @@
     nice = task->nice;

     read_lock(&tasklist_lock);
-    ppid = task->pid ? task->p_opptr->pid : 0;
+    pid = bproc_hook_imv(task->pid, proc_pid, (task));
+    ppid = task->pid ?
+        bproc_hook_imv(task->p_opptr->pid, proc_ppid, (task)) : 0;
     read_unlock(&tasklist_lock);

     res = sprintf(buffer,"%d (%s) %c %d %d %d %d %d %lu %lu \
%lu %lu %lu %lu %lu %ld %ld %ld %ld %ld %ld %lu %lu %ld %lu %lu %lu %lu %lu \
%lu %lu %lu %lu %lu %lu %lu %lu %d %d\n",
-        bproc_hook_imv(task->pid, proc_pid, (task)),
+        pid,
         task->comm,
         state,
-        bproc_hook_imv(ppid, proc_ppid, (task)),
+        ppid,
         task->pgrp,
         task->session,
         tty_nr,
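The shape of the fix, reduced to a sketch: values that are only stable while tasklist_lock is held (here the pids resolved through the bproc hooks) are copied into locals inside the locked region, and only the local snapshots are used afterwards. The names below are generic stand-ins, not the procfs code itself:

    #include <stdio.h>
    #include <pthread.h>

    /* Generic illustration of the copy-under-lock pattern used in the patch above. */
    static pthread_rwlock_t tasklist_lock = PTHREAD_RWLOCK_INITIALIZER;

    struct task { int pid; struct task *parent; };

    int format_stat(char *buf, struct task *task)
    {
        int pid, ppid;

        pthread_rwlock_rdlock(&tasklist_lock);      /* parent pointer is stable only here */
        pid  = task->pid;
        ppid = task->parent ? task->parent->pid : 0;
        pthread_rwlock_unlock(&tasklist_lock);

        /* Slow formatting happens outside the lock, using the snapshots. */
        return sprintf(buf, "%d %d\n", pid, ppid);
    }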
From: Nicholas H. <he...@se...> - 2002-09-27 14:55:25
After upgrading to 2.4.18, the machines are staying alive after the oops, so I have gotten a decent ksymoops of the problem. It looks to be a definite bproc issue -- bproc_hook_proc_ppid is where it traced the error to. If there is anything I can do to get more info on this oops, the machine is still up and working.

Nic

Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000010
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: f898cf27
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: *pde = 00000000
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Oops: 0000
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: CPU: 1
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: EIP: 0010:[<f898cf27>] Not tainted
Using defaults from ksymoops -t elf32-i386
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: EFLAGS: 00010202
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: eax: 00000000 ebx: 00000000 ecx: 00000002 edx: f6eea000
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: esi: f6eea000 edi: 00000000 ebp: ffffffff esp: f6ebfe8c
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: ds: 0018 es: 0018 ss: 0018
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Process ps (pid: 15061, stackpage=f6ebf000)
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Stack: c015c965 f6eea000 000002ca 000002ca 00000000 ffffffff 00000004 0000003e
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel:        0000027a 000000d7 00000217 00000000 00000000 00000005 00000001 00000000
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel:        00000000 00000000 00000000 0003eabf 00000000 00000000 ffffffff 00000000
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Call Trace: [proc_pid_stat+629/752] [do_exit+596/624] [proc_info_read+99/288] [sys_read+150/288] [system_call+51/56]
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Call Trace: [<c015c965>] [<c011d054>] [<c015a593>] [<c013b1f6>] [<c010734b>]
Sep 27 10:38:59 node40.io.liniac.upenn.edu kernel: Code: 8b 40 10 c3 90 8b 82 94 00 00 00 8b 40 7c c3 89 f6 f6 05 00
Error (Oops_bfd_perror): scan_arch for specified architecture Success

>>EIP; f898cf27 <[bproc]bproc_hook_proc_ppid+57/68> <=====
Trace; c015c965 <proc_pid_stat+275/2f0>
Trace; c011d054 <do_exit+254/270>
Trace; c015a593 <proc_info_read+63/120>
Trace; c013b1f6 <sys_read+96/120>
Trace; c010734b <system_call+33/38>
Code; f898cf27 <[bproc]bproc_hook_proc_ppid+57/68>
00000000 <_EIP>:
Code; f898cf27 <[bproc]bproc_hook_proc_ppid+57/68> <=====
   0: 8b 40 10                mov 0x10(%eax),%eax <=====
Code; f898cf2a <[bproc]bproc_hook_proc_ppid+5a/68>
   3: c3                      ret
Code; f898cf2b <[bproc]bproc_hook_proc_ppid+5b/68>
   4: 90                      nop
Code; f898cf2c <[bproc]bproc_hook_proc_ppid+5c/68>
   5: 8b 82 94 00 00 00       mov 0x94(%edx),%eax
Code; f898cf32 <[bproc]bproc_hook_proc_ppid+62/68>
   b: 8b 40 7c                mov 0x7c(%eax),%eax
Code; f898cf35 <[bproc]bproc_hook_proc_ppid+65/68>
   e: c3                      ret
Code; f898cf36 <[bproc]bproc_hook_proc_ppid+66/68>
   f: 89 f6                   mov %esi,%esi
Code; f898cf38 <[bproc]bproc_hook_proc1+0/80>
  11: f6 05 00 00 00 00 00    testb $0x0,0x0

--
Nicholas Henke
Linux cluster system programmer
University of Pennsylvania
From: Wilton W. <ww...@ha...> - 2002-09-26 07:10:42
On Wed, 25 Sep 2002, Stanley, Matthew D. wrote:
> I have been trying to make gigabit networking work in a 4 node cluster now
> for several weeks. We have been trying both the Scyld 27Z-8 release and the
> Clustermatic March 02 release with RH 7.2. Both situations produce similar
> results. Using Dlink DGE-500T and Intel PRO/1000 gigabit ethernet cards we
> have problems booting the network boot.img from the master server. I have

I have had nothing but trouble with the DGE-500T's. We tracked them down to a problem with the Tyan 2466 motherboards we are using and to a BIOS-related PCI interrupt issue, and are working with Tyan to resolve this. It's not really high priority, since we don't have the same problem with the Intel e1000 cards and the cost difference now is negligible.

I will see if I can scrounge up a couple of e1000 cards sometime this week and see if I can duplicate your problem in our setup here. I only have a few questions:

Which e1000 drivers are you using ?
What is your motherboard ?
What switches are you using ?

> I realize this means more administration work, but can I just install all
> four machines identical and then use bpslave instead of the beoboot system?
> If I do this, what modifications are required to the scripts (ie. node_up) to
> provide similar functionality. These clusters are just used for NAMD
> research.

In this case I believe all node_up really has to do is exit with a "0" error number, ie:

<SNIP>
#!/bin/sh
exit 0
</SNIP>

I don't think any other modification is necessary.. tho' the bpslave file caching code for library loading on demand may be a bit screwy.. I haven't tried this configuration.

> I read somewhere else on the list where Erik suggested using the mcastbcast
> ethX parameter to fix these issues but it didn't seem to solve mine unless I

The mcastbcast option makes normally multicast packets (file transfers in bproc) into broadcast packets; this is to accommodate switches/hardware that doesn't handle multicast packets quickly.

I have been pretty swamped this week, doing other stuff for Hard Data Ltd., but soon I will have our patched and updated version of BProc ready for download from our website or ftp server. I'll announce it on the list when I put it up.

- Wilton

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----
From: Stanley, M. D. <bc...@mi...> - 2002-09-25 20:55:01
I have been trying to make gigabit networking work in a 4 node cluster now for several weeks. We have been trying both the Scyld 27Z-8 release and the Clustermatic March 02 release with RH 7.2. Both situations produce similar results. Using Dlink DGE-500T and Intel PRO/1000 gigabit ethernet cards we have problems booting the network boot.img from the master server. I have tried using the newest drivers for each card, and even recompiling the kernel after updating the drivers to ensure no dependency issues. After the boot disk finds the network cards with the updated driver, it will grab an IP from the master server and attempt to load the boot image. It typically will ask for the boot.img and then not go any further, and then issue eth transmit timed out messages. The drivers work great on the eth0 internet side, works like a champ, no problems whatsoever; they just don't appear to work on the cluster side. I have tried using switches and a 100mbit hub and neither work. Just for reference, I have had these same four machines working with via-rhine and 3c59x network drivers with no problems under both the Scyld and Clustermatic releases; it just appears that neither want to work with the gigabit cards.

I realize this means more administration work, but can I just install all four machines identical and then use bpslave instead of the beoboot system? If I do this, what modifications are required to the scripts (ie. node_up) to provide similar functionality? These clusters are just used for NAMD research.

I read somewhere else on the list where Erik suggested using the mcastbcast ethX parameter to fix these issues, but it didn't seem to solve mine unless I didn't use it correctly; all I did was add a line to the config file with mcastbcast eth1, save the file, and restart the daemons.

Any help would be greatly appreciated!

Matt Stanley
Systems Administrator
Structural Biology Core
University of Missouri - Columbia
From: Nicholas H. <he...@se...> - 2002-09-21 16:50:03
nope -- all adaptec:
scsi0 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.8
<Adaptec aic7899 Ultra160 SCSI adapter>
aic7899: Ultra160 Wide Channel A, SCSI Id=7, 32/253 SCBs
scsi1 : Adaptec AIC7XXX EISA/VLB/PCI SCSI HBA DRIVER, Rev 6.2.8
<Adaptec aic7899 Ultra160 SCSI adapter>
aic7899: Ultra160 Wide Channel B, SCSI Id=7, 32/253 SCBs
On Sat, 21 Sep 2002, Wilton Wong wrote:
> Are you running a symbois/LSI based SCSI controller ? just wondering, becaise
> we had a whole bunch of "running short of DMA buffer" errors untill we switched
> to the newer sym83cxx driver.. we woulget a whole bunch of these errors after a
> random amount of disk activity and then boom we would oops..
>
> - Wilton
>
From: Wilton W. <ww...@ha...> - 2002-09-21 16:42:27
Are you running a Symbios/LSI based SCSI controller ? Just wondering, because we had a whole bunch of "running short of DMA buffer" errors until we switched to the newer sym83cxx driver.. we would get a whole bunch of these errors after a random amount of disk activity and then boom, we would oops..

- Wilton

On Sat, 21 Sep 2002, Nicholas Henke wrote:
> I am running both a 2.4.17 and 2.4.19 vanilla patched with bproc. I have 2
> oops traces from klogd, that appear to be related. I am not sure if this
> is a bproc or kernel issue. The kernels have been running on Dell 1550
> PIII Dual machines with 2GB ram and 2GB swap and IBM x330s with the same
> setups. I have seen about 20 machines toss this same oops in the last few
> days, all when under heavy load and memory pressure. After oopsing, the
> machine will remain resonsive and can run processes, baring ps, top,
> shutdown and reboot -- I am sure there are more that won't run :) I would
> greatly appreciate any help -- I have almost 200 machines running these
> kernels. The oopsing has not just started recently -- I have seen it all
> along, but have never been able to get the decoded info from klogd, and it
> is just now becoming a problem with so many machines oopsing.
>
> Attached is one file with 2 oops reports from klogd.
> Nic
>
> --
> Nicholas Henke
> Linux cluster system programmer
> University of Pennsylvania
> he...@se... - 215.573.8149

Content-Description: oops traces
> -------- 2.4.17 --------------
>
> Sep 19 18:49:10 node25 kernel: kernel BUG at page_alloc.c:84!
> Sep 19 18:49:10 node25 kernel: invalid operand: 0000
> Sep 19 18:49:10 node25 kernel: CPU: 1
> Sep 19 18:49:10 node25 kernel: EIP: 0010:[__free_pages_ok+169/832] Not tainted
> Sep 19 18:49:10 node25 kernel: EIP: 0010:[<c013c0d9>] Not tainted
> Sep 19 18:49:10 node25 kernel: EFLAGS: 00010286
> Sep 19 18:49:10 node25 kernel: eax: 0000001f ebx: c1b3ffa0 ecx: c0288424 edx: 000036aa
> Sep 19 18:49:10 node25 kernel: esi: c1b3ffa0 edi: 00000000 ebp: 00000000 esp: c5d8dee0
> Sep 19 18:49:10 node25 kernel: ds: 0018 es: 0018 ss: 0018
> Sep 19 18:49:10 node25 kernel: Process ps (pid: 1326, stackpage=c5d8d000)
> Sep 19 18:49:10 node25 kernel: Stack: c025c74d 00000054 ea5a4a41 e947f35c e947f000 ea5a4000 bfff0018 0000090f
> Sep 19 18:49:10 node25 kernel:        c1b3ffa0 e947f90f e947f90f c0125607 c5d8c000 e947f000 f21a270c c1b3ffa0
> Sep 19 18:49:10 node25 kernel:        e0e9b980 f21a270c e9092000 e947f000 e947f000 c01691fa e9092000 bffff6e5
> Sep 19 18:49:10 node25 kernel: Call Trace: [access_process_vm+439/560] [proc_pid_environ+186/208] [proc_info_read+99/288] [filp_open+77/96] [sys_read+150/208]
> Sep 19 18:49:10 node25 kernel: Call Trace: [<c0125607>] [<c01691fa>] [<c0169653>] [<c01442ed>] [<c0144e76>]
> Sep 19 18:49:10 node25 kernel: [sys_open+203/304] [system_call+51/56]
> Sep 19 18:49:10 node25 kernel: [<c01446db>] [<c01078ab>]
> Sep 19 18:49:10 node25 kernel:
> Sep 19 18:49:10 node25 kernel: Code: 0f 0b 5e 5f 8b 43 18 a9 80 00 00 00 74 10 6a 56 68 4d c7 25
> Sep 20 04:02:14 node25 syslogd 1.4.1: restart.
> Sep 20 14:00:00 node25 sshd(pam_unix)[1655]: session opened for user bindu by (uid=0)
> Sep 20 14:27:10 node25 sshd(pam_unix)[1655]: session closed for user bindu
> Sep 20 15:22:51 node25 sshd(pam_unix)[1683]: session opened for user root by (uid=0)
> Sep 20 15:23:37 node25 sshd(pam_unix)[1731]: session opened for user henken by (uid=0)
> Sep 20 15:23:37 node25 sshd(pam_unix)[1731]: session closed for user henken
> Sep 20 15:23:44 node25 sshd(pam_unix)[1739]: session opened for user henken by (uid=0)
> Sep 20 15:23:44 node25 sshd(pam_unix)[1739]: session closed for user henken
>
> ----------- 2.4.19 ------
>
> Sep 21 10:15:34 node25.io.liniac.upenn.edu kernel: Warning - running *really* short on DMA buffers
> Sep 21 10:39:10 node25.io.liniac.upenn.edu last message repeated 156 times
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: Unable to handle kernel NULL pointer dereference at virtual address 00000010
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: printing eip:
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: f89cee23
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: *pde = 00000000
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: Oops: 0000
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: CPU: 1
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: EIP: 0010:[<f89cee23>] Not tainted
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: EFLAGS: 00010202
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: eax: 00000000 ebx: 00000000 ecx: 00000002 edx: f70c0000
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: esi: f70c0000 edi: 00000000 ebp: ffffffff esp: daabbe8c
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: ds: 0018 es: 0018 ss: 0018
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: Process ps (pid: 2266, stackpage=daabb000)
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: Stack: c015f525 f70c0000 000002cc 000002cc 00000000 ffffffff 00000004 0000002c
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel:        00000000 00000088 00000000 00000002 00000000 00000000 00000000 00000013
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel:        00000000 00000000 00000000 004a190c 00000000 00000000 ffffffff 00000000
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: Call Trace: [proc_pid_stat+629/752] [do_exit+719/736] [proc_info_read+99/288] [sys_read+150/272] [sys_open+87/160]
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: Call Trace: [<c015f525>] [<c011f86f>] [<c015d0a3>] [<c013de76>] [<c013d827>]
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: [system_call+51/56]
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: [<c0108cfb>]
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel:
> Sep 21 10:39:10 node25.io.liniac.upenn.edu kernel: Code: 8b 40 10 c3 90 8b 82 94 00 00 00 8b 40 7c c3 89 f6 f6 05 60

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----
From: Nicholas H. <he...@se...> - 2002-09-21 16:20:10
I am running both a 2.4.17 and a 2.4.19 vanilla kernel patched with bproc. I have 2 oops traces from klogd that appear to be related. I am not sure if this is a bproc or kernel issue. The kernels have been running on Dell 1550 PIII Dual machines with 2GB ram and 2GB swap, and IBM x330s with the same setups. I have seen about 20 machines toss this same oops in the last few days, all when under heavy load and memory pressure. After oopsing, the machine will remain responsive and can run processes, barring ps, top, shutdown and reboot -- I am sure there are more that won't run :) I would greatly appreciate any help -- I have almost 200 machines running these kernels. The oopsing has not just started recently -- I have seen it all along, but have never been able to get the decoded info from klogd, and it is just now becoming a problem with so many machines oopsing.

Attached is one file with 2 oops reports from klogd.

Nic

--
Nicholas Henke
Linux cluster system programmer
University of Pennsylvania
he...@se... - 215.573.8149
From: Jack N. <jj...@pa...> - 2002-09-18 15:11:19
Found my problem.. typo in Makefile.conf. Node boots nicely now. But modutils are still hosed... oh well...

Thanks!
Jack Neely

--
Jack Neely <sl...@qu...>
Linux Realm Kit Administration and Development
PAMS Computer Operations at NC State University
GPG Fingerprint: 1917 5AC1 E828 9337 7AA4 EA6B 213B 765F 3B6A 5B89
From: Jack N. <jj...@pa...> - 2002-09-18 13:46:47
Replying to my own message...

I've been able to forward port modutils. The RPMs and SRPM are on rk-devel with the other stuff. Does it work? Heck if I know. I've installed the new package on my head node, rebuilt beoboot and reinstalled it, but I still get the same error on my node: "Error getting file cache FD: Invalid argument" as produced by "bpslave -d -i -c /rootfs -p 10001 192.186.1.10".

If I haven't mentioned before, this is BProc 3.2.0 and beoboot lanl.1.2.

Thanks for your help!
Jack Neely

On Tue, Sep 17, 2002 at 04:43:09PM -0400, Jack Neely wrote:
> Heh...no...I'm using stock 2.4.19 compiled in a rather Red Hat-ish
> fashion. (This will end up as part of a kit that overlays Red Hat Linux
> 7.3.)
>
> The RPMs I'm building can be found at
>
> ftp://rk-devel.pams.ncsu.edu/ncsubeo/7.3
>
> They are designed to overlay our kit that is at
>
> http://www.linux.ncsu.edu/realmkit
>
> Thanks!
> Jack Neely
>
> On Tue, Sep 17, 2002 at 02:32:26PM -0600, Wilton Wong wrote:
> >
> > Oh if you patched the "Red Hat" kernel with the Bproc patch and tried to make
> > it work.. you will run into scheduler problems there is issues with bproc vs.
> > the O(1) scheduler inside the Red Hat kernel.
> >
> > - Wilton
> >
> > ----[ Wilton William Wong ]---------------------------------------------
> > 11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
> > Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
> > T5X 1Y3, Canada URL: http://www.harddata.com
> > -------------------------------------------------------[ Hard Data Ltd. ]----
> >
> >
> > -------------------------------------------------------
> > This SF.NET email is sponsored by: AMD - Your access to the experts
> > on Hammer Technology! Open Source & Linux Developers, register now
> > for the AMD Developer Symposium. Code: EX8664
> > http://www.developwithamd.com/developerlab
> > _______________________________________________
> > BProc-users mailing list
> > BPr...@li...
> > https://lists.sourceforge.net/lists/listinfo/bproc-users
>
> --
> Jack Neely <sl...@qu...>
> Linux Realm Kit Administration and Development
> PAMS Computer Operations at NC State University
> GPG Fingerprint: 1917 5AC1 E828 9337 7AA4 EA6B 213B 765F 3B6A 5B89
>
> -------------------------------------------------------
> This SF.NET email is sponsored by: AMD - Your access to the experts
> on Hammer Technology! Open Source & Linux Developers, register now
> for the AMD Developer Symposium. Code: EX8664
> http://www.developwithamd.com/developerlab
> _______________________________________________
> BProc-users mailing list
> BPr...@li...
> https://lists.sourceforge.net/lists/listinfo/bproc-users

--
Jack Neely <sl...@qu...>
Linux Realm Kit Administration and Development
PAMS Computer Operations at NC State University
GPG Fingerprint: 1917 5AC1 E828 9337 7AA4 EA6B 213B 765F 3B6A 5B89
From: Jack N. <jj...@pa...> - 2002-09-17 20:43:16
Heh...no...I'm using stock 2.4.19 compiled in a rather Red Hat-ish
fashion. (This will end up as part of a kit that overlays Red Hat Linux
7.3.)
The RPMs I'm building can be found at
ftp://rk-devel.pams.ncsu.edu/ncsubeo/7.3
They are designed to overlay our kit that is at
http://www.linux.ncsu.edu/realmkit
Thanks!
Jack Neely
On Tue, Sep 17, 2002 at 02:32:26PM -0600, Wilton Wong wrote:
>
> Oh if you patched the "Red Hat" kernel with the Bproc patch and tried to make
> it work.. you will run into scheduler problems there is issues with bproc vs.
> the O(1) scheduler inside the Red Hat kernel.
>
> - Wilton
>
> ----[ Wilton William Wong ]---------------------------------------------
> 11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
> Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
> T5X 1Y3, Canada URL: http://www.harddata.com
> -------------------------------------------------------[ Hard Data Ltd. ]----
>
>
>
> -------------------------------------------------------
> This SF.NET email is sponsored by: AMD - Your access to the experts
> on Hammer Technology! Open Source & Linux Developers, register now
> for the AMD Developer Symposium. Code: EX8664
> http://www.developwithamd.com/developerlab
> _______________________________________________
> BProc-users mailing list
> BPr...@li...
> https://lists.sourceforge.net/lists/listinfo/bproc-users
--
Jack Neely <sl...@qu...>
Linux Realm Kit Administration and Development
PAMS Computer Operations at NC State University
GPG Fingerprint: 1917 5AC1 E828 9337 7AA4 EA6B 213B 765F 3B6A 5B89
From: Wilton W. <ww...@ha...> - 2002-09-17 20:32:42
Oh, if you patched the "Red Hat" kernel with the Bproc patch and tried to make it work.. you will run into scheduler problems; there are issues with bproc vs. the O(1) scheduler inside the Red Hat kernel.

- Wilton

----[ Wilton William Wong ]---------------------------------------------
11060-166 Avenue Ph : 01-780-456-9771 High Performance UNIX
Edmonton, Alberta FAX: 01-780-456-9772 and Linux Solutions
T5X 1Y3, Canada URL: http://www.harddata.com
-------------------------------------------------------[ Hard Data Ltd. ]----