[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
re[2]: Clusterwide pids
>> Lars Marowsky-Bree writes:
>> > "Albert D. Cahalan" <acahalan@cs.uml.edu> said:
>> >> How about this: when a node boots, pass it a PID range.
>> >
>> > This leaves us with the chicken and egg problem - how do you
>> > boot a node which is - at the time of boot - unable to contact
>> > the cluster?
>> "Albert D. Cahalan" <acahalan@cs.uml.edu> said:
>> You don't. It is a mistake to design for this perversion.
>> Do you, or do you not want a shared PID space? Make up your
>> mind about this. You don't run an SMP system as multiple
>> uniprocessor systems, then suddenly decide that you want SMP!
This may be good discussion point for the Ottawa meeting. (No I won't be there.)
I have to agree that in order to get a shared PID space, it seems best to do it right off the bat.
I am most familiar with TruClusters from Compaq, and that is what they do.
If you must boot a TruCluster node, or set of nodes, without having quoram, then you have a separate diagnostic boot process you can manually use to override the quarom value.
I don't have the details handy, but effectively it is like:
boot kernel -flags quoram_overide=n
Where n becomes your new quoram value for this one boot.
It is not typically used, but, for instance, if you have a 5 node cluster and 3 of the nodes die, you need a way to get the cluster back operational.
Greg Freemyer
Internet Engineer
Deployment and Integration Specialist
The Norcross Group
www.NorcrossGroup.com
Linux-cluster: generic cluster infrastructure for Linux
Archive: http://mail.nl.linux.org/linux-cluster/