Problem bringing down cluster or node.
Hi All,
I'm having an issue when I try to either shut down the entire cluster, or just one node from a cluster. I'm running Solaris 10 with Sun Cluster 3.1, on two V240's, when the node is shutting down, everything seems OK, then it outputs the messages shown below before completely hanging... No syncing disks, then OK prompt... Nothing. Any clues as to what's occuring?
- --
Oct 9 10:32:48 dione xntpd[482]: [ID 866926 daemon.notice] xntpd exiting on signal 15
Oct 9 10:32:49 dione FIN_SVC_CTRL: [ID 702911 local0.error] Warning:Because one or more of the sun cluster userland cluster services are offline this service goes offline
Oct 9 10:32:52 dione cl_eventlogd[2075]: [ID 247336 daemon.error] Going down on signal 15.
Oct 9 10:32:52 dione INITRGM: [ID 702911 local0.error] Warning: an attempt to stop or disable the svc:/system/cluster/rgm:default service was detected and ignored. A shutdown or reboot in progress is allowed to proceed as normal.
Oct 9 10:32:53 dione Cluster.PNM: [ID 226280 daemon.notice] PNM daemon exiting.
Oct 9 10:32:53 dione INITFED: [ID 702911 local0.error] Warning: an attempt to stop or disable the svc:/system/cluster/rpc-fed:default service was detected and ignored. A shutdown or reboot in progress is allowed to proceed as normal.
Oct 9 10:32:53 dione Cluster.RGM.rgmd: [ID 642220 daemon.error] There is already an instance of this daemon running
Oct 9 10:32:53 dione Cluster.RGM.fed: [ID 642220 daemon.error] There is already an instance of this daemon running
Oct 9 10:33:07 dione Cluster.PMF.pmfd: [ID 615790 daemon.notice] "cacao" Failed to stay up.
- --
I've not had this problem before with another cluster I built using V210's, Solaris 10 and Sun Cluster 3.1. I have applied all the updates, apart from the Java runtime updates - if I do update Java I receive errors about having an incompatible runtime.
I'm assuming that something isn't running, or is killed before it should be.
Thank you in advance,
Pete

