scrconf -x 86400 -N node3 no response

Hi,

I tried to install Sun Cluster 3.2 on three Solaris 10 x86 (64-bit) servers, all of them Sun X4100s. I installed the OS patches and the Sun Cluster software on them, connected the private interconnect, and set up ssh public keys and the hosts file.

When I run scinstall on node1, it goes through all the menus ('Cluster Nodes', 'Cluster Transport Adapters and Cables'). In the last step, 'Cluster Creation', it reboots node3 and prints Waiting for "node3" to become a cluster member ..., but after node3 has rebooted there is still no response and it waits forever. Here's the log file:

Initializing cluster name to "clustername" ... done

Initializing authentication options ... done

Initializing configuration for adapter "e1000g2" ... done

Initializing configuration for adapter "e1000g3" ... done

Initializing configuration for switch "switch1" ... done

Initializing configuration for switch "switch2" ... done

Initializing configuration for cable ... done

Initializing configuration for cable ... done

Initializing private network address options ... done

Setting the node ID for "node3" ... done (id=1)

Checking for global devices global file system ... done

Updating vfstab ... done

Verifying that NTP is configured ... done

Initializing NTP configuration ... done

Updating nsswitch.conf ... done

Adding cluster node entries to /etc/inet/hosts ... done

Configuring IP multipathing groups ...done

Verifying that power management is NOT configured ... done

Ensure network routing is disabled ... done

Please reboot this machine.

Log file - /var/cluster/logs/install/scinstall.log.9605

SC_COMMAND_STATUS=0

ssh root@node3 -o "BatchMode yes" -n "/bin/sh -c '/usr/cluster/lib/scadmin/lib/cmd_test hasbooted; /bin/echo SC_COMMAND_STATUS=\$?'"

This node has not yet been booted as a cluster node.

SC_COMMAND_STATUS=0

Rebooting "node3" ...

ssh root@node3 -o "BatchMode yes" -n "/bin/sh -c '/usr/cluster/lib/scadmin/lib/cmd_reboot; /bin/echo SC_COMMAND_STATUS=\$?'"

SC_COMMAND_STATUS=0

scrconf -x 86400 -N node3

So node1 rebooted node3, node3 came back online, and I can see that the cluster node is online:

[root@node3:/ #]clnode status

=== Cluster Nodes ===

Node Status

Node Name    Status

node3        Online

But the scrconf command on node1 is still hanging.

Any ideas? Thanks.

[2494 byte] By [yyea] at [2007-11-27 0:28:45]
# 1
I've not seen this problem before, as far as I can remember. I don't remember having to set up ssh either. Are you sure that your ssh setup is completely correct? What happens if you temporarily disable the ssh setup?

Tim
Tim.Reada at 2007-7-11 22:30:26
# 2

I had the same problem.

Here is the solution (at least in my case): disable tcpwrappers.

Check to see if you see something similar to this in your /var/adm/messages file:

Apr 30 15:40:08 node2 rpcbind: [ID 702911 daemon.warning] refused connect from 172.31.65.190 to getport()
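A quick way to look for that kind of entry is to filter the log for rpcbind refusals. This is only a minimal sketch: the function name find_rpcbind_refusals is made up for illustration, and it assumes the messages look like the line quoted above.

```shell
#!/bin/sh
# Hypothetical helper (name is an assumption, not part of Sun Cluster):
# print rpcbind "refused connect" warnings from a log file.
# Defaults to /var/adm/messages, the file mentioned in the post.
find_rpcbind_refusals() {
    log="${1:-/var/adm/messages}"
    # Two plain greps keep this portable to older /bin/sh and grep versions.
    grep 'rpcbind' "$log" | grep 'refused connect'
}
```

Calling find_rpcbind_refusals with no argument on each node should show whether rpcbind has been rejecting connections from the other cluster nodes. No output does not prove tcpwrappers is off; it only means no such warning was logged in that file.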

Sun's JASS system hardening package ("fully supported resulting config") enables tcpwrappers and puts in very restrictive /etc/hosts.allow and /etc/hosts.deny files.

As soon as I removed these two files, the install continued.
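If you would rather not delete the files outright, something like the following moves them aside so they can be restored after the cluster is installed. Treat it as a rough sketch: the function name park_wrapper_files and the .pre-cluster suffix are invented for this example, and it assumes the files live in /etc as described above.

```shell
#!/bin/sh
# Hypothetical helper: rename hosts.allow and hosts.deny instead of removing them.
# The function name and the ".pre-cluster" suffix are assumptions for illustration.
park_wrapper_files() {
    dir="${1:-/etc}"
    for f in hosts.allow hosts.deny; do
        # Only touch files that actually exist in the target directory.
        if [ -f "$dir/$f" ]; then
            mv "$dir/$f" "$dir/$f.pre-cluster" || return 1
        fi
    done
}
```

Run as root, park_wrapper_files with no argument works on /etc. Renaming the .pre-cluster copies back restores the hardened configuration, at which point the cluster nodes' addresses would presumably need to be allowed in hosts.allow.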

--john

segfaultcoredumpa at 2007-7-11 22:30:26