Quorum device is offline

Setup

2 Node cluster

2 x Netra 440

1 x DotHill SANNet 200 SCSI Array (Equivalent to Sun's 3310 SCSI RAID)

2 x private nets (P2P)

2 x public nets

Solaris 9

Sun Cluster 3.1

The RAID

- is shared between the servers

- has one logical drive/one partition

I think I've set up the cluster correctly, but for some reason the output of scstat shows that the quorum device is offline. Any ideas? Log below:

# scstat -q

-- Quorum Summary --

  Quorum votes possible:      3
  Quorum votes needed:        2
  Quorum votes present:       2

-- Quorum Votes by Node --

                    Node Name           Present Possible Status
                    ---------           ------- -------- ------
  Node votes:       salsa               1        1       Online
  Node votes:       chips               1        1       Online

-- Quorum Votes by Device --

                    Device Name         Present Possible Status
                    -----------         ------- -------- ------
  Device votes:     /dev/did/rdsk/d6s2  0        1       Offline

[889 byte] By [dwal_007] at [2007-11-26 9:32:11]
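The numbers in the summary above explain why the cluster stays up even with the quorum device offline: a majority of the possible votes is needed, and the two node votes already reach it. A minimal sketch of that arithmetic, assuming the usual "more than half" majority rule (the function name votes_needed is made up for illustration):

```shell
#!/bin/sh
# Sketch only: majority rule assumed to be floor(possible / 2) + 1.
# votes_needed is a hypothetical helper, not a Sun Cluster command.
votes_needed() {
    expr "$1" / 2 + 1
}

possible=3   # 2 node votes + 1 quorum device vote (from scstat -q)
present=2    # both nodes online, quorum device offline

needed=`votes_needed "$possible"`
if [ "$present" -ge "$needed" ]; then
    echo "quorum held: $present of $possible votes present, $needed needed"
else
    echo "quorum lost: $present of $possible votes present, $needed needed"
fi
```

With the device offline, though, losing either node would leave only 1 of 3 votes, which is below the 2 needed.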
# 1

"1 x DotHill SANNet 200 SCSI Array (Equivalent to Sun's 3310 SCSI RAID)"

Equivalent to, but not the same as... that may well be the issue. There is a lot of firmware involved along the way, so it's hard to tell exactly what is going wrong. Have a look in /var/adm/messages and see if anything in there gives more insight.

Tim

TimRead at 2007-7-7 0:18:20
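One way to follow the suggestion above is to filter the system log for quorum-related lines. This is only a sketch: the helper name quorum_messages is made up, and matching on the word "quorum" is an assumption about how the relevant messages are worded.

```shell
#!/bin/sh
# Sketch only: quorum_messages is a hypothetical helper.
# $1 is the log file to search; defaults to /var/adm/messages.
quorum_messages() {
    grep -i 'quorum' "${1:-/var/adm/messages}"
}

# Typical use on a cluster node (run on each node in turn):
#   quorum_messages
#   quorum_messages /var/adm/messages.0
```

Older rotated logs (messages.0, messages.1, and so on) may be worth checking too if the problem started before the last rotation.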
# 2

I replaced the DotHill device with Sun StorEdge 3320 SCSI array but scstat still shows the device as offline despite repeated reboots. I see the following error in /var/adm/messages:

Sep 27 13:21:28 salsa cl_runtime: [ID 131640 kern.warning] WARNING: CMM: Reading reservation keys from quorum device /dev/did/rdsk/d6s2 failed.

I'm kinda stuck here. Any help would be greatly appreciated.

dwal_007 at 2007-7-7 0:18:20
# 3
Have you tried removing and then re-adding the quorum device?

HTH,

-ashu

ashu15 at 2007-7-7 0:18:20
# 4

That didn't really help. It's a 2 node cluster with one quorum device. I can't seem to remove the quorum device at all. Here is the log:

# scconf -c -q installmode

# scconf -r -q name=d6

scconf: Failed to remove quorum device (d6) - unexpected error.

# echo $?

16

#

dwal_007 at 2007-7-7 0:18:20
# 5
You need to add a new QD, then remove the old one. You can then re-add the old one and remove the new one.

Tim

TimRead at 2007-7-7 0:18:20
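The swap described above can be sketched as a dry run that only prints the commands in order, so they can be reviewed before anything is typed on a live cluster. Assumptions: d7 is a made-up DID name for a second shared disk, quorum_swap_plan is a hypothetical helper, and the globaldev= form of scconf -a -q / scconf -r -q should be checked against scconf(1M) on your release (the log earlier in the thread used name= instead).

```shell
#!/bin/sh
# Sketch only: prints the add/remove/re-add/remove sequence, runs nothing.
# $1 = DID name of the stuck quorum device, $2 = DID name of a temporary one.
quorum_swap_plan() {
    old="$1"
    tmp="$2"
    echo "scconf -a -q globaldev=$tmp"   # add the temporary quorum device
    echo "scconf -r -q globaldev=$old"   # remove the offline one
    echo "scconf -a -q globaldev=$old"   # re-add the original
    echo "scconf -r -q globaldev=$tmp"   # drop the temporary device
    echo "scstat -q"                     # confirm the device shows Online
}

# d6 is the device from this thread; d7 is a placeholder.
quorum_swap_plan d6 d7
```

Adding the temporary device first means the cluster never drops to zero quorum devices, which is presumably why the direct removal was refused.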
# 6
For those who may be interested:

- I re-installed the whole system (starting with the OS) to make the problem go away.

- Both the StorEdge 3320 and the DotHill SANNet II 200 SCSI Array work just fine.

dwal_007 at 2007-7-7 0:18:20
# 7

I had a similar problem after completely changing the shared disk infrastructure. I got to the point where the DID devices were recognized but the quorum device wouldn't go online. I added another shared device as a quorum device, removed the original, re-added the original, and then removed the temporary one.

It worked just fine this way.

st_w at 2007-7-7 0:18:20