Quorum device is offline
Setup
2 Node cluster
2 x Netra 440
1 x DotHill SANNet 200 SCSI Array (Equivalent to Sun's 3310 SCSI RAID)
2 x private nets (P2P)
2 x public nets
Solaris 9
Sun Cluster 3.1
The RAID
- is shared between the servers
- has one logical drive/one partition
I think I've set up the cluster correctly, but for some reason the output of scstat shows the quorum device as offline. Any ideas? Log below:
# scstat -q
-- Quorum Summary --
  Quorum votes possible:    3
  Quorum votes needed:      2
  Quorum votes present:     2
-- Quorum Votes by Node --
                  Node Name             Present  Possible  Status
                  ---------             -------  --------  ------
  Node votes:     salsa                 1        1         Online
  Node votes:     chips                 1        1         Online
-- Quorum Votes by Device --
                  Device Name           Present  Possible  Status
                  -----------           -------  --------  ------
  Device votes:   /dev/did/rdsk/d6s2    0        1         Offline
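The numbers in the quorum summary follow a simple majority rule: the cluster needs more than half of the possible votes. A minimal sketch of that arithmetic, using the vote counts from the output above (the variable names are only illustrative):

```shell
#!/bin/sh
# Vote counts read off the scstat -q output above.
node_votes=2      # salsa and chips, one vote each
device_votes=1    # quorum device d6s2, one possible vote

possible=`expr $node_votes + $device_votes`
needed=`expr $possible / 2 + 1`    # strict majority of the possible votes

# The device is offline, so only the node votes are present.
present=$node_votes

echo "possible=$possible needed=$needed present=$present"
if [ "$present" -ge "$needed" ]; then
    echo "quorum held"
else
    echo "quorum lost"
fi
```

With both nodes up the cluster still holds quorum, but if either node fails while the device is offline, the votes present drop to 1, below the 2 needed.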
By dwal_007 at 2007-11-26 9:32:11

# 1
1 x DotHill SANNet 200 SCSI Array (Equivalent to Sun's 3310 SCSI RAID)
Equivalent to, but not identical... that may well be the issue. There is a lot of firmware involved along the way, so it's hard to tell exactly what is going wrong. Have a look in /var/adm/messages and see whether anything in there gives more insight.
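One way to narrow down the log is to filter it for quorum-related lines. A minimal sketch, assuming the default Solaris log location; the function name quorum_warnings is hypothetical:

```shell
#!/bin/sh
# Print quorum-related lines from a syslog file.
# quorum_warnings is a hypothetical helper name, not a cluster command.
# With no argument it reads /var/adm/messages.
quorum_warnings() {
    logfile=${1:-/var/adm/messages}
    grep -i 'quorum' "$logfile"
}

# Example calls on a cluster node:
#   quorum_warnings                        # current log
#   quorum_warnings /var/adm/messages.0    # previous, rotated log
```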
Tim
# 2
I replaced the DotHill device with a Sun StorEdge 3320 SCSI array, but scstat still shows the device as offline despite repeated reboots. I see the following error in /var/adm/messages:
Sep 27 13:21:28 salsa cl_runtime: [ID 131640 kern.warning] WARNING: CMM: Reading reservation keys from quorum device /dev/did/rdsk/d6s2 failed.
I'm kinda stuck here. Any help would be greatly appreciated.
# 4
That didn't really help. It's a 2 node cluster with one quorum device. I can't seem to remove the quorum device at all. Here is the log:
# scconf -c -q installmode
# scconf -r -q name=d6
scconf: Failed to remove quorum device (d6) - unexpected error.
# echo $?
16
#
# 7
I had a similar problem after completely changing the shared disk infrastructure. I got to the point where the DID devices were recognized but the quorum device wouldn't go online. I added another shared device as a quorum device, removed the original, re-added the original, and then removed the temporary one.
It worked just fine this way.
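The swap described above can be sketched as a dry run that only prints the commands in order. This assumes the Sun Cluster 3.1 scconf syntax with globaldev=; d6 is the device from this thread, while d7 is a hypothetical second shared DID device, so substitute whatever your own DID listing shows:

```shell
#!/bin/sh
# Dry run of the quorum swap: prints the commands, executes nothing.
# ORIG is the device from this thread; TEMP (d7) is a hypothetical
# second shared DID device used only for illustration.
ORIG=${ORIG:-d6}
TEMP=${TEMP:-d7}

swap_plan() {
    echo "scconf -a -q globaldev=$TEMP"    # add the temporary quorum device
    echo "scconf -r -q globaldev=$ORIG"    # remove the original
    echo "scconf -a -q globaldev=$ORIG"    # add the original back
    echo "scconf -r -q globaldev=$TEMP"    # remove the temporary device
    echo "scstat -q"                       # check the device status afterwards
}

swap_plan
```

Adding the temporary device first means the cluster always has a quorum device configured while the original is taken out and put back.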
st_w at 2007-7-7 0:18:20
