V440 machine with 4 internal disks and connected with SE 3510 booting
Hi All,
I have a SUN V440 server with four internal disks and SE 3510 is connected with it.
I have installed Solaris OS 5.9 09/04 on it, configured Solaris Volume Manager (SVM) for internal as well as for SE 3510 external disks. Than we installed sun cluster 3.1 on it. After that i installed Oracle 10g on it. After that i installed my application on it , which created 4 Resource Groups on the sun cluster, but Resource Groups were not online by default which was not a normal behaviour. I tried to put resource groups online manually, it was not successful and following error came
Nov 26 08:45:31 emarn1-1 SC[SUNWscu.prenet_start]:RG_EMA_ORACLE:RES_EMA_ORACLE_HASTORAGE: File system check of /home (/dev/md/sogds/rdsk/d130) failed: (-4195137) /dev/md/sogds/rdsk/d130: BAD SUPER BLOCK: MAGIC NUMBER WRONG
Nov 26 08:45:31 emarn1-1 /dev/md/sogds/rdsk/d130: USE AN ALTERNATE SUPER-BLOCK TO SUPPLY NEEDED INFORMATION;
Nov 26 08:45:31 emarn1-1 /dev/md/sogds/rdsk/d130: eg. fsck [-F ufs] -o b=# [special ...]
Nov 26 08:45:31 emarn1-1 /dev/md/sogds/rdsk/d130: where # is the alternate super block. SEE fsck_ufs(1M).
Nov 26 08:45:31 emarn1-1 /dev/md/sogds/rdsk/d130: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.
Nov 26 08:45:31 emarn1-1 THE FOLLOWING FILE SYSTEM(S) HAD AN UNEXPECTED INCONSISTENCY: /dev/md/sogds/rdsk/d130 (/home).
Nov 26 08:45:31 emarn1-1 Cluster.RGM.rgmd: Method <hastorageplus_prenet_start> failed on resource <RES_EMA_ORACLE_HASTORAGE> in resource group <RG_EMA_ORACLE> [exit code <1>, time used: 0% of timeout <1800 seconds>]
Nov 26 08:45:31 emarn1-1 ip: TCP_IOC_ABORT_CONN: local = 010.034.159.028:0, remote = 000.000.000.000:0, start = -2, end = 6
Nov 26 08:45:31 emarn1-1 ip: TCP_IOC_ABORT_CONN: aborted 0 connection
Nov 26 08:45:32 emarn1-1 SC[SUNWscu.prenet_start]:RG_EMA_ORACLE:RES_EMA_ORACLE_HASTORAGE: File system check of /home (/dev/md/sogds/rdsk/d130) failed: (-4195137) /dev/md/sogds/rdsk/d130: BAD SUPER BLOCK: MAGIC NUMBER WRONG
Nov 26 08:45:32 emarn1-1 /dev/md/sogds/rdsk/d130: USE AN ALTERNATE SUPER-BLOCK TO SUPPLY NEEDED INFORMATION;
Nov 26 08:45:32 emarn1-1 /dev/md/sogds/rdsk/d130: eg. fsck [-F ufs] -o b=# [special ...]
Nov 26 08:45:32 emarn1-1 /dev/md/sogds/rdsk/d130: where # is the alternate super block. SEE fsck_ufs(1M).
Nov 26 08:45:32 emarn1-1 /dev/md/sogds/rdsk/d130: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.
Nov 26 08:45:32 emarn1-1 THE FOLLOWING FILE SYSTEM(S) HAD AN UNEXPECTED INCONSISTENCY: /dev/md/sogds/rdsk/d130 (/home).
Nov 26 08:45:32 emarn1-1 Cluster.RGM.rgmd: Method <hastorageplus_prenet_start> failed on resource <RES_EMA_ORACLE_HASTORAGE> in resource group <RG_EMA_ORACLE> [exit code <1>, time used: 0% of timeout <1800 seconds>]
Nov 26 08:45:32 emarn1-1 ip: TCP_IOC_ABORT_CONN: local = 010.034.159.028:0, remote = 000.000.000.000:0, start = -2, end = 6
Nov 26 08:45:32 emarn1-1 ip: TCP_IOC_ABORT_CONN: aborted 0 connection
What i did, i just restarted the system by scshutdown, after that when i booted the system, following error comes
Rebooting with command: boot
Boot device: root_dsk File and args:
SunOS Release 5.9 Version Generic_118558-06 64-bit
Copyright 1983-2003 Sun Microsystems, Inc. All rights reserved.
Use is subject to license terms.
obpsym: symbolic debugging is available.
Hardware watchdog enabled
INIT: Cannot create /var/adm/utmpx
INIT: failed write of utmpx entry:" "
INIT: failed write of utmpx entry:" "
INIT: SINGLE USER MODE
I performed debugging steps after this
i) I booted the system from cdrom with boot cdrom -s and performed fsck /dev/rdsk/c1t0d0s0 for root , /var , /globaldevices and other partitions on both primary and secondary disks. There were no error reported. After this, i rebooted the system but was no success, same error
ii) Than i tried to reboot the system from secondary disk, no success
iii) Than booted the system from cdrom again and tried to mv the utmpx & wtmpx files and created new files with touch utmpx & touch wtmpx command. Again booted the system but no success
iv) Than i changed the permissions of the files and tried to reboot but no success.
v) Than i tried to reboot the system with boot -x command but no success.
As i have done lot of configuration on the system, SVM, Oracle, Sun cluster, HA and SE 3510 configurations, so i do not want to loose these things.
Can some body please suggest any way to restore the system successfully without loosing any of the above installations.
I will really appreciate your help.
Regards
ashraf

