V440 with 4 internal disks and an attached SE 3510 fails to boot

Hi All,

I have a Sun V440 server with four internal disks and an SE 3510 array connected to it.

I installed Solaris 9 9/04 (SunOS 5.9) on it and configured Solaris Volume Manager (SVM) for both the internal disks and the external SE 3510 disks. Then I installed Sun Cluster 3.1, followed by Oracle 10g and then my application, which created 4 resource groups on the cluster. The resource groups did not come online by default, which is not normal behaviour. When I tried to bring the resource groups online manually, it failed with the following errors:

Nov 26 08:45:31 emarn1-1 SC[SUNWscu.prenet_start]:RG_EMA_ORACLE:RES_EMA_ORACLE_HASTORAGE: File system check of /home (/dev/md/sogds/rdsk/d130) failed: (-4195137) /dev/md/sogds/rdsk/d130: BAD SUPER BLOCK: MAGIC NUMBER WRONG

Nov 26 08:45:31 emarn1-1 /dev/md/sogds/rdsk/d130: USE AN ALTERNATE SUPER-BLOCK TO SUPPLY NEEDED INFORMATION;

Nov 26 08:45:31 emarn1-1 /dev/md/sogds/rdsk/d130: eg. fsck [-F ufs] -o b=# [special ...]

Nov 26 08:45:31 emarn1-1 /dev/md/sogds/rdsk/d130: where # is the alternate super block. SEE fsck_ufs(1M).

Nov 26 08:45:31 emarn1-1 /dev/md/sogds/rdsk/d130: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.

Nov 26 08:45:31 emarn1-1 THE FOLLOWING FILE SYSTEM(S) HAD AN UNEXPECTED INCONSISTENCY: /dev/md/sogds/rdsk/d130 (/home).

Nov 26 08:45:31 emarn1-1 Cluster.RGM.rgmd: Method <hastorageplus_prenet_start> failed on resource <RES_EMA_ORACLE_HASTORAGE> in resource group <RG_EMA_ORACLE> [exit code <1>, time used: 0% of timeout <1800 seconds>]

Nov 26 08:45:31 emarn1-1 ip: TCP_IOC_ABORT_CONN: local = 010.034.159.028:0, remote = 000.000.000.000:0, start = -2, end = 6

Nov 26 08:45:31 emarn1-1 ip: TCP_IOC_ABORT_CONN: aborted 0 connection

Nov 26 08:45:32 emarn1-1 SC[SUNWscu.prenet_start]:RG_EMA_ORACLE:RES_EMA_ORACLE_HASTORAGE: File system check of /home (/dev/md/sogds/rdsk/d130) failed: (-4195137) /dev/md/sogds/rdsk/d130: BAD SUPER BLOCK: MAGIC NUMBER WRONG

Nov 26 08:45:32 emarn1-1 /dev/md/sogds/rdsk/d130: USE AN ALTERNATE SUPER-BLOCK TO SUPPLY NEEDED INFORMATION;

Nov 26 08:45:32 emarn1-1 /dev/md/sogds/rdsk/d130: eg. fsck [-F ufs] -o b=# [special ...]

Nov 26 08:45:32 emarn1-1 /dev/md/sogds/rdsk/d130: where # is the alternate super block. SEE fsck_ufs(1M).

Nov 26 08:45:32 emarn1-1 /dev/md/sogds/rdsk/d130: UNEXPECTED INCONSISTENCY; RUN fsck MANUALLY.

Nov 26 08:45:32 emarn1-1 THE FOLLOWING FILE SYSTEM(S) HAD AN UNEXPECTED INCONSISTENCY: /dev/md/sogds/rdsk/d130 (/home).

Nov 26 08:45:32 emarn1-1 Cluster.RGM.rgmd: Method <hastorageplus_prenet_start> failed on resource <RES_EMA_ORACLE_HASTORAGE> in resource group <RG_EMA_ORACLE> [exit code <1>, time used: 0% of timeout <1800 seconds>]

Nov 26 08:45:32 emarn1-1 ip: TCP_IOC_ABORT_CONN: local = 010.034.159.028:0, remote = 000.000.000.000:0, start = -2, end = 6

Nov 26 08:45:32 emarn1-1 ip: TCP_IOC_ABORT_CONN: aborted 0 connection
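
The fsck hint in the log above can be sketched as the commands below. This is only a sketch under assumptions: the device path is copied from the log, the backup superblock number 32 is the usual first UFS backup but has to be confirmed from the newfs -N output, and the script only prints the commands instead of running them. It also assumes d130 really holds a UFS file system. A wrong magic number can equally mean that no file system was ever created on the metadevice, in which case an alternate superblock will not help. The node presumably also has to own the sogds diskset before the metadevice is reachable.

```shell
#!/bin/sh
# Sketch only: print the commands for a UFS alternate-superblock check.
# Nothing here touches the disk; copy the printed commands by hand once verified.

DEV=/dev/md/sogds/rdsk/d130   # raw metadevice named in the log
ALT_SB=32                     # ASSUMPTION: first backup superblock; confirm with newfs -N

plan_fsck() {
    # $1 = raw device, $2 = alternate superblock number
    # newfs -N only reports the layout (including backup superblock
    # locations) that newfs would use; it does not write to the device.
    echo "newfs -N $1"
    # fsck with -o b=# reads the superblock copy at block #.
    echo "fsck -F ufs -o b=$2 $1"
}

plan_fsck "$DEV" "$ALT_SB"
```

The backup locations printed by newfs -N only match the existing file system if it was originally created with the same (default) parameters.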

I then restarted the system with scshutdown. When it booted again, the following errors appeared:

Rebooting with command: boot

Boot device: root_dsk File and args:

SunOS Release 5.9 Version Generic_118558-06 64-bit

Copyright 1983-2003 Sun Microsystems, Inc. All rights reserved.

Use is subject to license terms.

obpsym: symbolic debugging is available.

Hardware watchdog enabled

INIT: Cannot create /var/adm/utmpx

INIT: failed write of utmpx entry:" "

INIT: failed write of utmpx entry:" "

INIT: SINGLE USER MODE

After this I tried the following debugging steps:

i) I booted the system from CD-ROM with boot cdrom -s and ran fsck (for example fsck /dev/rdsk/c1t0d0s0) on root, /var, /globaldevices and the other partitions on both the primary and secondary disks. No errors were reported. I then rebooted the system, but got the same error.

ii) Then I tried to boot the system from the secondary disk, without success.

iii) Then I booted from CD-ROM again, moved the utmpx and wtmpx files aside with mv and created new ones with touch utmpx and touch wtmpx. I booted the system again, without success.

iv) Then I changed the permissions of the files and rebooted, without success.

v) Then I tried to boot the system with boot -x, without success.
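
One common cause of "INIT: Cannot create /var/adm/utmpx" is that /var (or root) is not mounted read-write at that point, not that the utmpx file itself is damaged. That is an assumption about this system, not a confirmed diagnosis. A minimal sketch of a check that can be run from the single-user shell follows; check_writable is a hypothetical helper name, not a Solaris command.

```shell
#!/bin/sh
# Sketch only: report whether a directory can be written to.
# check_writable is a hypothetical helper, not part of Solaris.

check_writable() {
    dir=$1
    probe="$dir/.write_probe.$$"
    # Try to create an empty probe file in a subshell, so that a failed
    # redirection cannot terminate the calling shell.
    if ( : > "$probe" ) 2>/dev/null; then
        rm -f "$probe"
        echo "$dir: writable"
    else
        echo "$dir: NOT writable"
        return 1
    fi
}

# Example (run by hand in single-user mode):
#   check_writable /var/adm
```

If /var/adm turns out not to be writable, the output of mount and df -k /var shows whether /var is mounted at all and with which options.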

I have done a lot of configuration on this system (SVM, Oracle, Sun Cluster, HA and the SE 3510), so I do not want to lose it.

Can somebody please suggest a way to restore the system without losing any of the above installations?

I will really appreciate your help.

Regards

ashraf

[4812 byte] By [elcashraf] at [2007-11-26 11:43:39]
# 1
It is probably not your problem, but don't mount anything on /home. Usually, /home is automounted.
y_p at 2007-7-7 11:51:32