Sun Cluster 2.2 - accclr2 ID

Dear

I have the following cluster environment.

Hardware:

Nodes: 2 x Sun Fire 420R; storage: 2 x StorEdge D1000

Console: Sun Blade 100

Software OE: Solaris 2.6

Cluster: Sun Cluster 2.2, parallel clustering

Disk suite: Solstice DiskSuite 4.2

Logical disk groups: Importdg and Exportdg

DBMS: Oracle 8i

Is this a bug, or what is causing the error and the takeover?

Error messages

Dec 5 03:15:29 accclr2 last message repeated 4 times

Dec 5 03:15:32 accclr2 ID[SUNWcluster.ha.haoracle_fmon.2050]: import:custblr2: db_evaluate: Oracle background process died

Dec 5 03:15:32 accclr2 ID[SUNWcluster.ha.haoracle_fmon.2090]: import:custblr2: db_evaluate: initiating DBMS service takeover

Dec 5 03:15:40 accclr2 ID[SUNWcluster.loghost.1010]: Giving up logical host import

Dec 5 03:15:41 accclr2 ID[SUNWcluster.loghost.1050]: fm_stop method of data service oracle completed successfully.

Dec 5 03:15:42 accclr2 ID[SUNWcluster.ha.oracle.fault_mon.2001]: import:custblr2: starting shutdown immediate for oracle instance custblr2

Dec 5 03:15:43 accclr2 ID[SUNWcluster.ha.oracle.fault_mon.2005]: import:custblr2: Shutdown immediate for oracle instance custblr2 completed

Dec 5 03:15:44 accclr2 ID[SUNWcluster.loghost.1050]: stop_net method of data service oracle completed successfully.

Dec 5 03:15:44 accclr2 ID[SUNWcluster.loghost.1050]: stop method of data service oracle completed successfully.

Dec 5 03:15:45 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /user succeeded

Dec 5 03:15:46 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /impuser succeeded

Dec 5 03:15:47 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /import succeeded

Dec 5 03:15:48 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /imphist succeeded

Dec 5 03:15:49 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /impdata succeeded

Dec 5 03:15:50 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /hist1 succeeded

Dec 5 03:15:51 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /data2 succeeded

Dec 5 03:15:52 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /data succeeded

Dec 5 03:15:54 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /custom succeeded

Dec 5 03:15:54 accclr2 ID[SUNWcluster.scnfs.4012]: umount of /back1 succeeded

Dec 5 03:15:54 accclr2 ID[SUNWcluster.loghost.6310]: Check diskset ownership for importdg

Dec 5 03:15:55 accclr2 ID[SUNWcluster.loghost.6330]: Relinquishing ownership of diskset importdg

Dec 5 03:15:55 accclr2 ID[SUNWcluster.loghost.4027]: Diskset relinquish stage complete

Dec 5 03:15:55 accclr2 ID[SUNWcluster.loghost.1020]: Give up of logical host import succeeded

Dec 5 03:15:56 accclr2 ID[SUNWcluster.loghost.1050]: start method of data service oracle completed successfully.

Dec 5 03:15:57 accclr2 ID[SUNWcluster.loghost.1050]: start_net method of data service oracle completed successfully.

Dec 5 03:15:57 accclr2 ID[SUNWcluster.loghost.1050]: fm_init method of data service oracle completed successfully.

Dec 5 03:16:06 accclr2 ID[SUNWcluster.loghost.1050]: fm_start method of data service oracle completed successfully.

Dec 5 03:16:06 accclr2 ID[SUNWcluster.ccd.ccdd.5104]: Error CCD_UNFREEZE_ACK 0x(71e80) type = 2007 error = 68: svr error: 22

Dec 5 03:16:06 accclr2 ID[SUNWcluster.scccd.add.4001]: error executing the freeze cmd - LOGHOST_CM lname:curr_master import:accclr1

Dec 5 03:16:07 accclr2 ID[SUNWcluster.loghost.1050]: fm_stop method of data service oracle completed successfully.

Dec 5 03:16:09 accclr2 ID[SUNWcluster.loghost.1050]: stop_net method of data service oracle completed successfully.

Dec 5 03:16:09 accclr2 ID[SUNWcluster.loghost.1050]: stop method of data service oracle completed successfully.

Dec 5 03:16:13 accclr2 ID[SUNWcluster.loghost.1030]: Taking over logical host import

Dec 5 03:16:13 accclr2 ID[SUNWcluster.loghost.6310]: Check diskset ownership for importdg

Dec 5 03:16:13 accclr2 ID[SUNWcluster.loghost.6315]: Diskset importdg not owned by this node

Dec 5 03:16:13 accclr2 ID[SUNWcluster.loghost.6320]: Taking ownership of diskset importdg

Dec 5 03:16:21 accclr2 ID[SUNWcluster.scnfs.3045]: fsck /dev/md/importdg/rdsk/d4 /dev/md/importdg/rdsk/d2 /dev/md/importdg/rdsk/d1 /dev/md/importdg/rdsk/d3 /dev/md/importdg/rdsk/d5 /dev/md/importdg/rdsk/d75 /dev/md/importdg/rdsk/d74 /dev/md/importdg/rdsk/d7 /dev/md/importdg/rdsk/d73 /dev/md/importdg/rdsk/d6

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3061]: fsck (ufs) complete

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /back1

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /custom

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /data

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /data2

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /hist1

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /impdata

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /imphist

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /import

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /impuser

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mount /user

Dec 5 03:16:22 accclr2 ID[SUNWcluster.scnfs.3041]: mounting finished

Dec 5 03:16:23 accclr2 ID[SUNWcluster.loghost.1050]: start method of data service oracle completed successfully.

Dec 5 03:16:24 accclr2 ID[SUNWcluster.ha.oracle.start_net.2010]: import:custblr2: starting up Oracle Listener

Dec 5 03:16:26 accclr2 ID[SUNWcluster.loghost.1050]: start_net method of data service oracle completed successfully.

Dec 5 03:16:26 accclr2 ID[SUNWcluster.ha.oracle.start_net.2000]: import:custblr2: Starting up instance custblr2, PFILE=/data/oracle/dbs/initcustblr2.ora

Dec 5 03:16:27 accclr2 ID[SUNWcluster.loghost.1050]: fm_init method of data service oracle completed successfully.

Dec 5 03:16:27 accclr2 ID[SUNWcluster.loghost.1040]: Take over of logical host import succeeded

Dec 5 03:16:29 accclr2 ID[SUNWcluster.loghost.1050]: fm_start method of data service oracle completed successfully.

Dec 5 03:18:18 accclr2 ID[SUNWcluster.ha.haoracle_fmon.4520]: export:custblr1: cannot find the SQL*Net service for custblr1

Dec 5 03:22:33 accclr2 last message repeated 5 times

Dec 5 03:23:33 accclr2 ID[SUNWcluster.ha.haoracle_fmon.4520]: export:custblr1: cannot find the SQL*Net service for custblr1
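
To read a log like this as a timeline, the entries can be parsed and the interval between two events measured. The following is only a minimal sketch in Python, assuming each entry has first been joined onto a single line. The names `parse_entry`, `seconds_between`, `SAMPLE` and `ASSUMED_YEAR` are hypothetical, and the year is a placeholder because syslog does not record one.

```python
import re
from datetime import datetime

# Matches entries of the form:
#   Dec 5 03:15:40 accclr2 ID[SUNWcluster.loghost.1010]: Giving up logical host import
ENTRY = re.compile(
    r"^(?P<stamp>\w{3}\s+\d+\s+\d\d:\d\d:\d\d)\s+(?P<host>\S+)\s+"
    r"ID\[(?P<tag>[\w.]+)\]:\s*(?P<msg>.*)$"
)

# Assumption: syslog omits the year, so a placeholder is used.
# Only differences between timestamps are meaningful.
ASSUMED_YEAR = "2000"


def parse_entry(line):
    """Return (time, host, tag, message), or None if the line does not match."""
    m = ENTRY.match(line.strip())
    if m is None:
        return None  # e.g. "last message repeated 4 times"
    when = datetime.strptime(
        ASSUMED_YEAR + " " + m.group("stamp"), "%Y %b %d %H:%M:%S"
    )
    return when, m.group("host"), m.group("tag"), m.group("msg")


def seconds_between(lines, first_text, second_text):
    """Seconds from the first entry containing first_text to the next
    entry containing second_text, or None if either is missing."""
    start = None
    for line in lines:
        entry = parse_entry(line)
        if entry is None:
            continue
        when, _host, _tag, msg = entry
        if start is None and first_text in msg:
            start = when
        elif start is not None and second_text in msg:
            return (when - start).total_seconds()
    return None


# Three entries copied from the log above, each joined onto one line.
SAMPLE = [
    "Dec 5 03:15:32 accclr2 ID[SUNWcluster.ha.haoracle_fmon.2050]: "
    "import:custblr2: db_evaluate: Oracle background process died",
    "Dec 5 03:15:40 accclr2 ID[SUNWcluster.loghost.1010]: "
    "Giving up logical host import",
    "Dec 5 03:16:27 accclr2 ID[SUNWcluster.loghost.1040]: "
    "Take over of logical host import succeeded",
]
```

For the three sample entries, `seconds_between(SAMPLE, "Oracle background process died", "Take over of logical host import succeeded")` gives 55.0, that is, 55 seconds from the fault monitor noticing the dead process to the logical host being back.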

Thanks :)

Mohammed Tanvir

[6962 byte] By [Tanvir@SCSA] at [2007-11-26 11:55:36]
# 1

Boy oh boy ... Sun Cluster 2.2!

Looks like the root cause was the failure of the Oracle process:

Dec 5 03:15:32 accclr2 ID[SUNWcluster.ha.haoracle_fmon.2050]: import:custblr2: db_evaluate: Oracle background process died

Dec 5 03:15:32 accclr2 ID[SUNWcluster.ha.haoracle_fmon.2090]: import:custblr2: db_evaluate: initiating DBMS service takeover

So this is to be expected and is not a bug (as far as I can tell). It would be worth digging through the Oracle logs to see why the Oracle process died though.
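
One way to start on the Oracle side is to pull every ORA- error code out of the instance's alert log and then read the entries around 03:15. Below is a rough sketch in Python, not a definitive tool. `find_ora_errors` and `sample` are hypothetical names, and the sample lines are invented purely to show the output shape.

```python
import re

# Oracle error codes have the form "ORA-" followed by digits.
ORA_CODE = re.compile(r"\bORA-\d{3,5}\b")


def find_ora_errors(lines):
    """Return (line_number, code, text) for every ORA- code found."""
    hits = []
    for number, line in enumerate(lines, start=1):
        for code in ORA_CODE.findall(line):
            hits.append((number, code, line.strip()))
    return hits


# Hypothetical lines, invented for illustration only (not from this system).
sample = [
    "an ordinary informational line",
    "ORA-01234: hypothetical error text",
]
```

On a real system the function would be given the open alert log file instead of `sample`. The alert log normally lives in the directory named by the instance's `background_dump_dest` parameter. The exact path on this cluster is not known from the thread.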

Tim

TimRead at 2007-7-7 12:14:22
# 2

Dear Sir,

Could it be due to a hardware error, such as a failing hard disk?

Our DBA says that when Oracle tries to write data to a hard disk and the write fails because of bad blocks, the Oracle process dies automatically.

But there are no concrete errors in the logs.
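
For reference, this is roughly how the system messages could be searched for disk trouble. It is a small Python sketch under stated assumptions: the patterns in `DISK_PATTERNS` are guesses at typical SCSI driver and DiskSuite wording and would need adjusting to what this system actually logs, and `disk_warnings` is a hypothetical helper name.

```python
import re

# Assumed patterns only; adjust them to the wording the disk driver and
# DiskSuite really use on the system being checked.
DISK_PATTERNS = [
    re.compile(r"Error for Command", re.IGNORECASE),
    re.compile(r"Media Error", re.IGNORECASE),
    re.compile(r"SCSI transport failed", re.IGNORECASE),
    re.compile(r"\bmd: .*error", re.IGNORECASE),
]


def disk_warnings(lines):
    """Return the lines that match any of the assumed disk error patterns."""
    return [
        line.rstrip("\n")
        for line in lines
        if any(p.search(line) for p in DISK_PATTERNS)
    ]
```

Run over the cluster entries posted above, this would return nothing, since none of them matches these patterns. That fits with no disk errors having been logged, but it does not rule out a hardware fault.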

Thanks and Regards

S.Ramesh

Ramana@903 at 2007-7-7 12:14:22