system panic but I don't know ...help me please..

Dear All,

I have a Sunfire 6800 System,From past last month i get the following error messages and i force system power off after power on.The system panics,When i see the /var/adm/messages,I find error for a particular uncorectable error ...

Oct 28 09:21:45 cis unix: [ID 251936 kern.notice] panic: ptl1 trap reason 0x2

Oct 28 09:21:45 cis unix: [ID 554257 kern.notice] TL=0x1 TT=0x68 TICK=0xcc10d9f2d15

Oct 28 09:21:45 cis unix: [ID 860431 kern.notice] TPC=0x1014b558 TnPC=0x1014b55c TSTATE=0x80001600

Oct 28 09:21:45 cis unix: [ID 554257 kern.notice] TL=0x2 TT=0x68 TICK=0xcc10d9f2d10

Oct 28 09:21:45 cis unix: [ID 860431 kern.notice] TPC=0x10007098 TnPC=0x1000709c TSTATE=0x9180001501

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 569561 kern.warning] WARNING: [AFT1] Corrected system bus (CE) Event detected by CPU2 at TL=0, errID 0x0003fc54.41bece50

Oct 28 09:21:45 cisAFSR 0x00004002<IVC,CE>.0000004a AFAR 0x00000000.1d970010

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a AMBIGUOUS /N0/SB0/P0/B0/D2 J13500

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 170778 kern.notice] [AFT1] errID 0x0003fc54.41bece50 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 991182 kern.warning] WARNING: [AFT1] Corrected system bus (CE) Event detected by CPU5 at TL=0, errID 0x0003fc54.41bedee0

Oct 28 09:21:45 cisAFSR 0x00004002<IVC,CE>.0000004a AFAR 0x00000000.24289530

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a AMBIGUOUS /N0/SB0/P0/B0/D2 J13500

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 910516 kern.notice] [AFT1] errID 0x0003fc54.41bedee0 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 581817 kern.warning] WARNING: [AFT1] Corrected system bus (CE) Event detected by CPU0 at TL=0, errID 0x0003fc54.41bed170

Oct 28 09:21:45 cisAFSR 0x00004002<IVC,CE>.0000004a AFAR 0x00000000.1d970010

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a AMBIGUOUS /N0/SB0/P0/B0/D2 J13500

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 178751 kern.notice] [AFT1] errID 0x0003fc54.41bed170 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 394823 kern.warning] WARNING: [AFT1] IVC Event detected by CPU2 at TL=0, errID 0x0003fc54.41bece50

Oct 28 09:21:45 cisAFSR 0x00004002<IVC,CE>.0000004a AFAR 0x00000000.1d970010 INVALID

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a AMBIGUOUS unum not available

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 170778 kern.notice] [AFT1] errID 0x0003fc54.41bece50 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 579844 kern.warning] WARNING: [AFT1] IVC Event detected by CPU1 at TL=0, errID 0x0003fc54.41bed8f0

Oct 28 09:21:45 cisAFSR 0x00104000<PRIV,IVC>.0000004a AFAR 0x00000000.12ca4740 INVALID

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a unum not available

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 394639 kern.notice] [AFT1] errID 0x0003fc54.41bed8f0 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 797313 kern.warning] WARNING: [AFT1] Corrected system bus (CE) Event detected by CPU3 at TL=0, errID 0x0003fc54.41bed5d0

Oct 28 09:21:45 cisAFSR 0x00004002<IVC,CE>.0000004a AFAR 0x00000000.04056dd0

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a AMBIGUOUS /N0/SB0/P3/B0/D2 J16500

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 366602 kern.notice] [AFT1] errID 0x0003fc54.41bed5d0 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 639397 kern.warning] WARNING: [AFT1] IVC Event detected by CPU5 at TL=0, errID 0x0003fc54.41bedee0

Oct 28 09:21:45 cisAFSR 0x00004002<IVC,CE>.0000004a AFAR 0x00000000.24289530 INVALID

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a AMBIGUOUS unum not available

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 910516 kern.notice] [AFT1] errID 0x0003fc54.41bedee0 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 347170 kern.warning] WARNING: [AFT1] IVC Event detected by CPU0 at TL=0, errID 0x0003fc54.41bed170

Oct 28 09:21:45 cisAFSR 0x00004002<IVC,CE>.0000004a AFAR 0x00000000.1d970010 INVALID

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a AMBIGUOUS unum not available

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 178751 kern.notice] [AFT1] errID 0x0003fc54.41bed170 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 581288 kern.warning] WARNING: [AFT1] IVC Event detected by CPU3 at TL=0, errID 0x0003fc54.41bed5d0

Oct 28 09:21:45 cisAFSR 0x00004002<IVC,CE>.0000004a AFAR 0x00000000.04056dd0 INVALID

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a AMBIGUOUS unum not available

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 366602 kern.notice] [AFT1] errID 0x0003fc54.41bed5d0 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 782390 kern.warning] WARNING: [AFT1] Corrected system bus (CE) Event detected by CPU6 at TL=0, errID 0x0003fc54.41bff4b0

Oct 28 09:21:45 cisAFSR 0x00000002<CE>.0000004a AFAR 0x00000000.02d4a550

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a /N0/SB0/P1/B0/D2 J14500

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 520440 kern.notice] [AFT1] errID 0x0003fc54.41bff4b0 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis platmod: [ID 373414 kern.info] NOTICE: enque_ecc_mailbox_msg: sg_mbox retval 145 msg_status 145

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 237223 kern.warning] WARNING: [AFT1] Corrected system bus (CE) Event detected by CPU7 at TL=0, errID 0x0003fc54.41c00040

Oct 28 09:21:45 cisAFSR 0x00000002<CE>.0000004a AFAR 0x00000000.02d4a550

Oct 28 09:21:45 cisFault_PC 0x0 Esynd 0x004a /N0/SB0/P1/B0/D2 J14500

Oct 28 09:21:45 cis SUNW,UltraSPARC-III: [ID 642625 kern.notice] [AFT1] errID 0x0003fc54.41c00040 Data Bit 15 was in error and corrected

Oct 28 09:21:45 cis unix: [ID 836849 kern.notice]

Oct 28 09:21:45 cis ^Mpanic[cpu4]/thread=300249ecee0:

Oct 28 09:21:45 cis unix: [ID 715043 kern.notice] Kernel panic at trap level 2

Oct 28 09:21:45 cis unix: [ID 100000 kern.notice]

Oct 28 09:21:45 cis genunix: [ID 723222 kern.notice] 000000001040c1f0 unix:sys_tl1_panic+8 (2a107ee24c8, 300024bfeb0, 1d8, 43452000, 81010100, 2a107ee23e8)

Oct 28 09:21:45 cis genunix: [ID 179002 kern.notice]%l0-3: 0000000000000000 0000000000001400 0000000080001600 000000001000723c

Oct 28 09:21:45 cis%l4-7: 000000000000ff00 0000000001010000 000000000000000f 000000001040c2a0

Oct 28 09:21:45 cis genunix: [ID 723222 kern.notice] 000000001040c340 genunix:errorq_dispatch+88 (30000b4a650, 1, 3000252c330, 0, 81010100, 2a107ee24c8)

Oct 28 09:21:45 cis genunix: [ID 179002 kern.notice]%l0-3: 00000000000001d8 0000030000b4a650 0000000000001606 00000000101452d4

Oct 28 09:21:45 cis%l4-7: 000000000000ff00 00000000fefefeff 000000000000000e 000002a107ee3f90

Oct 28 09:21:45 cis genunix: [ID 723222 kern.notice] 000002a107ee21b0 SUNW,UltraSPARC-III:cpu_queue_one_event+160 (5cb9e81be903a44d, fd32839c84dd9025, 607901fb1a74a38, 774238532fdc1785, f9587275d977e29e, 7461d0adf979de9e)

Oct 28 09:21:45 cis genunix: [ID 179002 kern.notice]%l0-3: 2a9d3740ab413457 6b59a093e004e4d8 67421e9f8d1a9b79 c09edf72bff2b015

Oct 28 09:21:45 cis%l4-7: f6c906b2ba80e11c 6bae290f6c3cffc7 df558ea16f7f1ffd b0c984f91dd60f17

Oct 28 09:21:45 cis unix: [ID 100000 kern.notice]

Oct 28 09:21:45 cis genunix: [ID 672855 kern.notice] syncing file systems...

Oct 28 09:22:24 cis unix: [ID 836849 kern.notice]

Oct 28 09:22:24 cis ^Mpanic[cpu4]/thread=300249ecee0:

Oct 28 09:22:24 cis unix: [ID 715357 kern.notice] panic sync timeout

Oct 28 09:22:24 cis unix: [ID 100000 kern.notice]

Oct 28 09:22:24 cis genunix: [ID 353387 kern.notice] dumping to /dev/dsk/c0t0d0s1, offset 1719205888

Oct 28 09:23:39 cis genunix: [ID 409368 kern.notice] ^M100% done: 139253 pages dumped, compression ratio 2.84,

Oct 28 09:23:39 cis genunix: [ID 851671 kern.notice] dump succeeded

what cause make this system panic...?

i catch <IVC,CE>.

i guess CPU to CPU interrupt error...

is this called H/W or OS error ?

help me please~~

Following is the o/p of /var/adm/messages..

[10390 byte] By [shfehfl] at [2007-11-25 22:47:46]
# 1

Did you at least review the post (<a href="http&#58;&#47;&#47;supportforum.sun.com/hardware/index.php?t= msg&amp;th=5349&amp;start=0&amp;rid=23&amp;SQ=472cc143c5e819e5ce 416e8ba91125f4" target="_blank"><b>sticky: If you have question(s) on SunFire Midrange systems.</b></a>) at the top of this forum ?

Open a service case with Sun.

Michael

maal at 2007-7-5 17:03:09 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 2
On a little nudge you can try and run the CEDIAG tool that should be freely downloadable. This will analyse the CE errors in the messages files for you and simplify the results a little.
stumoor at 2007-7-5 17:03:09 > top of Java-index,Sun Hardware,Servers - General Discussion...