Solaris 10 on a T2000 panics with a BAD TRAP every day :( ... does Sol10 suck?

This is an alignment error trap, but there are also ip module errors:

Nov 23 15:14:09 d1s1 unix: [ID 836849 kern.notice]

Nov 23 15:14:09 d1s1 ^Mpanic[cpu18]/thread=2a101a35cc0:

Nov 23 15:14:09 d1s1 unix: [ID 799565 kern.notice] BAD TRAP: type=34 rp=2a101a353a0 addr=4dbd3e4ccfd7 mmu_fsr=0

Nov 23 15:14:09 d1s1 unix: [ID 100000 kern.notice]

Nov 23 15:14:09 d1s1 unix: [ID 839527 kern.notice] sched:

Nov 23 15:14:09 d1s1 unix: [ID 123557 kern.notice] alignment error:

Nov 23 15:14:09 d1s1 unix: [ID 381800 kern.notice] addr=0x4dbd3e4ccfd7

Nov 23 15:14:09 d1s1 unix: [ID 101969 kern.notice] pid=0, pc=0x13f9044, sp=0x2a101a34c41, tstate=0x80001600, context=0x0

Nov 23 15:14:09 d1s1 unix: [ID 743441 kern.notice] g1-g7: c, 10, ffffffffffffffff, 865, 6f, 0, 2a101a35cc0

Nov 23 15:14:09 d1s1 unix: [ID 100000 kern.notice]

Nov 23 15:14:09 d1s1 genunix: [ID 723222 kern.notice] 000002a101a350c0 unix:die+9c (34, 2a101a353a0, 4dbd3e4ccfd7, 0, 2a101a35180, c1e00000)

Nov 23 15:14:10 d1s1 genunix: [ID 179002 kern.notice]%l0-3: 0000000000000000 0000000000000034 0000000000000034 0000000000000000

Nov 23 15:14:10 d1s1%l4-7: 0000000000000000 000006000316700c 0000000000000000 0000000001074c00

Nov 23 15:14:10 d1s1 genunix: [ID 723222 kern.notice] 000002a101a351a0 unix:trap+694 (2a101a353a0, 10009, e, e, 0, 2a101a35cc0)

Nov 23 15:14:10 d1s1 genunix: [ID 179002 kern.notice]%l0-3: 0000000000000000 0000000001836a40 0000000000000034 0000000000000000

Nov 23 15:14:10 d1s1%l4-7: 0000000000000000 000006000316700c 0000000000000000 0000000000010200

Nov 23 15:14:11 d1s1 genunix: [ID 723222 kern.notice] 000002a101a352f0 unix:ktl0+64 (700694c8, 700691a0, 3e345e09edca9d2e, 4, 800, 6000316700b)

Nov 23 15:14:11 d1s1 genunix: [ID 179002 kern.notice]%l0-3: 0000030001996000 0000000000000090 0000000080001600 000000000101c504

Nov 23 15:14:11 d1s1%l4-7: 0000000000000002 000006000316700c 0000000000000000 000002a101a353a0

Nov 23 15:14:11 d1s1 genunix: [ID 562518 kern.notice] 000002a101a35440 60003167008 (600033ea720, 4, 600049ca028, 40064dbd3e4ccfc7, ff, 6f)

Nov 23 15:14:12 d1s1 genunix: [ID 179002 kern.notice]%l0-3: 0000000000000004 0000000000000800 000006000316700f 00000600049ca02b

Nov 23 15:14:12 d1s1%l4-7: 0000000000000002 000006000316700c 000000000000006f 00000000000000d8

Nov 23 15:14:12 d1s1 genunix: [ID 723222 kern.notice] 000002a101a354f0 arp:ar_rput+374 (600033ea720, 600071d1680, 600049ca028, 0, 1, 800)

Nov 23 15:14:12 d1s1 genunix: [ID 179002 kern.notice]%l0-3: 0000060003294318 00000600049ca018 00000600049ca01e 0000000000000004

Nov 23 15:14:12 d1s1%l4-7: 0000000000000006 0000000000000000 00000600032c0300 00000300005e2a74

Nov 23 15:14:13 d1s1 genunix: [ID 723222 kern.notice] 000002a101a355c0 unix:putnext+218 (6000316de08, 600041922b8, 600032c0300, 0, 60004192548, 0)

Nov 23 15:14:13 d1s1 genunix: [ID 179002 kern.notice]%l0-3: 0000000000000000 0000000000000000 0000000000000000 0000000000005670

Nov 23 15:14:13 d1s1%l4-7: 000000000000010d 0000000070068af8 00000000013fbebc fffffd5efe5d0000

Nov 23 15:14:14 d1s1 genunix: [ID 723222 kern.notice] 000002a101a35670 pfil:pfilmodrput+2d8 (60004192548, 600032c0300, 2a101a30000, 7bedd77c, 600041b3ce8, 300005e2a74)

Nov 23 15:14:14 d1s1 genunix: [ID 179002 kern.notice]%l0-3: 0000000000000000 0000000000000000 0000000000000000 00000000000057e0

Nov 23 15:14:14 d1s1%l4-7: 000000000000010d 00000000703530e8 000000007ba4833c fffffd5efe5d0000

Nov 23 15:14:14 d1s1 genunix: [ID 723222 kern.notice] 000002a101a35730 unix:putnext+218 (60004192738, 60004192548, 600032c0300, 100, 600041927d8, 0)

Nov 23 15:14:15 d1s1 genunix: [ID 179002 kern.notice]%l0-3: 0000000000000000 0000000000000000 0000000000000000 00000000000057e0

Nov 23 15:14:15 d1s1%l4-7: 000000000000010d 00000000703530e8 000000007ba4833c fffffd5efe5d0000

Nov 23 15:14:15 d1s1 genunix: [ID 723222 kern.notice] 000002a101a357e0 ipge:ipge_drain_fifo+57a8 (600049ca002, 7bede174, 600071d1680, 7bedd77c, 7beddb80, 1)

Nov 23 15:14:15 d1s1 genunix: [ID 179002 kern.notice]%l0-3: 0000000000000001 00000300005b2900 00000000010b6878 00000600041ada68

Nov 23 15:14:15 d1s1%l4-7: 0000000000000000 0000000000000000 000000000105db5c 00000600041927d8

Nov 23 15:14:16 d1s1 unix: [ID 100000 kern.notice]

Nov 23 15:14:16 d1s1 genunix: [ID 672855 kern.notice] syncing file systems...

Nov 23 15:14:17 d1s1 genunix: [ID 733762 kern.notice] 17

Nov 23 15:14:19 d1s1 genunix: [ID 733762 kern.notice] 5

Nov 23 15:14:22 d1s1 genunix: [ID 733762 kern.notice] 4

Nov 23 15:15:05 d1s1 last message repeated 20 times

Nov 23 15:15:06 d1s1 genunix: [ID 622722 kern.notice] done (not all i/o completed)

Nov 23 15:15:07 d1s1 genunix: [ID 111219 kern.notice] dumping to /dev/dsk/c0t0d0s1, offset 214827008, content: kernel

Nov 23 15:16:29 d1s1 genunix: [ID 409368 kern.notice] ^M100% done: 128159 pages dumped, compression ratio 4.61,

Nov 23 15:16:29 d1s1 genunix: [ID 851671 kern.notice] dump succeeded

Nov 23 15:22:19 d1s1 savecore: [ID 570001 auth.error] reboot after panic: BAD TRAP: type=34 rp=2a101a353a0 addr=4dbd3e4ccfd7 mmu_fsr=0
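
As a quick sanity check on the trap above, the faulting address can be tested for natural alignment. This is only a minimal sketch: it assumes the `type=` and `addr=` fields of the BAD TRAP line are printed in hexadecimal, and the helper names `parse_bad_trap` and `misaligned_for` are made up for illustration. Trap type 0x34 is the SPARC V9 mem_address_not_aligned trap, which matches the "alignment error" message in the log.

```python
import re

# The panic summary line copied from the log above.
PANIC_LINE = ("BAD TRAP: type=34 rp=2a101a353a0 "
              "addr=4dbd3e4ccfd7 mmu_fsr=0")


def parse_bad_trap(line):
    """Return (trap_type, addr) as integers parsed from a BAD TRAP line.

    Assumption: both fields are hexadecimal without a 0x prefix.
    """
    m = re.search(r"BAD TRAP: type=([0-9a-f]+) .*?addr=([0-9a-f]+)", line)
    if m is None:
        raise ValueError("not a BAD TRAP line")
    return int(m.group(1), 16), int(m.group(2), 16)


def misaligned_for(addr, sizes=(2, 4, 8)):
    """Return the access sizes (in bytes) for which addr is not naturally aligned."""
    return [s for s in sizes if addr % s != 0]


trap_type, addr = parse_bad_trap(PANIC_LINE)
print(hex(trap_type), hex(addr), misaligned_for(addr))
```

The address ends in 7, so it is odd and cannot be a valid target for any 2-, 4- or 8-byte load or store. It also does not look like the other kernel pointers in the stack trace (2a1..., 600...), which may hint at a corrupted pointer rather than a deliberate unaligned access.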

[5297 byte] By [REALRegressor] at [2007-11-26 11:40:56]
# 1
Note: ALL the latest patches are installed!
REALRegressor at 2007-7-7 11:42:44 > top of Java-index,Solaris Operating System,Solaris Essentials - General Technical Questions...
# 2
I'd suspect a hardware problem...
robertcohen at 2007-7-7 11:42:44
# 3

It does not look like a hardware problem... There were two repeatable traps:

1. ip module trap - I fixed this problem by uninstalling cooltuner and removing the tuning parameters from /etc/system

2. alignment trap in the ipge module - I don't know how to solve this... :(

The freshly installed system ran for a week without traps... Then I installed all patches using "smpatch update", and now it panics every day...
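
For item 1 above, one way to double-check that no tuning parameters are left behind is to list the active (non-comment) lines of /etc/system. The following is a rough sketch, not the actual cooltuner settings: the module and tunable names in `SAMPLE` are hypothetical placeholders, and `active_settings` is an illustrative helper. It relies only on the fact that lines starting with `*` are comments in /etc/system.

```python
def active_settings(text):
    """Return the non-blank, non-comment lines of an /etc/system file."""
    active = []
    for raw in text.splitlines():
        line = raw.strip()
        # In /etc/system, a line starting with '*' is a comment.
        if not line or line.startswith("*"):
            continue
        active.append(line)
    return active


# Hypothetical file content, for illustration only.
SAMPLE = """\
* ident "example"
* hypothetical tuning entries
set example_module:example_tunable=1
* set example_module:disabled_tunable=0

forceload: drv/example
"""

for entry in active_settings(SAMPLE):
    print(entry)
```

On a real system the same function could be fed `open("/etc/system").read()`; any remaining `set` lines that were added by a tuning tool are the ones to review.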

REALRegressor at 2007-7-7 11:42:44
# 4

There have been issues where the T2000's hardware context switching has found some race conditions in the IP stack.

Please contact your software support vendor and ask that the case be escalated to get the latest IDR patches for the S10 TCP/IP stack. They are IDRs (Interim Diagnostic/Relief patches) because the formal patches have not been released yet, and we understand that crashing systems need a fast fix.

tim

timuglow at 2007-7-7 11:42:44