Kernel Panic

I had a Solaris 10 (11/06) virtual machine hosted on ESX 3.01 panic on me last night. It panicked repeatedly, then stopped for no apparent reason. Here's the output from /var/adm/messages:

May 14 21:07:48 IUProWeb03 ^Mpanic[cpu0]/thread=ffffffff8d839d80:

May 14 21:07:48 IUProWeb03 genunix: [ID 335743 kern.notice] BAD TRAP: type=e (#pf Page fault) rp=fffffe8000281500 addr=30 occurred in module "zfs" due to a NULL pointer dereference

May 14 21:07:48 IUProWeb03 unix: [ID 100000 kern.notice]

May 14 21:07:48 IUProWeb03 unix: [ID 839527 kern.notice] httpd:

May 14 21:07:48 IUProWeb03 unix: [ID 753105 kern.notice] #pf Page fault

May 14 21:07:48 IUProWeb03 unix: [ID 532287 kern.notice] Bad kernel fault at addr=0x30

May 14 21:07:48 IUProWeb03 unix: [ID 243837 kern.notice] pid=1229, pc=0xfffffffff64ee487, sp=0xfffffe80002815f0, eflags=0x10206

May 14 21:07:48 IUProWeb03 unix: [ID 211416 kern.notice] cr0: 8005003b<pg,wp,ne,et,ts,mp,pe> cr4: 6b0<xmme,fxsr,pge,pae,pse>

May 14 21:07:48 IUProWeb03 unix: [ID 354241 kern.notice] cr2: 30 cr3: 7fdf000 cr8: c

May 14 21:07:49 IUProWeb03 unix: [ID 592667 kern.notice]rdi: fffffffffbc1fee8 rsi: fffffffffaaedeb8 rdx:0

May 14 21:07:49 IUProWeb03 unix: [ID 592667 kern.notice]rcx:0 r8:1 r9: fffffffffbc3b7e0

May 14 21:07:49 IUProWeb03 unix: [ID 592667 kern.notice]rax:30 rbx:60000 rbp: fffffe8000281670

May 14 21:07:49 IUProWeb03 unix: [ID 592667 kern.notice]r10:0 r11:16c1b r12: fffffe8000281730

May 14 21:07:49 IUProWeb03 unix: [ID 592667 kern.notice]r13: 1000 r14: 2000 r15: 2000

May 14 21:07:49 IUProWeb03 unix: [ID 592667 kern.notice]fsb: ffffffff80000000 gsb: fffffffffbc240e0 ds:43

May 14 21:07:49 IUProWeb03 unix: [ID 592667 kern.notice] es:43 fs:0 gs: 1c3

May 14 21:07:49 IUProWeb03 unix: [ID 592667 kern.notice]trp:e err:0 rip: fffffffff64ee487

May 14 21:07:49 IUProWeb03 unix: [ID 592667 kern.notice] cs:28 rfl:10206 rsp: fffffe80002815f0

May 14 21:07:49 IUProWeb03 unix: [ID 266532 kern.notice] ss:0

May 14 21:07:49 IUProWeb03 unix: [ID 100000 kern.notice]

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe8000281410 unix:die+da ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe80002814f0 unix:trap+d77 ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe8000281500 unix:cmntrap+13f ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe8000281670 zfs:zfs_getpage+1e7 ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe80002816c0 genunix:fop_getpage+47 ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe8000281780 genunix:segmap_fault+118 ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe80002818c0 sockfs:snf_segmap+26d ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe80002819e0 sockfs:sosendfile64+1df ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe8000281ba0 genunix:sendvec64+215 ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe8000281ec0 genunix:sendfilev+52c ()

May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] fffffe8000281f10 unix:sys_syscall32+101 ()

May 14 21:07:49 IUProWeb03 unix: [ID 100000 kern.notice]

May 14 21:07:49 IUProWeb03 genunix: [ID 672855 kern.notice] syncing file systems...

May 14 21:07:49 IUProWeb03 genunix: [ID 904073 kern.notice] done

May 14 21:07:50 IUProWeb03 genunix: [ID 111219 kern.notice] dumping to /dev/dsk/c1t0d0s1, offset 431030272, content: kernel

May 14 21:07:58 IUProWeb03 genunix: [ID 409368 kern.notice] ^M100% done: 77792 pages dumped, compression ratio 2.50,

May 14 21:07:58 IUProWeb03 genunix: [ID 851671 kern.notice] dump succeeded

This server is running Apache, MySQL, and Tomcat, and hasn't given me any trouble since I deployed it several months ago.

[4049 byte] By [CCJacksona] at [2007-11-27 4:22:28]
# 1

> May 14 21:07:48 IUProWeb03 ^Mpanic[cpu0]/thread=ffffffff8d839d80:
>
> May 14 21:07:48 IUProWeb03 genunix: [ID 335743 kern.notice] BAD TRAP: type=e (#pf Page fault) rp=fffffe8000281500 addr=30 occurred in module "zfs" due to a NULL pointer dereference

As it occurred in the ZFS module, a good starting point would probably be to check the patch levels of ZFS and the kernel.
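
If the box panics again and you want to compare the traces quickly, something like the following might help. It is only a rough Python sketch of my own, not a Solaris tool: the names summarize_panic, TRAP_RE and FRAME_RE are made up for illustration, and the patterns assume the messages look like the excerpt posted above. It pulls out the module named on the BAD TRAP line and the module:function pairs from the stack frames.

```python
import re

# Captures the trap type, its description, and the faulting module name
# from a "BAD TRAP" line (format assumed from the excerpt in this thread).
TRAP_RE = re.compile(r'BAD TRAP: type=(\w+) \((.*?)\).*?occurred in module "([^"]+)"')

# Matches a stack frame such as "fffffe8000281670 zfs:zfs_getpage+1e7 ()".
FRAME_RE = re.compile(r'\b[0-9a-f]{16} (\w+):(\w+)\+([0-9a-f]+) \(')


def summarize_panic(lines):
    """Return (faulting_module, [(module, function), ...]) for a panic excerpt."""
    module = None
    frames = []
    for line in lines:
        trap = TRAP_RE.search(line)
        if trap:
            module = trap.group(3)
            continue
        frame = FRAME_RE.search(line)
        if frame:
            frames.append((frame.group(1), frame.group(2)))
    return module, frames


# A few lines copied from the original post, used as sample input.
sample = [
    'May 14 21:07:48 IUProWeb03 genunix: [ID 335743 kern.notice] BAD TRAP: '
    'type=e (#pf Page fault) rp=fffffe8000281500 addr=30 occurred in module '
    '"zfs" due to a NULL pointer dereference',
    'May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] '
    'fffffe8000281500 unix:cmntrap+13f ()',
    'May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] '
    'fffffe8000281670 zfs:zfs_getpage+1e7 ()',
    'May 14 21:07:49 IUProWeb03 genunix: [ID 655072 kern.notice] '
    'fffffe80002816c0 genunix:fop_getpage+47 ()',
]

if __name__ == "__main__":
    faulting_module, stack = summarize_panic(sample)
    print("faulting module:", faulting_module)
    for mod, func in stack:
        print("  %s:%s" % (mod, func))
```

To run it against the real log, pass the lines of /var/adm/messages instead of the sample list. If every panic shows the same zfs:zfs_getpage frame underneath the sendfilev path, that is a useful detail to have when looking through patch READMEs or opening a support case.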

Hope this helps,

Oliver

oliberta at 2007-7-12 9:29:48