something failed on Sun Fire v210

hi,

Server: Sun Fire v210

OS: Solaris 10 6/06 + lastest Recommended patches

kernel patch level: 118833-36

1. OS coredumped

2. last dump lines:

WARNING: /pci@1d,700000/SUNW,emlxs@1/fp@0,0/sst@w50017a44c2ca7000,0 (sst3):

Error for Command: <undecoded cmd 0xb8>Error Level: Fatal

Requested Block: 0 Error Block: 0

Vendor: HP Serial Number:<

Sense Key: Illegal Request

ASC: 0x24 (invalid field in cdb), ASCQ: 0x0, FRU: 0x0

/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):

FCP: Report Lun Has Changed

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: /pci@1d,700000/SUNW,emlxs@1/fp@0,0/sst@w50017a44c2ca7000,0 (sst3):

Error for Command: <undecoded cmd 0xb8>Error Level: Fatal

Requested Block: 0 Error Block: 0

Vendor: HP Serial Number:<

Sense Key: Illegal Request

ASC: 0x24 (invalid field in cdb), ASCQ: 0x0, FRU: 0x0

/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):

FCP: Report Lun Has Changed

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: Unbound dma handle 30006fecdc0 from fp2

/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):

FCP: Report Lun Has Changed

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: /pci@1d,700000/SUNW,emlxs@1/fp@0,0/sst@w50017a44c2ca7000,0 (sst3):

Error for Command: <undecoded cmd 0xb8>Error Level: Fatal

Requested Block: 0 Error Block: 0

Vendor: HP Serial Number:<

Sense Key: Illegal Request

ASC: 0x24 (invalid field in cdb), ASCQ: 0x0, FRU: 0x0

WARNING: Unbound dma handle 30006fecdc0 from fp2

/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):

FCP: Report Lun Has Changed

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: Unbound dma handle 30006fecdc0 from fp2

WARNING: /pci@1d,700000/SUNW,emlxs@1/fp@0,0/sst@w50017a44c2ca7000,0 (sst3):

Error for Command: <undecoded cmd 0xb8>Error Level: Fatal

Requested Block: 0 Error Block: 0

Vendor: HP Serial Number:<

Sense Key: Illegal Request

ASC: 0x24 (invalid field in cdb), ASCQ: 0x0, FRU: 0x0

/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):

FCP: Report Lun Has Changed

/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):

FCP: Report Lun Has Changed

NOTICE: SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major

panic[cpu0]/thread=2a10001fcc0:

pcisch-3: Fatal PCI bus error(s)

3. after several days install lastest Recommended patches

4. now i see this after every reboot:

SUNW-MSG-ID: SUN4U-8005-WS, TYPE: Fault, VER: 1, SEVERITY: Critical

EVENT-TIME: Thu Feb 15 14:54:54 EET 2007

PLATFORM: SUNW,Sun-Fire-V210, CSN: -, HOSTNAME: dummy1

SOURCE: eft, REV: 1.16

EVENT-ID: 9d2ab741-ee23-cd7f-abb6-d5cc55808016

DESC: A problem has been detected in the PCI subsystem. Refer to http://sun.com/msg/SUN4U-8005-WS for more information.

AUTO-RESPONSE: One or more device instances may be disabled

IMPACT: Loss of services provided by the device instances associated with this fault

REC-ACTION: Ensure that the latest drivers and patches are installed, schedule a repair procedure to replace the affected device if necessary, or contact Sun for support.

5. fmdump -v -u 9d2ab741-ee23-cd7f-abb6-d5cc55808016

Feb 15 14:54:54.9896 9d2ab741-ee23-cd7f-abb6-d5cc55808016 SUN4U-8005-WS

25% fault.io.tomatillo

Problem in: hc:///motherboard=0/hostbridge=0

Affects: hc:///motherboard=0/hostbridge=0

FRU: hc:///motherboard=0

25% defect.io.pci.driver

Problem in: hc:///motherboard=0/hostbridge=0/pcibus=1/pcidev=2/pcifn=1

Affects: mod:///mod-name=bge/mod-id=130

FRU: pkg:///SUNWcakr

25% defect.io.pci.driver

Problem in: hc:///motherboard=0/hostbridge=0/pcibus=1/pcidev=1/pcifn=1

Affects: mod:///mod-name=emlxs/mod-id=96

FRU: pkg:///SUNWemlxs

25% defect.io.pci.driver

Problem in: hc:///motherboard=0/hostbridge=0/pcibus=1/pcidev=32/pcifn=0

Affects: mod:///mod-name=pcisch/mod-id=23

FRU: pkg:///SUNWcakr

Whats wrong ? :/

--

Mike

[4509 byte] By [mpecha] at [2007-11-26 18:34:27]
# 1

...not really sure what's going on.

Perhaps you could use your service contract or system warranty

and open a support case with Sun.

http://www.sun.com/secure/contact/

I see some "before" and "after" circumstances in your description.

They may be related or not.TechSupport would be able to sort it all out.

At the time of the reset event, your excerpt shows a tape drive peripheral

connected to an Emulex fibre-channel adapter card.

Your excerpt after the patch cluster was applied mentions something

that suggests a malfunction on the PCI bus.

... but again, that's just a guess after spending mere seconds reading the info.

When opening your service case, request it be passed to the Storage team.

These forums are user-to-user discussions, not techsupport.

rukbata at 2007-7-9 6:08:33 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 2
thank you for replyi did more tests and i see the same warning output after reboot w/o FC card too!
mpecha at 2007-7-9 6:08:33 > top of Java-index,Sun Hardware,Servers - General Discussion...