something failed on Sun Fire v210
hi,
Server: Sun Fire v210
OS: Solaris 10 6/06 + lastest Recommended patches
kernel patch level: 118833-36
1. OS coredumped
2. last dump lines:
WARNING: /pci@1d,700000/SUNW,emlxs@1/fp@0,0/sst@w50017a44c2ca7000,0 (sst3):
Error for Command: <undecoded cmd 0xb8>Error Level: Fatal
Requested Block: 0 Error Block: 0
Vendor: HP Serial Number:<
Sense Key: Illegal Request
ASC: 0x24 (invalid field in cdb), ASCQ: 0x0, FRU: 0x0
/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):
FCP: Report Lun Has Changed
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: /pci@1d,700000/SUNW,emlxs@1/fp@0,0/sst@w50017a44c2ca7000,0 (sst3):
Error for Command: <undecoded cmd 0xb8>Error Level: Fatal
Requested Block: 0 Error Block: 0
Vendor: HP Serial Number:<
Sense Key: Illegal Request
ASC: 0x24 (invalid field in cdb), ASCQ: 0x0, FRU: 0x0
/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):
FCP: Report Lun Has Changed
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: Unbound dma handle 30006fecdc0 from fp2
/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):
FCP: Report Lun Has Changed
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: /pci@1d,700000/SUNW,emlxs@1/fp@0,0/sst@w50017a44c2ca7000,0 (sst3):
Error for Command: <undecoded cmd 0xb8>Error Level: Fatal
Requested Block: 0 Error Block: 0
Vendor: HP Serial Number:<
Sense Key: Illegal Request
ASC: 0x24 (invalid field in cdb), ASCQ: 0x0, FRU: 0x0
WARNING: Unbound dma handle 30006fecdc0 from fp2
/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):
FCP: Report Lun Has Changed
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: Unbound dma handle 30006fecdc0 from fp2
WARNING: /pci@1d,700000/SUNW,emlxs@1/fp@0,0/sst@w50017a44c2ca7000,0 (sst3):
Error for Command: <undecoded cmd 0xb8>Error Level: Fatal
Requested Block: 0 Error Block: 0
Vendor: HP Serial Number:<
Sense Key: Illegal Request
ASC: 0x24 (invalid field in cdb), ASCQ: 0x0, FRU: 0x0
/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):
FCP: Report Lun Has Changed
/pci@1d,700000/SUNW,emlxs@1,1/fp@0,0 (fcp0):
FCP: Report Lun Has Changed
NOTICE: SUNW-MSG-ID: SUNOS-8000-0G, TYPE: Error, VER: 1, SEVERITY: Major
panic[cpu0]/thread=2a10001fcc0:
pcisch-3: Fatal PCI bus error(s)
3. after several days install lastest Recommended patches
4. now i see this after every reboot:
SUNW-MSG-ID: SUN4U-8005-WS, TYPE: Fault, VER: 1, SEVERITY: Critical
EVENT-TIME: Thu Feb 15 14:54:54 EET 2007
PLATFORM: SUNW,Sun-Fire-V210, CSN: -, HOSTNAME: dummy1
SOURCE: eft, REV: 1.16
EVENT-ID: 9d2ab741-ee23-cd7f-abb6-d5cc55808016
DESC: A problem has been detected in the PCI subsystem. Refer to http://sun.com/msg/SUN4U-8005-WS for more information.
AUTO-RESPONSE: One or more device instances may be disabled
IMPACT: Loss of services provided by the device instances associated with this fault
REC-ACTION: Ensure that the latest drivers and patches are installed, schedule a repair procedure to replace the affected device if necessary, or contact Sun for support.
5. fmdump -v -u 9d2ab741-ee23-cd7f-abb6-d5cc55808016
Feb 15 14:54:54.9896 9d2ab741-ee23-cd7f-abb6-d5cc55808016 SUN4U-8005-WS
25% fault.io.tomatillo
Problem in: hc:///motherboard=0/hostbridge=0
Affects: hc:///motherboard=0/hostbridge=0
FRU: hc:///motherboard=0
25% defect.io.pci.driver
Problem in: hc:///motherboard=0/hostbridge=0/pcibus=1/pcidev=2/pcifn=1
Affects: mod:///mod-name=bge/mod-id=130
FRU: pkg:///SUNWcakr
25% defect.io.pci.driver
Problem in: hc:///motherboard=0/hostbridge=0/pcibus=1/pcidev=1/pcifn=1
Affects: mod:///mod-name=emlxs/mod-id=96
FRU: pkg:///SUNWemlxs
25% defect.io.pci.driver
Problem in: hc:///motherboard=0/hostbridge=0/pcibus=1/pcidev=32/pcifn=0
Affects: mod:///mod-name=pcisch/mod-id=23
FRU: pkg:///SUNWcakr
Whats wrong ? :/
--
Mike

