V440 service (yellow) LED on, but no faults

The service (yellow) LED on a new V440 is cycling on and off, mostly on, but there are no faults visible in prtdiag and the syslog is clean.

Could this be caused by a non-conforming or 3rd party DIMM configuration? (We bought the system from a VAR who had a track record of picking junk off the floor and stuffing it in the boxes they ship.)

Is there a description of the logic that drives this LED somewhere? You'd think it would be the OR of all the components of the system.

# /usr/platform/*440*/sbin/prtdiag -v | more

System Configuration: Sun Microsystems sun4u Sun Fire V440

System clock frequency: 177 MHZ

Memory size: 8GB

==================================== CPUs ====================================

CPU  Freq      E$ Size  CPU Impl.  CPU Mask  Die Temp  Ambient Temp  Fan Speed  Fan Unit
---  --------  -------  ---------  --------  --------  ------------  ---------  --------
0    1593 MHz  1MB      US-IIIi    3.4       -         -
1    1593 MHz  1MB      US-IIIi    3.4       -         -

================================= IO Devices =================================

Brd  Bus Type  Freq MHz  Slot  Name                            Model
---  --------  --------  ----  ------------------------------  -----------
0    pci       66        MB    pci108e,abba (network)          SUNW,pci-ce
0    pci       33        MB    isa/su (serial)
0    pci       33        MB    isa/su (serial)
0    pci       33        MB    isa/rmc-comm-rmc_comm (seria+
0    pci       33        MB    pci10b9,5229 (ide)
0    pci       66        MB    pci108e,abba (network)          SUNW,pci-ce
0    pci       66        MB    scsi-pci1000,30 (scsi-2)        LSI,1030
0    pci       66        MB    scsi-pci1000,30 (scsi-2)        LSI,1030

============================ Memory Configuration ============================

Segment Table:

Base Address  Size  Interleave Factor  Contains
------------  ----  -----------------  --------
0x0           4GB   16                 BankIDs 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
0x1000000000  4GB   16                 BankIDs 16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31

Bank Table:

[...]

Memory Module Groups:

ControllerID  GroupID  Labels
------------  -------  ------
0             0        C0/P0/B0/D0,C0/P0/B0/D1
0             1        C0/P0/B1/D0,C0/P0/B1/D1

Memory Module Groups:

ControllerID  GroupID  Labels
------------  -------  ------
1             0        C1/P0/B0/D0,C1/P0/B0/D1
1             1        C1/P0/B1/D0,C1/P0/B1/D1

============================ Environmental Status ============================

Fan Speeds:

Location  Sensor       Status  Speed
--------  -----------  ------  --------
FT0/F0    TACH         okay    3901 rpm
FT1/F0    TACH         okay    3857 rpm
FT1/F1    TACH         okay    3813 rpm
PS0       FF_PDCT_FAN  okay
PS1       FF_PDCT_FAN  okay

Keyswitch:

Location  Keyswitch  State
--------  ---------  ------
SYS       SYSCTRL    NORMAL

Led State:

Location  Led      State  Color
--------  -------  -----  -----
SYS       ACT      on     green
SYS       SERVICE  on     amber
SYS       LOCATE   off    white
PS0       POK      on     green
PS0       SERVICE  off    amber
PS0       OK2RM    off    blue
PS0       STBY     on     green
PS1       POK      on     green
PS1       SERVICE  off    amber
PS1       OK2RM    off    blue
PS1       STBY     on     green
HDD0      SERVICE  off    amber
HDD0      OK2RM    off    blue
HDD1      SERVICE  off    amber
HDD1      OK2RM    off    blue
HDD2      SERVICE  off    amber
HDD2      OK2RM    off    blue
HDD3      SERVICE  off    amber
HDD3      OK2RM    off    blue

Temperature sensors:

Location  Sensor  Temperature  Lo    LoWarn  HiWarn  Hi    Status
--------  ------  -----------  ----  ------  ------  ----  ------
MB        T_AMB   30C          -10C  0C      65C     75C   okay
SCSIBP    T_AMB   25C          -11C  0C      47C     52C   okay
C0        T_AMB   29C          -10C  0C      60C     65C   okay
C0/P0     T_CORE  80C          -10C  0C      108C    113C  okay
C1        T_AMB   28C          -10C  0C      60C     65C   okay
C1/P0     T_CORE  80C          -10C  0C      108C    113C  okay

Voltage sensors:

Location  Sensor       Voltage  Lo       LoWarn   HiWarn   Hi      Status
--------  -----------  -------  -------  -------  -------  ------  ------
MB        V_+1V5       1.49V    1.20V    1.27V    1.73V    1.80V   okay
MB        V_SCSI_CORE  1.83V    1.44V    1.53V    2.07V    2.16V   okay
MB        V_VCCTM      2.52V    2.00V    2.12V    2.88V    3.00V   okay
MB        V_NET0_1V2D  1.25V    0.96V    1.02V    1.38V    1.44V   okay
MB        V_NET1_1V2D  1.24V    0.96V    1.02V    1.38V    1.44V   okay
MB        V_NET0_1V2A  1.26V    0.96V    1.02V    1.38V    1.44V   okay
MB        V_NET1_1V2A  1.25V    0.96V    1.02V    1.38V    1.44V   okay
MB        V_+3V3       3.38V    2.64V    2.81V    3.80V    3.96V   okay
MB        V_+3V3STBY   3.31V    2.64V    2.81V    3.80V    3.96V   okay
MB        V_+5V        5.07V    4.00V    4.25V    5.75V    6.00V   okay
MB        V_+12V       12.06V   9.60V    10.20V   13.80V   14.40V  okay
MB        V_-12V       -11.82V  -14.40V  -13.80V  -10.20V  -9.60V  okay
MB/BAT    V_BAT        3.03V    -        2.25V    -        -       okay
PS0       FF_POK       -        -        -        -        -       okay
PS0       P_PWR        -        -        -        -        -       okay
PS1       FF_POK       -        -        -        -        -       okay
PS1       P_PWR        -        -        -        -        -       okay

Current sensors:

Location  Sensor    Current  Lo  LoWarn  HiWarn  Hi  Status
--------  --------  -------  --  ------  ------  --  ------
MB        FF_SCSIB  -        -   -       -       -   okay
MB        FF_SCSIA  -        -   -       -       -   okay
MB        FF_POK    -        -   -       -       -   okay
C0/P0     FF_POK    -        -   -       -       -   okay
C1/P0     FF_POK    -        -   -       -       -   okay

======== FRU Status =========

Fru Operational Status:

Location  Status
--------  -------
SC        okay
PS0       okay
PS1       okay
HDD0      present
HDD1      present
HDD2      present
HDD3      present

================================ HW Revisions ================================

ASIC Revisions:

pci: Rev 4

pci: Rev 4

pci: Rev 4

pci: Rev 4

System PROM revisions:

-

OBP 4.22.19 2006/09/06 23:42 Sun Fire V440,Netra 440

OBDIAG 4.22.19 2006/09/06 23:57
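
For anyone who wants to check the "OR of all the components" idea against their own prtdiag output, here is a rough Python sketch. It assumes the LED State table comes out of prtdiag with whitespace-separated columns (Location, Led, State, Color); the function names and the sample text are made up for illustration, with the sample rows taken from the output above. It only reports whether SYS SERVICE is on while no component SERVICE LED is on, which is the situation described in this post.

```python
# Sketch only: assumes whitespace-separated LED rows; names are hypothetical.
SAMPLE_LED_TABLE = """\
Location  Led      State  Color
SYS       ACT      on     green
SYS       SERVICE  on     amber
SYS       LOCATE   off    white
PS0       SERVICE  off    amber
PS1       SERVICE  off    amber
HDD0      SERVICE  off    amber
"""


def parse_led_rows(text):
    """Return one dict per LED row; header and separator lines are skipped."""
    rows = []
    for line in text.splitlines():
        parts = line.split()
        # A data row has exactly four fields and a state of "on" or "off".
        if len(parts) == 4 and parts[2] in ("on", "off"):
            rows.append(
                {"location": parts[0], "led": parts[1],
                 "state": parts[2], "color": parts[3]}
            )
    return rows


def unexplained_service_led(rows):
    """True if SYS SERVICE is on but no component SERVICE LED is on."""
    def is_lit_service(row):
        return row["led"] == "SERVICE" and row["state"] == "on"

    sys_on = any(is_lit_service(r) for r in rows if r["location"] == "SYS")
    component_on = any(is_lit_service(r) for r in rows if r["location"] != "SYS")
    return sys_on and not component_on


if __name__ == "__main__":
    rows = parse_led_rows(SAMPLE_LED_TABLE)
    print("SYS SERVICE on with no component fault LED:",
          unexplained_service_led(rows))
```

If this reports True, the system LED is not a plain OR of the component LEDs that prtdiag shows, so whatever is driving it has to be found elsewhere.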

[4939 byte] By [wsandersa] at [2007-11-27 1:22:33]
# 1
Resetting the ALOM cleared the fault LED, BTW. It has stayed off for an hour or so. Perhaps there was a power glitch, but I see nothing in the syslog.
wsandersa at 2007-7-12 0:09:53
# 2

This looks like an ALOM bug. I forgot to look at the ALOM logs before my earlier posts. I see this whenever I power the system on using the ALOM "poweron" command:

APR 27 22:33:06 sponge: 00060003: "SC System booted."

APR 27 22:34:18 sponge: 00060000: "SC Login: User admin Logged on."

APR 27 22:34:22 sponge: 00040001: "SC Request to Power On Host."

APR 27 22:34:23 sponge: 00040002: "Host System has Reset"

APR 27 22:34:25 sponge: 00040058: "Host Power-On Failed: System Power OK, Open Boot not responding"

APR 27 22:34:25 sponge: 0004000b: "Host System has read and cleared bootmode."

APR 27 22:34:54 sponge: 0004004f: "Indicator SYS.SERVICE is now ON"

APR 27 22:34:54 sponge: 0004004f: "Indicator PS0.POK is now ON"

APR 27 22:34:54 sponge: 0004004f: "Indicator PS1.POK is now ON"

APR 27 22:41:18 sponge: 00040002: "Host System has Reset"

APR 27 22:42:57 sponge: 00040002: "Host System has Reset"

APR 27 22:43:00 sponge: 0004000b: "Host System has read and cleared bootmode."

APR 27 22:45:02 sponge: 0004004f: "Indicator SYS.ACT is now ON"

So, this looks like some kind of deadlock with the OBP not coming up fast enough for the ALOM, or something. There are no apparent faults on the system. I do have diag-mode=max since several V440s we have received have had other hardware problems.
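
To put numbers on the timing, here is a small Python sketch that parses ALOM log lines in the format shown above and measures the gap between two events. The regular expression is only inferred from these few lines, the function names are made up, and the year is a placeholder because the log does not record one, so treat it as a starting point rather than a real ALOM log parser.

```python
import re
from datetime import datetime

# Four of the lines from the ALOM log above.
SAMPLE_LOG = '''\
APR 27 22:34:22 sponge: 00040001: "SC Request to Power On Host."
APR 27 22:34:25 sponge: 00040058: "Host Power-On Failed: System Power OK, Open Boot not responding"
APR 27 22:34:54 sponge: 0004004f: "Indicator SYS.SERVICE is now ON"
APR 27 22:45:02 sponge: 0004004f: "Indicator SYS.ACT is now ON"
'''

MONTHS = {name: number for number, name in enumerate(
    "JAN FEB MAR APR MAY JUN JUL AUG SEP OCT NOV DEC".split(), start=1)}

# Assumed layout: MON DD HH:MM:SS host: eventcode: "message"
LINE_RE = re.compile(
    r'^(\w{3}) +(\d{1,2}) (\d{2}):(\d{2}):(\d{2}) (\S+): ([0-9a-fA-F]{8}): "(.*)"$'
)


def parse_events(text):
    """Return (timestamp, event_code, message) tuples for matching lines."""
    events = []
    for line in text.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue
        mon, day, hour, minute, second, _host, code, message = match.groups()
        # The log has no year; 2000 is an arbitrary placeholder.
        when = datetime(2000, MONTHS[mon.upper()], int(day),
                        int(hour), int(minute), int(second))
        events.append((when, code, message))
    return events


def seconds_between(events, start_text, end_text):
    """Seconds from the first event containing start_text to the next
    event containing end_text, or None if either is missing."""
    start = None
    for when, _code, message in events:
        if start is None and start_text in message:
            start = when
        elif start is not None and end_text in message:
            return (when - start).total_seconds()
    return None


if __name__ == "__main__":
    events = parse_events(SAMPLE_LOG)
    print(seconds_between(events, "Request to Power On Host", "Power-On Failed"))
    print(seconds_between(events, "Request to Power On Host", "SYS.ACT is now ON"))
```

On the lines above, the "Host Power-On Failed" message is logged 3 seconds after the power-on request, while SYS.ACT does not come on until 640 seconds after it.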

wsandersa at 2007-7-12 0:09:53