Service LED (yellow one) is on, but no apparent errors

Hello.

For some time now, on two of my servers, the yellow Service LED is on. The servers are working fine, though.

In the ALOM logs, I also don't find anything useful:

winds01_sc> showlogs -v

NOV 24 17:43:18 winds01: 00040029:"Host system has shut down."

NOV 24 17:57:38 winds01: 0000000a:"Power Supply 1 AC Power Unavailable."

NOV 25 13:43:01 winds01: 00040029:"Host system has shut down."

NOV 25 13:43:51 winds01: 0000000a:"Power Supply 1 AC Power Unavailable."

NOV 25 14:35:08 winds01: 00040002:"Host System has Reset"

NOV 25 14:54:10 winds01: 00040029:"Host system has shut down."

NOV 25 14:59:08 winds01: 00040002:"Host System has Reset"

NOV 25 15:44:29 winds01: 00040029:"Host system has shut down."

NOV 25 17:48:31 winds01: 00040002:"Host System has Reset"

NOV 25 20:38:11 winds01: 00040002:"Host System has Reset"

NOV 26 10:26:59 winds01: 00040002:"Host System has Reset"

MAY 05 10:53:35 winds01: 00040002:"Host System has Reset"

MAY 05 11:55:48 winds01: 00040002:"Host System has Reset"

MAY 07 17:02:04 winds01: 00060003:"SC System booted."

MAY 07 17:02:04 winds01: 00040002:"Host System has Reset"

MAY 07 17:03:18 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 17:02:04 winds01: 00040002:"Host System has Reset"

MAY 07 17:04:33 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 17:06:10 winds01: 00040002:"Host System has Reset"

MAY 07 17:07:25 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 17:06:11 winds01: 0004000b:"Host System has read and cleared bootmode."

MAY 07 17:08:40 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 17:06:44 winds01: 0004000b:"Host System has read and cleared bootmode."

MAY 07 17:09:55 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 17:07:18 winds01: 0004004f:"Indicator MB.SERVICE is now ON"

MAY 07 17:11:10 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 17:08:18 winds01: 0004004f:"Indicator MB.ACT is now ON"

MAY 07 17:12:25 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 18:31:33 winds01: 00040002:"Host System has Reset"

MAY 07 18:32:47 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 18:31:33 winds01: 0004000b:"Host System has read and cleared bootmode."

MAY 07 18:34:04 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 18:32:17 winds01: 0004004f:"Indicator MB.ACT is now OFF"

MAY 07 18:35:19 winds01: 00060007:"Failed to send email alert for recent event."

MAY 07 18:33:17 winds01: 0004004f:"Indicator MB.ACT is now ON"

MAY 07 18:36:34 winds01: 00060007:"Failed to send email alert for recent event."

JUL 17 12:18:49 winds01: 00060000:"SC Login: User admin Logged on."

As you can see, MB.SERVICE is on since MAY 07 17:07:18. But why has it turned on? prtdiag doesn't show anything broken either:

askwar@winds01 ~ $ /usr/platform/`uname -i`/sbin/prtdiag -v

System Configuration: Sun Microsystems sun4u Sun Fire V240

System clock frequency: 167 MHZ

Memory size: 2GB

==================================== CPUs ====================================

E$ CPUCPUTemperature

CPU FreqSizeImplementation MaskDieAmb. StatusLocation

-- ---- ---

01002 MHz 1MB SUNW,UltraSPARC-IIIi2.4--onlineMB/P0

================================= IO Devices =================================

BusFreq Slot +Name +

TypeMHzStatusPath Model

- - - --

pci66MB pci108e,1648 (network)

okay/pci@1f,700000/network@2

pci66MB pci108e,1648 (network)

okay/pci@1f,700000/network@2,1

pci33MB isa/su (serial)

okay/pci@1e,600000/isa@7/serial@0,3f8

pci33MB isa/su (serial)

okay/pci@1e,600000/isa@7/serial@0,2e8

pci33MB pci10b9,5229 (ide)

okay/pci@1e,600000/ide@d

pci33PCI2pci1077,1016 (scsi)QLGC,ISP10160

okay/pci@1e,600000/pci@2/scsi@4

pci33PCI2pci1077,1016 (scsi)QLGC,ISP10160

okay/pci@1e,600000/pci@2/scsi@5

pci66MB scsi-pci1000,21 (scsi-2)

okay/pci@1c,600000/scsi@2

pci66MB scsi-pci1000,21 (scsi-2)

okay/pci@1c,600000/scsi@2,1

pci66MB pci108e,1648 (network)

okay/pci@1d,700000/network@2

pci66MB pci108e,1648 (network)

okay/pci@1d,700000/network@2,1

pci66PCI0SUNW,XVR-100 (display)SUNW,375-3126

okay/pci@1d,700000/SUNW,XVR-100

pci337isa/rmc-comm-rmc_comm (seria+

okay/pci@1e,600000/isa@7/rmc-comm@0,3e8

pci3310 pciclass,0c0310 (usb)

okay/pci@1e,600000/usb@a

============================ Memory Configuration ============================

Segment Table:

--

Base AddressSizeInterleave Factor Contains

--

0x02GB4BankIDs 0,1,2,3

Bank Table:

--

Physical Location

IDControllerID GroupIDSizeInterleave Way

--

00 0 512MB0,1,2,3

10 1 512MB

20 1 512MB

30 0 512MB

Memory Module Groups:

--

ControllerIDGroupID Labels Status

--

0 0MB/P0/B0/D0

0 0MB/P0/B0/D1

0 1MB/P0/B1/D0

0 1MB/P0/B1/D1

============================ Environmental Status ============================

Fan Speeds:

Location Sensor StatusSpeed

F0RS okay6617 rpm

F1RS okay6617 rpm

F2RS okay6750 rpm

MB/P0/F0 RS okay16071 rpm

MB/P0/F1 RS okay16875 rpm

PS0FF_FAN okay

PS1FF_FAN okay

Temperature sensors:

--

LocationSensor Temperature LoLoWarn HiWarnHi Status

--

MB/P0 T_CORE53C--110C115Cokay

MB T_ENC23C-3C5C40C48Cokay

PS0FF_OT- ----okay

PS1FF_OT- ----okay

--

Current sensors:

--

Location SensorCurrentLoLoWarn HiWarnHi Status

--

MBFF_SCSI- ----okay

PS0FF_OC - ----okay

PS1FF_OC - ----okay

-

Voltage sensors:

-

LocationSensorVoltageLoLoWarn HiWarnHiStatus

-

MB/P0 V_CORE 1.45V-1.26V1.54V-okay

Keyswitch:

LocationState

SYSCTRLNORMAL

-

Led State:

-

Location LedStateColor

-

MB ACTon green

MB SERVICEon amber

MB LOCATEoff white

PS0ACTon green

PS0SERVICEoff amber

PS0OK2RMoff blue

PS1ACTon green

PS1SERVICEoff amber

PS1OK2RMoff blue

HDD0SERVICEoff amber

HDD0OK2RMoff blue

HDD1SERVICEoff amber

HDD1OK2RMoff blue

HDD2SERVICEoff amber

HDD2OK2RMoff blue

HDD3SERVICEoff amber

HDD3OK2RMoff blue

=========================== FRU Operational Status ===========================

Fru Operational Status:

LocationStatus

MB/SCokay

PS0 okay

HDD0present

HDD1present

PS1 okay

================================ HW Revisions ================================

ASIC Revisions:

-

PathDeviceStatus Revision

-

/pci@1f,700000 pci108e,a801okay4

/pci@1e,600000 pci108e,a801okay4

/pci@1c,600000 pci108e,a801okay4

/pci@1d,700000 pci108e,a801okay4

System PROM revisions:

-

OBP 4.22.11 2006/06/12 14:45 Sun Fire V210/V240,Netra 210/240

OBDIAG 4.22.11 2006/06/12 14:57

Same "issues" with another server:

sc_winds05> showlogs -v

JAN 12 06:34:58 winds05: 00000009:"Power Supply 0 AC Power Unavailable."

FEB 28 22:00:38 winds05: 00040002:"Host System has Reset"

NOV 24 17:08:55 winds05: 00040029:"Host system has shut down."

NOV 24 17:57:35 winds05: 0000000a:"Power Supply 1 AC Power Unavailable."

NOV 25 13:43:06 winds05: 00040029:"Host system has shut down."

NOV 25 13:43:56 winds05: 0000000a:"Power Supply 1 AC Power Unavailable."

NOV 25 14:34:26 winds05: 00040002:"Host System has Reset"

NOV 25 14:55:13 winds05: 00040029:"Host system has shut down."

NOV 25 15:02:36 winds05: 00040002:"Host System has Reset"

NOV 25 15:44:41 winds05: 00040029:"Host system has shut down."

NOV 25 17:34:47 winds05: 00040002:"Host System has Reset"

NOV 26 11:16:51 winds05: 00040002:"Host System has Reset"

MAY 07 17:02:07 winds05: 00060003:"SC System booted."

MAY 07 17:02:07 winds05: 00040002:"Host System has Reset"

MAY 07 17:03:21 winds05: 00060007:"Failed to send email alert for recent event."

MAY 07 17:02:07 winds05: 00040002:"Host System has Reset"

MAY 07 17:04:36 winds05: 00060007:"Failed to send email alert for recent event."

MAY 07 17:09:33 winds05: 00040002:"Host System has Reset"

MAY 07 17:10:48 winds05: 00060007:"Failed to send email alert for recent event."

MAY 07 17:09:34 winds05: 0004000b:"Host System has read and cleared bootmode."

MAY 07 17:12:03 winds05: 00060007:"Failed to send email alert for recent event."

MAY 07 17:10:06 winds05: 0004000b:"Host System has read and cleared bootmode."

MAY 07 17:13:18 winds05: 00060007:"Failed to send email alert for recent event."

MAY 07 17:10:21 winds05: 0004004f:"Indicator MB.SERVICE is now ON"

MAY 07 17:14:33 winds05: 00060007:"Failed to send email alert for recent event."

MAY 07 17:11:21 winds05: 0004004f:"Indicator MB.ACT is now ON"

MAY 07 17:15:46 winds05: 00060007:"Failed to send email alert for recent event."

MAY 07 18:30:49 winds05: 00040002:"Host System has Reset"

MAY 07 18:30:50 winds05: 0004000b:"Host System has read and cleared bootmode."

MAY 07 18:31:19 winds05: 0004004f:"Indicator MB.ACT is now OFF"

MAY 07 18:32:19 winds05: 0004004f:"Indicator MB.ACT is now ON"

MAY 29 06:32:23 winds05: 00040002:"Host System has Reset"

MAY 29 06:32:24 winds05: 0004000b:"Host System has read and cleared bootmode."

MAY 29 06:32:46 winds05: 0004004f:"Indicator MB.ACT is now OFF"

MAY 29 06:33:46 winds05: 0004004f:"Indicator MB.ACT is now ON"

JUL 17 12:20:32 winds05: 00060000:"SC Login: User admin Logged on."

askwar@winds05 ~ $ /usr/platform/`uname -i`/sbin/prtdiag -v

System Configuration: Sun Microsystems sun4u Sun Fire V240

System clock frequency: 167 MHZ

Memory size: 4GB

==================================== CPUs ====================================

E$ CPUCPUTemperature

CPU FreqSizeImplementationMaskDieAmb. Location

-- - - -- - - --

0 1002 MHz 1MB SUNW,UltraSPARC-IIIi2.4--MB/P0

================================= IO Devices =================================

BusFreq

Brd Type MHzSlotName Model

- - - - --

0pci66MB pci108e,1648 (network)

0pci66MB pci108e,1648 (network)

0pci33MB isa/su (serial)

0pci33MB isa/su (serial)

0pci33MB pci10b9,5229 (ide)

0pci33 PCI2 pci1077,1016 (scsi)QLGC,ISP10160

0pci33 PCI2 pci1077,1016 (scsi)QLGC,ISP10160

0pci66MB scsi-pci1000,21 (scsi-2)

0pci66MB scsi-pci1000,21 (scsi-2)

0pci66MB pci108e,1648 (network)

0pci66MB pci108e,1648 (network)

0pci66 PCI0 SUNW,XVR-100 (display)SUNW,375-3126

0pci337 isa/rmc-comm-rmc_comm (seria+

============================ Memory Configuration ============================

Segment Table:

--

Base AddressSizeInterleave Factor Contains

--

0x04GB16 BankIDs 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15

Bank Table:

--

Physical Location

IDControllerID GroupIDSizeInterleave Way

--

00 0 256MB0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15

10 0 256MB

20 1 256MB

30 1 256MB

40 0 256MB

50 0 256MB

60 1 256MB

70 1 256MB

80 1 256MB

90 1 256MB

100 0 256MB

110 0 256MB

120 1 256MB

130 1 256MB

140 0 256MB

150 0 256MB

Memory Module Groups:

--

ControllerIDGroupID Labels

--

0 0MB/P0/B0/D0

0 0MB/P0/B0/D1

0 1MB/P0/B1/D0

0 1MB/P0/B1/D1

============================ Environmental Status ============================

Fan Speeds:

LocationSensor StatusSpeed

F0 RS okay6750 rpm

F1 RS okay6818 rpm

F2 RS okay6818 rpm

MB/P0/F0RS okay16875 rpm

MB/P0/F1RS okay16875 rpm

PS0FF_FAN okay

PS1FF_FAN okay

Temperature sensors:

--

LocationSensor Temperature LoLoWarn HiWarnHiStatus

--

MB/P0 T_CORE54C--110C115Cokay

MB T_ENC22C-3C5C40C48Cokay

PS0FF_OT- ----okay

PS1FF_OT- ----okay

-

Current sensors:

-

Location Sensor CurrentLoLoWarn HiWarnHiStatus

-

MB FF_SCSI- ----okay

PS0FF_OC - ----okay

PS1FF_OC - ----okay

-

Voltage sensors:

-

LocationSensorVoltageLoLoWarn HiWarnHiStatus

-

MB/P0V_CORE1.45V-1.26V1.54V-okay

--

Led State:

--

LocationLedStateColor

--

MB ACTon green

MB SERVICEon amber

MB LOCATEoff white

PS0ACTon green

PS0SERVICEoff amber

PS0OK2RMoff blue

PS1ACTon green

PS1SERVICEoff amber

PS1OK2RMoff blue

HDD0SERVICEoff amber

HDD0OK2RMoff blue

HDD1SERVICEoff amber

HDD1OK2RMoff blue

HDD2SERVICEoff amber

HDD2OK2RMoff blue

HDD3SERVICEoff amber

HDD3OK2RMoff blue

=========================== FRU Operational Status ===========================

-

Fru Operational Status:

-

LocationStatus

-

MB/SCokay

PS0 okay

HDD0present

HDD1present

PS1 okay

================================ HW Revisions ================================

ASIC Revisions:

pci: Rev 4

pci: Rev 4

pci: Rev 4

pci: Rev 4

System PROM revisions:

-

OBP 4.22.11 2006/06/12 14:45 Sun Fire V210/V240,Netra 210/240

OBDIAG 4.22.11 2006/06/12 14:57

Any ideas about why this happens? What caused the LEDs to go on? And how do I turn them off, if there's no error?

Thanks,

Alexander Skwar

[16076 byte] By [A.Skwara] at [2007-11-27 10:49:24]
# 1

Have you plugged in both power supply units?

If one of two doesn't have power the system maintenance LED will light up.

Ronald@dedrienotarissena at 2007-7-29 11:19:00 > top of Java-index,Sun Hardware,Servers - General Discussion...