Service LED (yellow one) is on, but no apparent errors
Hello.
For some time now, on two of my servers, the yellow Service LED is on. The servers are working fine, though.
In the ALOM logs, I also don't find anything useful:
winds01_sc> showlogs -v
NOV 24 17:43:18 winds01: 00040029:"Host system has shut down."
NOV 24 17:57:38 winds01: 0000000a:"Power Supply 1 AC Power Unavailable."
NOV 25 13:43:01 winds01: 00040029:"Host system has shut down."
NOV 25 13:43:51 winds01: 0000000a:"Power Supply 1 AC Power Unavailable."
NOV 25 14:35:08 winds01: 00040002:"Host System has Reset"
NOV 25 14:54:10 winds01: 00040029:"Host system has shut down."
NOV 25 14:59:08 winds01: 00040002:"Host System has Reset"
NOV 25 15:44:29 winds01: 00040029:"Host system has shut down."
NOV 25 17:48:31 winds01: 00040002:"Host System has Reset"
NOV 25 20:38:11 winds01: 00040002:"Host System has Reset"
NOV 26 10:26:59 winds01: 00040002:"Host System has Reset"
MAY 05 10:53:35 winds01: 00040002:"Host System has Reset"
MAY 05 11:55:48 winds01: 00040002:"Host System has Reset"
MAY 07 17:02:04 winds01: 00060003:"SC System booted."
MAY 07 17:02:04 winds01: 00040002:"Host System has Reset"
MAY 07 17:03:18 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 17:02:04 winds01: 00040002:"Host System has Reset"
MAY 07 17:04:33 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 17:06:10 winds01: 00040002:"Host System has Reset"
MAY 07 17:07:25 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 17:06:11 winds01: 0004000b:"Host System has read and cleared bootmode."
MAY 07 17:08:40 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 17:06:44 winds01: 0004000b:"Host System has read and cleared bootmode."
MAY 07 17:09:55 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 17:07:18 winds01: 0004004f:"Indicator MB.SERVICE is now ON"
MAY 07 17:11:10 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 17:08:18 winds01: 0004004f:"Indicator MB.ACT is now ON"
MAY 07 17:12:25 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 18:31:33 winds01: 00040002:"Host System has Reset"
MAY 07 18:32:47 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 18:31:33 winds01: 0004000b:"Host System has read and cleared bootmode."
MAY 07 18:34:04 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 18:32:17 winds01: 0004004f:"Indicator MB.ACT is now OFF"
MAY 07 18:35:19 winds01: 00060007:"Failed to send email alert for recent event."
MAY 07 18:33:17 winds01: 0004004f:"Indicator MB.ACT is now ON"
MAY 07 18:36:34 winds01: 00060007:"Failed to send email alert for recent event."
JUL 17 12:18:49 winds01: 00060000:"SC Login: User admin Logged on."
As you can see, MB.SERVICE is on since MAY 07 17:07:18. But why has it turned on? prtdiag doesn't show anything broken either:
askwar@winds01 ~ $ /usr/platform/`uname -i`/sbin/prtdiag -v
System Configuration: Sun Microsystems sun4u Sun Fire V240
System clock frequency: 167 MHZ
Memory size: 2GB
==================================== CPUs ====================================
E$ CPUCPUTemperature
CPU FreqSizeImplementation MaskDieAmb. StatusLocation
-- ---- ---
01002 MHz 1MB SUNW,UltraSPARC-IIIi2.4--onlineMB/P0
================================= IO Devices =================================
BusFreq Slot +Name +
TypeMHzStatusPath Model
- - - --
pci66MB pci108e,1648 (network)
okay/pci@1f,700000/network@2
pci66MB pci108e,1648 (network)
okay/pci@1f,700000/network@2,1
pci33MB isa/su (serial)
okay/pci@1e,600000/isa@7/serial@0,3f8
pci33MB isa/su (serial)
okay/pci@1e,600000/isa@7/serial@0,2e8
pci33MB pci10b9,5229 (ide)
okay/pci@1e,600000/ide@d
pci33PCI2pci1077,1016 (scsi)QLGC,ISP10160
okay/pci@1e,600000/pci@2/scsi@4
pci33PCI2pci1077,1016 (scsi)QLGC,ISP10160
okay/pci@1e,600000/pci@2/scsi@5
pci66MB scsi-pci1000,21 (scsi-2)
okay/pci@1c,600000/scsi@2
pci66MB scsi-pci1000,21 (scsi-2)
okay/pci@1c,600000/scsi@2,1
pci66MB pci108e,1648 (network)
okay/pci@1d,700000/network@2
pci66MB pci108e,1648 (network)
okay/pci@1d,700000/network@2,1
pci66PCI0SUNW,XVR-100 (display)SUNW,375-3126
okay/pci@1d,700000/SUNW,XVR-100
pci337isa/rmc-comm-rmc_comm (seria+
okay/pci@1e,600000/isa@7/rmc-comm@0,3e8
pci3310 pciclass,0c0310 (usb)
okay/pci@1e,600000/usb@a
============================ Memory Configuration ============================
Segment Table:
--
Base AddressSizeInterleave Factor Contains
--
0x02GB4BankIDs 0,1,2,3
Bank Table:
--
Physical Location
IDControllerID GroupIDSizeInterleave Way
--
00 0 512MB0,1,2,3
10 1 512MB
20 1 512MB
30 0 512MB
Memory Module Groups:
--
ControllerIDGroupID Labels Status
--
0 0MB/P0/B0/D0
0 0MB/P0/B0/D1
0 1MB/P0/B1/D0
0 1MB/P0/B1/D1
============================ Environmental Status ============================
Fan Speeds:
Location Sensor StatusSpeed
F0RS okay6617 rpm
F1RS okay6617 rpm
F2RS okay6750 rpm
MB/P0/F0 RS okay16071 rpm
MB/P0/F1 RS okay16875 rpm
PS0FF_FAN okay
PS1FF_FAN okay
Temperature sensors:
--
LocationSensor Temperature LoLoWarn HiWarnHi Status
--
MB/P0 T_CORE53C--110C115Cokay
MB T_ENC23C-3C5C40C48Cokay
PS0FF_OT- ----okay
PS1FF_OT- ----okay
--
Current sensors:
--
Location SensorCurrentLoLoWarn HiWarnHi Status
--
MBFF_SCSI- ----okay
PS0FF_OC - ----okay
PS1FF_OC - ----okay
-
Voltage sensors:
-
LocationSensorVoltageLoLoWarn HiWarnHiStatus
-
MB/P0 V_CORE 1.45V-1.26V1.54V-okay
Keyswitch:
LocationState
SYSCTRLNORMAL
-
Led State:
-
Location LedStateColor
-
MB ACTon green
MB SERVICEon amber
MB LOCATEoff white
PS0ACTon green
PS0SERVICEoff amber
PS0OK2RMoff blue
PS1ACTon green
PS1SERVICEoff amber
PS1OK2RMoff blue
HDD0SERVICEoff amber
HDD0OK2RMoff blue
HDD1SERVICEoff amber
HDD1OK2RMoff blue
HDD2SERVICEoff amber
HDD2OK2RMoff blue
HDD3SERVICEoff amber
HDD3OK2RMoff blue
=========================== FRU Operational Status ===========================
Fru Operational Status:
LocationStatus
MB/SCokay
PS0 okay
HDD0present
HDD1present
PS1 okay
================================ HW Revisions ================================
ASIC Revisions:
-
PathDeviceStatus Revision
-
/pci@1f,700000 pci108e,a801okay4
/pci@1e,600000 pci108e,a801okay4
/pci@1c,600000 pci108e,a801okay4
/pci@1d,700000 pci108e,a801okay4
System PROM revisions:
-
OBP 4.22.11 2006/06/12 14:45 Sun Fire V210/V240,Netra 210/240
OBDIAG 4.22.11 2006/06/12 14:57
Same "issues" with another server:
sc_winds05> showlogs -v
JAN 12 06:34:58 winds05: 00000009:"Power Supply 0 AC Power Unavailable."
FEB 28 22:00:38 winds05: 00040002:"Host System has Reset"
NOV 24 17:08:55 winds05: 00040029:"Host system has shut down."
NOV 24 17:57:35 winds05: 0000000a:"Power Supply 1 AC Power Unavailable."
NOV 25 13:43:06 winds05: 00040029:"Host system has shut down."
NOV 25 13:43:56 winds05: 0000000a:"Power Supply 1 AC Power Unavailable."
NOV 25 14:34:26 winds05: 00040002:"Host System has Reset"
NOV 25 14:55:13 winds05: 00040029:"Host system has shut down."
NOV 25 15:02:36 winds05: 00040002:"Host System has Reset"
NOV 25 15:44:41 winds05: 00040029:"Host system has shut down."
NOV 25 17:34:47 winds05: 00040002:"Host System has Reset"
NOV 26 11:16:51 winds05: 00040002:"Host System has Reset"
MAY 07 17:02:07 winds05: 00060003:"SC System booted."
MAY 07 17:02:07 winds05: 00040002:"Host System has Reset"
MAY 07 17:03:21 winds05: 00060007:"Failed to send email alert for recent event."
MAY 07 17:02:07 winds05: 00040002:"Host System has Reset"
MAY 07 17:04:36 winds05: 00060007:"Failed to send email alert for recent event."
MAY 07 17:09:33 winds05: 00040002:"Host System has Reset"
MAY 07 17:10:48 winds05: 00060007:"Failed to send email alert for recent event."
MAY 07 17:09:34 winds05: 0004000b:"Host System has read and cleared bootmode."
MAY 07 17:12:03 winds05: 00060007:"Failed to send email alert for recent event."
MAY 07 17:10:06 winds05: 0004000b:"Host System has read and cleared bootmode."
MAY 07 17:13:18 winds05: 00060007:"Failed to send email alert for recent event."
MAY 07 17:10:21 winds05: 0004004f:"Indicator MB.SERVICE is now ON"
MAY 07 17:14:33 winds05: 00060007:"Failed to send email alert for recent event."
MAY 07 17:11:21 winds05: 0004004f:"Indicator MB.ACT is now ON"
MAY 07 17:15:46 winds05: 00060007:"Failed to send email alert for recent event."
MAY 07 18:30:49 winds05: 00040002:"Host System has Reset"
MAY 07 18:30:50 winds05: 0004000b:"Host System has read and cleared bootmode."
MAY 07 18:31:19 winds05: 0004004f:"Indicator MB.ACT is now OFF"
MAY 07 18:32:19 winds05: 0004004f:"Indicator MB.ACT is now ON"
MAY 29 06:32:23 winds05: 00040002:"Host System has Reset"
MAY 29 06:32:24 winds05: 0004000b:"Host System has read and cleared bootmode."
MAY 29 06:32:46 winds05: 0004004f:"Indicator MB.ACT is now OFF"
MAY 29 06:33:46 winds05: 0004004f:"Indicator MB.ACT is now ON"
JUL 17 12:20:32 winds05: 00060000:"SC Login: User admin Logged on."
askwar@winds05 ~ $ /usr/platform/`uname -i`/sbin/prtdiag -v
System Configuration: Sun Microsystems sun4u Sun Fire V240
System clock frequency: 167 MHZ
Memory size: 4GB
==================================== CPUs ====================================
E$ CPUCPUTemperature
CPU FreqSizeImplementationMaskDieAmb. Location
-- - - -- - - --
0 1002 MHz 1MB SUNW,UltraSPARC-IIIi2.4--MB/P0
================================= IO Devices =================================
BusFreq
Brd Type MHzSlotName Model
- - - - --
0pci66MB pci108e,1648 (network)
0pci66MB pci108e,1648 (network)
0pci33MB isa/su (serial)
0pci33MB isa/su (serial)
0pci33MB pci10b9,5229 (ide)
0pci33 PCI2 pci1077,1016 (scsi)QLGC,ISP10160
0pci33 PCI2 pci1077,1016 (scsi)QLGC,ISP10160
0pci66MB scsi-pci1000,21 (scsi-2)
0pci66MB scsi-pci1000,21 (scsi-2)
0pci66MB pci108e,1648 (network)
0pci66MB pci108e,1648 (network)
0pci66 PCI0 SUNW,XVR-100 (display)SUNW,375-3126
0pci337 isa/rmc-comm-rmc_comm (seria+
============================ Memory Configuration ============================
Segment Table:
--
Base AddressSizeInterleave Factor Contains
--
0x04GB16 BankIDs 0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
Bank Table:
--
Physical Location
IDControllerID GroupIDSizeInterleave Way
--
00 0 256MB0,1,2,3,4,5,6,7,8,9,10,11,12,13,14,15
10 0 256MB
20 1 256MB
30 1 256MB
40 0 256MB
50 0 256MB
60 1 256MB
70 1 256MB
80 1 256MB
90 1 256MB
100 0 256MB
110 0 256MB
120 1 256MB
130 1 256MB
140 0 256MB
150 0 256MB
Memory Module Groups:
--
ControllerIDGroupID Labels
--
0 0MB/P0/B0/D0
0 0MB/P0/B0/D1
0 1MB/P0/B1/D0
0 1MB/P0/B1/D1
============================ Environmental Status ============================
Fan Speeds:
LocationSensor StatusSpeed
F0 RS okay6750 rpm
F1 RS okay6818 rpm
F2 RS okay6818 rpm
MB/P0/F0RS okay16875 rpm
MB/P0/F1RS okay16875 rpm
PS0FF_FAN okay
PS1FF_FAN okay
Temperature sensors:
--
LocationSensor Temperature LoLoWarn HiWarnHiStatus
--
MB/P0 T_CORE54C--110C115Cokay
MB T_ENC22C-3C5C40C48Cokay
PS0FF_OT- ----okay
PS1FF_OT- ----okay
-
Current sensors:
-
Location Sensor CurrentLoLoWarn HiWarnHiStatus
-
MB FF_SCSI- ----okay
PS0FF_OC - ----okay
PS1FF_OC - ----okay
-
Voltage sensors:
-
LocationSensorVoltageLoLoWarn HiWarnHiStatus
-
MB/P0V_CORE1.45V-1.26V1.54V-okay
--
Led State:
--
LocationLedStateColor
--
MB ACTon green
MB SERVICEon amber
MB LOCATEoff white
PS0ACTon green
PS0SERVICEoff amber
PS0OK2RMoff blue
PS1ACTon green
PS1SERVICEoff amber
PS1OK2RMoff blue
HDD0SERVICEoff amber
HDD0OK2RMoff blue
HDD1SERVICEoff amber
HDD1OK2RMoff blue
HDD2SERVICEoff amber
HDD2OK2RMoff blue
HDD3SERVICEoff amber
HDD3OK2RMoff blue
=========================== FRU Operational Status ===========================
-
Fru Operational Status:
-
LocationStatus
-
MB/SCokay
PS0 okay
HDD0present
HDD1present
PS1 okay
================================ HW Revisions ================================
ASIC Revisions:
pci: Rev 4
pci: Rev 4
pci: Rev 4
pci: Rev 4
System PROM revisions:
-
OBP 4.22.11 2006/06/12 14:45 Sun Fire V210/V240,Netra 210/240
OBDIAG 4.22.11 2006/06/12 14:57
Any ideas about why this happens? What caused the LEDs to go on? And how do I turn them off, if there's no error?
Thanks,
Alexander Skwar

