E3500 ? I dont know what to do

HI PPL,

got a E3500 that making my head spinn.

First i didnt have output, then (after reading in this forum)

i did a Hyperterm and wow i get output, alots of out Puts.

My System are doing countinus Reboot due to severe error.

Hier is 2xparts of my Logfile where i think the error ist.

(Have attached the hole log)

7,0>FPU Instruction Test

7,0>TESTING IO BOARD 1

7,0>Board 1 I/O FPROM Test

7,0>I/O Board EPROM checksum Test

7,0>ERROR: TEST=I/O FPROM,SUBTEST=I/O Board EPROM checksum ID=a.1

7,0>Component under test: Board 1 Firehose PROM U2501

"7,0>POST/iPOST PROM Checksum Error"

"checksum from prom header = e56f"

"computed segment checksum = 256f"

7,0>

" *** Aborting Test List due to severe error ***"

++++++++++++++++++++++++++++++++++++++++++++++

7,0>RESET INFO for CPU/Memory board in slot 7

"7,0>AC ESR 00000000.00000021 INCWS UPA_A_ERR"

"7,0>DC[0] 00"

"7,0>DC[1] 00"

"7,0>DC[2] 00"

"7,0>DC[3] 00"

"7,0>DC[4] 00"

"7,0>DC[5] 00"

"7,0>DC[6] 00"

"7,0>DC[7] 00"

"7,0>FHC CSR 00050200 LOC_FATAL SYNC NOT_BRD_PRES"

"7,0>FHC RCSR 02000000 FATAL"

7,0>

Can some one Help me whit som Ideas what to do!!!

[1583 byte] By [Traco] at [2007-11-25 22:47:57]
# 1
Replace I/O Board in slot 1. Hopefully it will have newer firmware than x.x.24. RePOST to see if CPU14 is also having problems. You could have 2 issues. I can't tell if system came up on CPU15 because POST output was truncated.
jds2n at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 2
ThanksOk Ill Try,Got a Spare I/O.New Post in 2 HourBye the way WHY "Hopefully it will have newer firmware than x.x.24."?
Traco at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 3
7,0>Displaying PROM Versions7,0>Slot 1 IO Type 1FCODE 1.8.24 1999/12/23 17:29 iPOST 3.4.24 1999/12/23 17:34 <Ancient Firmware. Latest is x.x.30 Your CPU/Mem Bd has x.x.297,0>Slot 7 CPU/MemoryOBP3.2.29 2001/6/18 17:28 POST 3.9.29 2001/6/18 17:50
jds2n at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 4

Hmm

after 5 Hourst i turnd on my Hyper term and see that i got a

Extended Post Menu?

Frankly i dont have a clue.

was Testing all the Menus and then after a while the

system Hang again.

Log= EPost01

Then i Change The I/O Board, (whit out Changin the Nvram)

(Becurce this Last Board ist the Orig Board and i didnt change it that time)well how can read can also Learn :-/

Turning SystemKey off Diag and Waiting...

Hang=

7,0>TESTING IO BOARD 1

7,0>Board 1 I/O FPROM Test

7,0>I/O Board EPROM checksum Test

7,0>ERROR: TEST=I/O FPROM,SUBTEST=I/O Board EPROM checksum ID=a.1

7,0>Component under test: Board 1 Firehose PROM U2501

7,0>POST/iPOST PROM Checksum Error

checksum from prom header = e56f

computed segment checksum = 256f

7,0> *** Aborting Test List due to severe error ***

Looking like the same Error as on the First Kard. :-(

Traco at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 5
The IPost is =7,0>Slot 1 IO Type 1FCODE 1.8.24 1999/12/23 17:29 iPOST 3.4.24 1999/12/23 17:34 The Same :-(Can i update It?well wel the whole Log isCapture03Looks Like i have to get me a nother I/O Boardbut were?
Traco at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 6

Both CPUs 14 & 15 fail while testing I/O board in Slot 1. I'm not quite sure what you mean about not replacing NVRAM. The NVRAM is on the Clock Board. The speed of the CPU's & the system speed seem to say that this is an E3000 which would have 4 slots (1,3,5&7) & 10 SCSI disks while an E3500 would have 5 slots (1,3,5,7&9) & 8 Fibre disks. You cannot upgrade the firmware until you can get it to pass POST. The whole system can get upgraded using patch 103346-30 but you'll have to boot first. If you had another, good, I/O board. you could copy it's firmware to this board.

jds2n at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 7

Hi and Thanks so far,

Frankly im Newbe to this stuff so i think i do a lots of misstakes.

After reading in some manuals i saw that when you Replace the I/O you schud take the NVRAM Modul from the old I/O and put it on the New I/O.

My Errors Started whit CPU Board Error.

I Replaced the Board and then i got this

TOD dont Matched and the System was giving the output

waiting for ARP/RARP.

My view of What went (did i do ) wrong?

I think that i Replaced the CPU Board whit another whit

Newer Code xx.29

then i get this Error whit the I/O TOD Dont matched.

Im starting to think that i have Replaced a Good CPU Board,

and now the Firmware playing whit me.

Think i will Try to Replace all Back again to Default.

I got 4 HDD in my System but it looks like the system cant fine them? on witch Board are they Attached?

Is it Possible from Extended Post to do the Update?

Thanks again for Looking into it.

Jan

Traco at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 8

Hi Again,

Have been doing some reconstrucktion and now

i shud have the Orig HW.

But as Always i get this (Key=Diag)

7,0>TESTING IO BOARD 1

7,0>Board 1 I/O FPROM Test

7,0>I/O Board EPROM checksum Test

7,0>ERROR: TEST=I/O FPROM,SUBTEST=I/O Board EPROM checksum ID=a.1

7,0>Component under test: Board 1 Firehose PROM U2501

7,0>POST/iPOST PROM Checksum Error

checksum from prom header = e56f

computed segment checksum = 256f

7,0>

*** Aborting Test List due to severe error ***

7,0>@(#) iPOST 3.4.24 1999/12/23 17:34

++++++++++++++++++++++++++++++++++++++++++++++

7,0>Board 1 OnBoard IO Chipset (FEPS) Test

7,0>FAS366 Registers Test

7,0>ESP FAS366 DVMA burst mode read/write Test

7,0>FAS366 FIFO TO DMA Test

7,0>DMA TO FAS366 FIFO Test

7,0>FEPS (Ethernet) Registers Test

7,0>Data Access Error from address 00000000.00000200

7,0> tl tt tstatetpctnpc

7,0> 01 32 00000000.15001600 000001ff.f0088000 000001ff.f0088010

7,0>AFSR 00000000.88000000 AFAR 00000000.00000200

7,0>(PRIV) Privileged Code

7,0>(TO) Time Out Error

7,0>FATAL ERROR

7,0>At time of error: POST was testing Board 1 FEPS ASIC U3600

7,0>Diagnosis: Board 7, software, any system board

7,0>Log Date: Nov 17 13:43:39 GMT 2005

7,0>

7,0>RESET INFO for CPU/Memory board in slot 7

7,0>AC ESR 00000000.00000021 INCWS UPA_A_ERR

7,0>DC[0] 00

7,0>DC[1] 00

7,0>DC[2] 00

7,0>DC[3] 00

7,0>DC[4] 00

7,0>DC[5] 00

7,0>DC[6] 00

7,0>DC[7] 00

7,0>FHC CSR 00050200 LOC_FATAL SYNC NOT_BRD_PRES

7,0>FHC RCSR 02000000 FATAL

7,1>

7,0>

7,1>@(#) POST 3.9.29 2001/06/18 17:50

++++++++++++++++++++++++++++++++++++++++++++++

7,1>Data Access Error from address 00000000.00000200

7,1> tl tt tstatetpctnpc

7,1> 01 32 00000000.15001600 000001ff.f0088000 000001ff.f0088010

7,1>AFSR 00000000.88000000 AFAR 00000000.00000200

7,1>(PRIV) Privileged Code

7,1>(TO) Time Out Error

++++++++++++++++++++++++++++++++++++++++++++++

7,0>Board 1 OnBoard IO Chipset (FEPS) Test

7,0>FAS366 Registers Test

7,0>ESP FAS366 DVMA burst mode read/write Test

7,0>FAS366 FIFO TO DMA Test

7,0>DMA TO FAS366 FIFO Test

7,0>FEPS (Ethernet) Registers Test

7,0>Data Access Error from address 00000000.00000200

7,0> tl tt tstatetpctnpc

7,0> 01 32 00000000.15001600 000001ff.f0088000 000001ff.f0088010

7,0>AFSR 00000000.88000000 AFAR 00000000.00000200

7,0>(PRIV) Privileged Code

7,0>(TO) Time Out Error

7,0>FATAL ERROR

7,0>At time of error: POST was testing Board 1 FEPS ASIC U3600

7,0>Diagnosis: Board 7, software, any system board

7,0>Log Date: Nov 17 13:58:57 GMT 2005

7,0>

7,0>RESET INFO for CPU/Memory board in slot 7

7,0>AC ESR 00000000.00000021 INCWS UPA_A_ERR

7,0>DC[0] 00

7,0>DC[1] 00

7,0>DC[2] 00

7,0>DC[3] 00

7,0>DC[4] 00

7,0>DC[5] 00

7,0>DC[6] 00

7,0>DC[7] 00

7,0>FHC CSR 00050200 LOC_FATAL SYNC NOT_BRD_PRES

7,0>FHC RCSR 02000000 FATAL

Im not able to stop in OBP?

Ctrl+A Dont work?

Traco at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 9

I Set the Key to Off and in Normal On

then i got This OutPut:

Hardware Power ON

Clock board TOD does not match TOD on any IO board.

***FCode S_Bus LP9002s (lpfs) Version 2.31a5***

***FCode S_Bus LP9002s (lpfs) Version 2.31a5***

Clock board TOD does not match TOD on any IO board.

***FCode S_Bus LP9002s (lpfs) Version 2.31a5***

***FCode S_Bus LP9002s (lpfs) Version 2.31a5***

4-slot Sun Enterprise 3000, No Keyboard

OpenBoot 3.2.29, 1280 MB memory installed, Serial #8516096.

Copyright 2001 Sun Microsystems, Inc. All rights reserved

Ethernet address 8:0:20:81:f2:0, Host ID: 8081f200.

Boot device: net File and args:

Timeout waiting for ARP/RARP packet

Timeout waiting for ARP/RARP packet

Timeout waiting for ARP/RARP packet

Timeout waiting for ARP/RARP packet

Timeout waiting for ARP/RARP packet

Timeout waiting for ARP/RARP packet

Timeout waiting for ARP/RARP packet

Timeout waiting for ARP/RARP packet

Timeout waiting for ARP/RARP packet

Timeout waiting for ARP/RARP packet

And nothing happen ,

cant do Break to OBP,

Does some 1 know how to ower Hyperterm?

Traco at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 10

Make sure your Keyswitch is not in DIAG position as most probably your diag-device is net so you are attempting to boot from net:

Boot device: net File and args:

You will need to discover what your Hypeterm "Break" sequence is for Laptop / Comuter Hyperterm is running on. Most likely the [Break} key is pat of it but is it Shift + Break or Ctrl+Break or Alt+Break or maybe even a Function Key without Break. Once you see the banner & Boot Device you have about 5 seconds to Break it to the ok prompt before all keystrokes are ignored and it attempts to boot.

You can safely ignore, for now:

"Clock board TOD does not match TOD on any IO board" since you can't get to ok prompt to fix it with:

ok copy-clock-tod-to-io-boards

Once you get to ok prompt you need to:

ok setenv auto-boot? false

ok reset-all

Then you can put keyswitch back into DIAG and watch POST run again, if needed.

"I got 4 HDD in my System but it looks like the system cant fine them?"

The internal disks are controlled by the onboard SCSI FAS chip on the I/O Board in slot 1 via internal wiring from Slot 1 Backplane to SCSI disks. You must have a good I/O board in Slot 1 to control internal disks & CD

jds2n at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 11

Super Thanks,

i now got OBP :-)

have done a Copy Clock and setenv. and reset.

Then i was thinking.

I have 2x SF6800 working and i had a MemoryModule error.

I replaced the faulty Memory and stil got this error.

after reading i found out that i have to do a "set Keyswitch" to "discower" this module new.

Maybe i got this on this System too.

My HW are back in "old status" and whit the new I/O

i dint come so fahr as OBP.

Ill put back the New I/O and Try to get OBP and do a

Reset -all

************************************************************

Thank you so fahr

i got the Patch,:-)

Now i Wounder how do i get it on the system?

Coud you post the Commands ?

Pleace Pleace

************************************************************

Traco at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 12

Traco,

There is a README file inside the zip download.

Your flash-update instructions are there.

You will have to wait until the system can boot with success, because the OBP binary update file is commonly copied to the root of the boot drive..

A memory error has no relationship with what you have shown us for this Enterprise Server.You have an old system and various parts of it are finally just failing. ( I am guessing that you need a new I/O board #1 and a new cpu board #7 at the very least.)

How important is this computer?If it is a production server, you may have to consider replacing it completely.The cost of repairing it may be too high.

Bill at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...
# 13

Hi Bill,

Looks that you have Right,

I Replaced First the CPU Board.

(But buil in the Old one just to see the Diffrence)

(Learning by Doing)

I allso got a new I/O Board but, this i think have a

Eprom Error too.

Replacing the hole System?

Yes its Probobly the best solution.

Do you have enny Idea what i shoud by?

It have to be 2x Cpu whit the same ore more Speed.

I have a Fiberschannel Module in it, can i put this module in a Newer System.?

Pleace do Reckomend a System 4 me.

dont have to be the newes one.

I have a Appl. that have to run 6 more mnd and then its replaced.

Traco at 2007-7-5 17:03:20 > top of Java-index,Sun Hardware,Servers - General Discussion...