E450: Solaris10: Boot Hangs
Hi all,
I have two identical E450 boxes, both were working previously(a couple of months ago). I went to install Solaris 10 on them, one works fine; the other one hangs on booting from the CDROM or a DISK prepared on the other working E450.
Here is the output of the hung boot process:
SunOS Release 5.10 Version Generic_118833-17 64-bit
Copyright 1983-2005 Sun Microsystems, Inc. All rights reserved.
Use is subject to license terms.
/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\| /-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\| /-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\| /
Let me know how I can proceed to trouble shoot this.
Thx.
[793 byte] By [
Britanicus] at [2007-11-26 11:31:34]

# 2
Here is the Post:
Boot device: /pci@1f,4000/scsi@3/disk@0,0:a File and args: -v
The boot filesystem is logging.
The ufs log is empty and will not be used.
|Size: /-\|/-\|0x79a58+/-\|/-\|/-\|/-\|0x1c139+/-\|0x30413 Bytes
/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\| /-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\| /-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\| /-\|module /platform/sun4u/kernel/sparcv9/unix: text at [0x1000000, 0x1079a57] data at 0x1800000
module misc/sparcv9/krtld: text at [0x1079a58, 0x109269f] data at 0x184c550
module /platform/sun4u/kernel/sparcv9/genunix: text at [0x10926a0, 0x11e755f] data at 0x1851d80
module /platform/SUNW,Ultra-4/kernel/misc/sparcv9/platmod: text at [0x11e7560, 0x11e75e7] data at 0x18a3a80
module /platform/sun4u/kernel/cpu/sparcv9/SUNW,UltraSPARC-II: text at [0x11e7600, 0x11f3647] data at 0x18a41c0
SunOS Release 5.10 Version Generic_118833-17 64-bit
Copyright 1983-2005 Sun Microsystems, Inc. All rights reserved.
Use is subject to license terms.
/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\Ethernet address = 8:0:20:fe:de:0
|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\ |/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\ |/-\|/-\|/mem = 2097152K (0x80000000)
avail mem = 2081824768
root nexus = Sun Enterprise 450 (4 X UltraSPARC-II 400MHz)
pseudo0 at root
pseudo0 is /pseudo
scsi_vhci0 at root
scsi_vhci0 is /scsi_vhci
pcipsy0 at root: UPA 0x1f 0x4000
pcipsy0 is /pci@1f,4000
/pci@1f,4000/scsi@3 (glm0):
Rev. 3 Symbios 53c875 found.
PCI-device: scsi@3, glm0
glm0 is /pci@1f,4000/scsi@3
sd15 at glm0: target 0 lun 0
sd15 is /pci@1f,4000/scsi@3/sd@0,0
root on /pci@1f,4000/scsi@3/disk@0,0:a fstype ufs
PCI-device: ebus@1, ebus0
ebus0 is /pci@1f,4000/ebus@1
envctrl0 at ebus0: offset 14,600000
envctrl0 is /pci@1f,4000/ebus@1/SUNW,envctrl@14,600000
pseudo-device: dld0
dld0 is /pseudo/dld@0
se0 at ebus0: offset 14,400000
se0 is /pci@1f,4000/ebus@1/se@14,400000
su0 at ebus0: offset 14,3083f8
su0 is /pci@1f,4000/ebus@1/su@14,3083f8
su1 at ebus0: offset 14,3062f8
su1 is /pci@1f,4000/ebus@1/su@14,3062f8
# 3
> se0 is /pci@1f,4000/ebus@1/se@14,400000
> su0 at ebus0: offset 14,3083f8
> su0 is /pci@1f,4000/ebus@1/su@14,3083f8
> su1 at ebus0: offset 14,3062f8
> su1 is /pci@1f,4000/ebus@1/su@14,3062f8
I believe that 'su' is one of the serial drivers (not sure though...) You might want to verify that your terminal has the right settings, that you don't have something else plugged into serial ports other than for the console.
I suppose the problem could be not with the 'su' device, but with something past that point. On my 250 (not identical, but close), the next thing that happens at boot is a banner identifying the CPUs.
Can you go through a full POST in the OBP?
--
Darren
# 5
>But, I need the ttya port to work; so what is the next step?
pass
>also how can I generate the full POST?
As root, in the OS,
# eeprom "diag-device=disk" //else it'll try boot off the network, by default
# eeprom "diag-level=max
# eeprom "diag-switch?=true"
Then a _full_ power cycle (ie, init 5, then power-up -- not init 6).
When you come to power up, make sure you have the laptop or
other serial console connected and the Sun keyboard disconnected,
as LOTS of tests run before the video card is turned on (and, if you
have a keyboard connected, the E450 won't use the serial port).
All of those eeprom switches are persistent, btw, so booting will
take about 20-40 minutes each time you reboot.Once you no longer
need all these diagnostics,
# eeprom "diag-switch?=false"
will make it boot at a normal pace from the normal boot device.