5.11+T2000 = SUN4-8000-0Y
I'm administering a 24-strand T2000 with Solaris 11.
The product notes for the T2000 indicate that, unless certain patches and kernel parameters are in place, the ALOM will report SUN4-8000-0Y faults, which indicate a catastrophic failure of the PCI Express system.
When I got this machine up and running back in June of 2006, I became aware of the problem and added the needed entries to /etc/system.
However, yesterday I was training a new user on the ALOM when we initiated a scheduled reboot of the machine. When the system came back online, we had several problems with the gigabit Ethernet cards, which reside on the PCI Express bus:
WARNING: pciex8086,105e - e1000g[0] : reset failed
WARNING: pciex8086,105e - e1000g[1] : reset failed
The ALOM showed a SUN4-8000-0Y fault, which I cleared with the service console's clearfault and clearasrdb commands.
I then ran a full POST, which was clean.
However, whenever the machine boots, it reports the same warnings about the Ethernet cards. The machine otherwise boots normally, except that it cannot communicate with devices on its PCI Express bus.
One unfortunate side effect of this problem is that I cannot easily move files to or from the machine, since my only access to it is through the service console.
Does anyone have any suggestions on how to resolve the problem?

