This just happened to me on an sds mirrored drive, Oracle redo filesystem, e3500 with sol. 9 9/02.
I had shutdown all oracle databases and rebooted the system. There were no messages indicating a problem anywhere. Upon starting, a couple of the databases complained. Further investigation showed "I/O error" when ls -l the files and then the "unexpected free inode" message in the messages file.
The system had been happily running for 3 months with no problem.
fsck'ing the /redo eventually fixed this but I am concerned over this sudden problem with no warning.
Run iostat -e from time to time and see if errors are accumulating over time. Also, check your /var/adm/messages file and the console (assuming you have one) for messages related to disk problems.
Are the files systems Oracle is using RAW or cooked ? If cooked, you might consider using "logging" as a mount option in the /etc/vfstab file for these file systems. Actually, use logging for all your files systems.
Hi & thank for your help,
Typing iostat -e, I get :
- errors
devices/w h/w trn tot
sd6 0101
ssd03000 30
ssd13000 30
ssd2 0000
ssd3 0000
ssd4 0000
ssd5 0000
ssd6 0000
ssd7 0000
ssd81500 15
ssd91500 15
st111410 15
nfs1 0000
What is the meaning of each field ? Manpage lacks definition for thoses.
Thanks again,
It could be. The software errors you are getting can easily be caused by a loss cable. I was messing around behind my errors on day and started noticing lots of software errors shortly after. Started poking around and I found a cable I had bumped. Pushed it back in and the errors disappeared.
First rule of diagnosis:
- Eliminate the hardware FIRST