error with UFSDUMP Solaris - disk not responding to selection - dad1: disk

Over the last few days I made a level 0 backup of my Solaris 8 SPARC server using ufsdump 0uf /dev/rmt/0 /slicename. Each slice was copied correctly, with the exception of my /oracle partition (slice), for which the following errors are logged:

-

Jul 8 11:54:58 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:54:58 server1 Uncorrectable data Error: Block 13e6490

Jul 8 11:55:00 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:55:00 server1 disk not responding to selection

Jul 8 11:55:04 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:55:04 server1 Uncorrectable data Error: Block 13e64b0

Jul 8 11:55:10 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:55:10 server1 Uncorrectable data Error: Block 13e64a0

Jul 8 11:55:12 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:55:27 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:55:27 server1 ID not found

Jul 8 11:55:29 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:55:29 server1 disk not responding to selection

Jul 8 11:55:29 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:56:02 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:56:02 server1 Uncorrectable data Error: Block 13e64a2

Jul 8 11:56:04 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:56:04 server1 disk not responding to selection

Jul 8 11:56:05 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:56:16 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:56:16 server1 Uncorrectable data Error: Block 13e64a3

Jul 8 11:56:18 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:56:18 server1 disk not responding to selection

Jul 8 11:56:18 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:56:33 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:56:33 server1 Uncorrectable data Error: Block 13e64a7

Jul 8 11:56:35 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:56:35 server1 disk not responding to selection

Jul 8 11:56:35 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:56:43 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:56:43 server1 Uncorrectable data Error: Block 13e64a8

Jul 8 11:56:45 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:56:45 server1 disk not responding to selection

Jul 8 11:56:45 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:57:01 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:01 server1 ID not found

Jul 8 11:57:02 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:02 server1 disk not responding to selection

Jul 8 11:57:02 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:57:11 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:11 server1 Uncorrectable data Error: Block 13e64ac

Jul 8 11:57:12 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:12 server1 disk not responding to selection

Jul 8 11:57:17 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:17 server1 ID not found

Jul 8 11:57:19 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:57:23 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:23 server1 Uncorrectable data Error: Block 13e64af

Jul 8 11:57:25 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:25 server1 disk not responding to selection

Jul 8 11:57:25 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:57:30 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:30 server1 Uncorrectable data Error: Block 13f51d0

Jul 8 11:57:32 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:32 server1 disk not responding to selection

Jul 8 11:57:32 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:57:36 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:36 server1 Uncorrectable data Error: Block 13f51e0

Jul 8 11:57:38 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:38 server1 disk not responding to selection

Jul 8 11:57:38 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:57:38 server1 sendmail[293]: [ID 801593 mail.crit] NOQUEUE: SYSERR(root): getrequests: accept: Software caused connection abort

Jul 8 11:57:44 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:44 server1 Uncorrectable data Error: Block 13f51e3

Jul 8 11:57:46 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:46 server1 disk not responding to selection

Jul 8 11:57:46 server1 dada: [ID 107833 kern.notice] dad1: disk okay

Jul 8 11:57:56 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:56 server1 disk not responding to selection

Jul 8 11:57:59 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):

Jul 8 11:57:59 server1 Uncorrectable data Error: Block 13f51d7

Jul 8 11:58:01 server1 dada: [ID 107833 kern.notice] dad1: disk okay

-

I have tried writing to another tape, and also erasing the tape, but the problem persists.

This error did not occur in any of my previous backups. I ran fsck to check the disk for problems, and it did not find any. Until this problem appeared, I had successfully backed up all partitions of my Solaris SPARC server with ufsdump.

The output of the command iostat -E is:

--

dad0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 17

Model: ST320414A Revision: 3.28 Serial No: 3EC191B0

Size: 20.40GB <20403339264 bytes>

Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0

Illegal Request: 0

dad1 Soft Errors: 0 Hard Errors: 0 Transport Errors: 17

Model: ST320420A Revision: 3.21 Serial No: 3CL152CD

Size: 20.40GB <20403339264 bytes>

Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0

Illegal Request: 0

sd0 Soft Errors: 0 Hard Errors: 2 Transport Errors: 0

Vendor: LG Product: CD-ROM CRD-8483B Revision: 1.02 Serial No:

Size: 18446744073.71GB <-1 bytes>

Media Error: 0 Device Not Ready: 2 No Device: 0 Recoverable: 0

Illegal Request: 0 Predictive Failure Analysis: 0

st4 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0

Vendor: HP Product: C1537A Revision: L007 Serial No: 62

-

What does the log mean?

How can I solve this?

Thank you very much in advance.

Jimmy

[8432 byte] By [jimmy@hga] at [2007-11-27 10:49:33]
# 1

Disk not responding to selection generally indicates a hard drive error.
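To get an overview of a log like the one above, it can help to tally the dad driver messages and list the affected blocks. The following is only a rough Python sketch of my own, not a Solaris tool: the function name summarize and the sample lines are assumptions for illustration, and the block numbers are read as hexadecimal because they contain the digits a-f.

```python
import re
from collections import Counter

# A few lines copied from the log in the question (blank lines removed).
SAMPLE_LOG = """\
Jul 8 11:54:58 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):
Jul 8 11:54:58 server1 Uncorrectable data Error: Block 13e6490
Jul 8 11:55:00 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):
Jul 8 11:55:00 server1 disk not responding to selection
Jul 8 11:55:12 server1 dada: [ID 107833 kern.notice] dad1: disk okay
Jul 8 11:55:27 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):
Jul 8 11:55:27 server1 ID not found
Jul 8 11:57:30 server1 dada: [ID 107833 kern.warning] WARNING: /pci@1f,0/pci@1,1/ide@3/dad@1,0 (dad1):
Jul 8 11:57:30 server1 Uncorrectable data Error: Block 13f51d0
"""

# Message texts as they appear in the log above.
MESSAGES = (
    "Uncorrectable data Error",
    "disk not responding to selection",
    "ID not found",
    "disk okay",
)
BLOCK_RE = re.compile(r"Uncorrectable data Error: Block ([0-9a-fA-F]+)")


def summarize(log_text):
    """Return (message counts, sorted unique block numbers as integers)."""
    counts = Counter()
    blocks = set()
    for line in log_text.splitlines():
        for msg in MESSAGES:
            if msg in line:
                counts[msg] += 1
        match = BLOCK_RE.search(line)
        if match:
            # Block numbers in the log are hexadecimal.
            blocks.add(int(match.group(1), 16))
    return counts, sorted(blocks)


if __name__ == "__main__":
    counts, blocks = summarize(SAMPLE_LOG)
    for msg, n in counts.most_common():
        print(f"{n:3d}  {msg}")
    print("blocks:", ", ".join(f"{b} (0x{b:x})" for b in blocks))
```

Run against the full /var/adm/messages excerpt, this shows whether the failing blocks cluster in a small region of the disk or are scattered, which is one more hint when deciding between a media problem and a cabling or controller problem.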

The fact that these errors are showing up as transport errors in the iostat -E output indicates it could be a SCSI bus or IDE chain problem.

It could be the cables, the termination, or the controller. Or it could be the drive electronics.
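To compare the error counters across devices, the summary lines of iostat -E can be pulled apart with a small script. This is a minimal Python sketch under my own assumptions: the function name error_counters is made up, the sample text is just the four summary lines from the question, and the pattern only covers the "Soft Errors / Hard Errors / Transport Errors" line, not the rest of the output.

```python
import re

# Summary lines taken from the iostat -E output in the question.
SAMPLE_IOSTAT = """\
dad0 Soft Errors: 0 Hard Errors: 0 Transport Errors: 17
dad1 Soft Errors: 0 Hard Errors: 0 Transport Errors: 17
sd0 Soft Errors: 0 Hard Errors: 2 Transport Errors: 0
st4 Soft Errors: 0 Hard Errors: 0 Transport Errors: 0
"""

# Tolerates a missing space between the device name and "Soft Errors".
SUMMARY_RE = re.compile(
    r"^(\w+?)\s*Soft Errors:\s*(\d+)\s*Hard Errors:\s*(\d+)"
    r"\s*Transport Errors:\s*(\d+)",
    re.MULTILINE,
)


def error_counters(iostat_text):
    """Map each device name to its soft, hard and transport error counts."""
    result = {}
    for dev, soft, hard, transport in SUMMARY_RE.findall(iostat_text):
        result[dev] = {
            "soft": int(soft),
            "hard": int(hard),
            "transport": int(transport),
        }
    return result


if __name__ == "__main__":
    for dev, c in error_counters(SAMPLE_IOSTAT).items():
        flag = "  <-- transport errors" if c["transport"] else ""
        print(f"{dev}: soft={c['soft']} hard={c['hard']} "
              f"transport={c['transport']}{flag}")
```

In the output posted above, dad0 and dad1 both report 17 transport errors. Since both disks show the same count, it may be worth checking what they share (cable and controller) before replacing a drive, though that is only a guess from these numbers.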

robert.cohena at 2007-7-29 11:19:59