zfs-discuss@opensolaris.org
[Top] [All Lists]

[zfs-discuss] uncorrectable error during zfs send; what are the right ne

Subject: [zfs-discuss] uncorrectable error during zfs send; what are the right next steps?
From: Matt Ingenthron
Date: Sat, 22 Mar 2008 18:30:43 PDT
Hi all,

I'm migrating to a new laptop from one which has had hardware issues lately.  I 
kept my home directory on zfs, so in theory it should be straightforward to 
send/receive, but I've had issues.  I've moved the disk out of the faulty 
system, though I saw the same issue there.

The behavior I have is the zfs send will start and work up to a point (about 
792MByte), but then all zfs activity with the source pool/filesystems will 
hang.  When I was doing it across the network, the sender would have all 
zpool/zfs commands hang after hitting this point, and now that both disks are 
on the same system, zpool/zfs commands having to do with the sending pool hang 
once it gets to this point.

Is there anything I can do to recover from this condition?  Near as I can tell, 
I can mount the filesystem and access things, so I could try just a bulk copy, 
but I'd like to know why this won't work.  Is there any data I can gather which 
may help identify the cause?  

It seems that the zpool/zfs commands working with that pool shouldn't hang, 
even if one zfs send/receive is hitting this condition.

I did some searches for bugs, but didn't find anything helpful.

Details:
snv_79b

imported pool: "oldspace"

messages:
# tail /var/adm/messages
Mar 22 17:28:36 hancock genunix: [ID 936769 kern.info] fssnap0 is 
/pseudo/fssnap@0
Mar 22 17:28:36 hancock pseudo: [ID 129642 kern.info] pseudo-device: winlock0
Mar 22 17:28:36 hancock genunix: [ID 936769 kern.info] winlock0 is 
/pseudo/winlock@0
Mar 22 17:28:36 hancock pseudo: [ID 129642 kern.info] pseudo-device: pm0
Mar 22 17:28:36 hancock genunix: [ID 936769 kern.info] pm0 is /pseudo/pm@0
Mar 22 17:28:36 hancock ipf: [ID 774698 kern.info] IP Filter: v4.1.9, running.
Mar 22 17:28:36 hancock rdc: [ID 517869 kern.info] @(#) rdc: built 20:52:37 Nov 
27 2007
Mar 22 17:28:36 hancock pseudo: [ID 129642 kern.info] pseudo-device: rdc0
Mar 22 17:28:36 hancock genunix: [ID 936769 kern.info] rdc0 is /pseudo/rdc@0
Mar 22 17:49:13 hancock zfs: [ID 664491 kern.warning] WARNING: Pool 'oldspace' 
has encountered an uncorrectable I/O error. Manual intervention is required.

no messages from fmadm
# fmadm faulty
# 


command: 
# zfs send oldspace/home/mi109165@laptopmigration | zfs receive 
space/homes/mi109165

output from zdb:
oldspace
    version=3
    name='oldspace'
    state=0
    txg=3360658
    pool_guid=986377251057668768
    hostid=630972017
    hostname='hancock'
    vdev_tree
        type='root'
        id=0
        guid=986377251057668768
        children[0]
                type='disk'
                id=0
                guid=5072910331796803983
                path='/dev/dsk/c3t0d0s3'
                devid='id1,sd@f259bde7147e5a411000cc9950000/d'
                phys_path='/pci@0,0/pci1179,1@1a,7/storage@1/disk@0,0:d'
                whole_disk=0
                metaslab_array=14
                metaslab_shift=28
                ashift=9
                asize=32221691904
                is_log=0
                DTL=110
space
    version=10
    name='space'
    state=0
    txg=4
    pool_guid=2349984716539065036
    hostid=630972017
    hostname='hancock'
    vdev_tree
        type='root'
        id=0
        guid=2349984716539065036
        children[0]
                type='disk'
                id=0
                guid=11913572021826904359
                path='/dev/dsk/c1t0d0s7'
                devid='id1,sd@f00000000479cc692000ab26a0001/h'
                phys_path='/pci@0,0/pci1179,1@1f,2/disk@0,0:h'
                whole_disk=0
                metaslab_array=14
                metaslab_shift=29
                ashift=9
                asize=58098450432
                is_log=0

# iostat -E
sd0       Soft Errors: 172 Hard Errors: 0 Transport Errors: 0 
Vendor: ATA      Product: FUJITSU MHW2120B Revision: 0013 Serial No:  
Size: 120.03GB <120034123776 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 
Illegal Request: 172 Predictive Failure Analysis: 0 
sd1       Soft Errors: 0 Hard Errors: 409 Transport Errors: 0 
Vendor: MATSHITA Product: DVD-RAM UJ-852S  Revision: 1.00 Serial No:  
Size: 0.00GB <0 bytes>
Media Error: 0 Device Not Ready: 409 No Device: 0 Recoverable: 0 
Illegal Request: 0 Predictive Failure Analysis: 0 
sd3       Soft Errors: 305 Hard Errors: 0 Transport Errors: 0 
Vendor: ST912082 Product: 1A               Revision: 0014 Serial No:  
Size: 120.03GB <120034123776 bytes>
Media Error: 0 Device Not Ready: 0 No Device: 0 Recoverable: 0 
Illegal Request: 305 Predictive Failure Analysis: 0 



# zpool status space
  pool: space
 state: ONLINE
 scrub: none requested
config:

        NAME        STATE     READ WRITE CKSUM
        space       ONLINE       0     0     0
          c1t0d0s7  ONLINE       0     0     0

errors: No known data errors
# zpool status oldspace
^C
(process not responding...)

# zfs list -r space
NAME                       USED  AVAIL  REFER  MOUNTPOINT
space                     1.39G  51.8G    19K  /space
space/homes               1.39G  51.8G    18K  /space/homes
space/homes/mi109165       792M  51.8G   792M  /space/homes/mi109165
space/homes/new-mi109165   630M  51.8G   630M  /new-export/home/mi109165
# zfs list -r oldspace


^C
(process not responding....)


Thanks in advance for any help/pointers,

- Matt
 
 
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@xxxxxxxxxxxxxxx
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

<Prev in Thread] Current Thread [Next in Thread>