Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 18 Aug 2009 08:57:21 -0400
From:      Boris Kochergin <spawk@acm.poly.edu>
To:        Pawel Jakub Dawidek <pjd@FreeBSD.org>
Cc:        freebsd-fs@freebsd.org
Subject:   Re: ZFS RAID-Z panic on vdev failure + subsequent panics and hangs
Message-ID:  <4A8AA531.2000004@acm.poly.edu>
In-Reply-To: <4A81CF20.7010108@acm.poly.edu>
References:  <4A78AFB2.10103@acm.poly.edu>	<20090805115621.GG1784@garage.freebsd.pl>	<4A798A12.4070408@acm.poly.edu>	<20090807073738.GA1607@garage.freebsd.pl>	<20090807074400.GB1607@garage.freebsd.pl>	<4A7C3002.8000003@acm.poly.edu>	<20090807191334.GA1814@garage.freebsd.pl>	<4A7C81CA.2040303@acm.poly.edu>	<20090807193842.GA2487@garage.freebsd.pl>	<4A7C87C5.1070608@acm.poly.edu>	<20090807202756.GB2487@garage.freebsd.pl> <4A81CF20.7010108@acm.poly.edu>

next in thread | previous in thread | raw e-mail | index | archive | help
Boris Kochergin wrote:
> Pawel Jakub Dawidek wrote:
>> On Fri, Aug 07, 2009 at 04:00:05PM -0400, Boris Kochergin wrote:
>>  
>>> Pawel Jakub Dawidek wrote:
>>>    
>>>> On Fri, Aug 07, 2009 at 03:34:34PM -0400, Boris Kochergin wrote:
>>>>  
>>>>      
>>>>> Pawel Jakub Dawidek wrote:
>>>>>           
>>>>>> Yeah, that's strange indeed. Could you try:
>>>>>>
>>>>>>     print ab->b_arc_node.list_prev
>>>>>>     print ab->b_arc_node.list_next
>>>>>>
>>>>>>
>>>>>>                
>>>>> (kgdb) print ab->b_arc_node.list_prev
>>>>> $1 = (struct list_node *) 0x1
>>>>>            
>>>> Yeah, list_prev is corrupted. If it panics on you everytime, I could
>>>> send you a patch which will try to catch where the corruption occurs.
>>>>
>>>>  
>>>>       
>>> I eventually get the arc_evict panic every time I successfully 
>>> manage to mount the filesystem, but it usually panics (with the 
>>> other backtrace) as soon as I try to mount it, or mount just hangs. 
>>> I'll gladly try the patch, though--the data on the array is 
>>> important to me. Thanks.
>>>     
>>
>> To get the data from there you could also try to 'zfs send' it without
>> mounting the dataset at all (just in case).
>>
>>   
> Sorry for the delay. I had to find another machine to move the disks 
> into so that I could continue experimenting. Anyway, the filesystem 
> didn't have any snapshots I could send, so I tried creating one with 
> "zfs snapshot home@1" and the machine hung.
>
> FYI, In the new machine, all disks (including the one with the / 
> filesystem) retain their device names.
>
> -Boris
> _______________________________________________
> freebsd-fs@freebsd.org mailing list
> http://lists.freebsd.org/mailman/listinfo/freebsd-fs
> To unsubscribe, send any mail to "freebsd-fs-unsubscribe@freebsd.org"
Some more panics using RELENG_8 sources from yesterday: 
http://acm.poly.edu/~spawk/zfs/. The one in panic3.txt happens much more 
often than the other ones. If any brave soul wants to look into it, I 
can provide NFS/geom_gate/whatever access to the disk images (or actual 
disks, if there's a difference) so that they can recreate the problem on 
a local machine.

-Boris



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?4A8AA531.2000004>