Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 29 Jul 2009 12:32:13 +0200
From:      Thomas Backman <serenity@exscape.org>
To:        Pawel Jakub Dawidek <pjd@FreeBSD.org>
Cc:        freebsd-fs@FreeBSD.org, FreeBSD current <freebsd-current@FreeBSD.org>, Andriy Gapon <avg@icyb.net.ua>
Subject:   Re: zfs: Fatal trap 12: page fault while in kernel mode
Message-ID:  <F4F82B3E-C119-40EF-9AA4-937052876D1E@exscape.org>
In-Reply-To: <20090729084723.GD1586@garage.freebsd.pl>
References:  <20090727072503.GA52309@jpru.ffm.jpru.de> <4A6E06E6.9030300@mail.zedat.fu-berlin.de> <4A6EC9E2.5070200@icyb.net.ua> <20090729084723.GD1586@garage.freebsd.pl>

next in thread | previous in thread | raw e-mail | index | archive | help

On Jul 29, 2009, at 10:47, Pawel Jakub Dawidek wrote:

> On Tue, Jul 28, 2009 at 12:50:26PM +0300, Andriy Gapon wrote:
>> on 27/07/2009 22:58 O. Hartmann said the following:
>>> Juergen Unger wrote:
>> [snip]
>>>>> _sx_xlock(3c,0,874aa28d,70f,8ae9a9f8,...) at _sx_xlock+0x43
>>>>> dmu_buf_update_user(0,8ae9a9f8,0,0,0,...) at dmu_buf_update_user 
>>>>> +0x35
>>>>> zfs_znode_dmu_fini(8ae9a9f8,874b312d,1114,110b,879ab000,...) at  
>>>>> zfs_znode_dmu_f3
>>>>> zfs_freebsd_reclaim(fcd29c3c,1,0,8ec63754,fcd29c60,...) at  
>>>>> zfs_freebsd_reclaim+0
>>>>> VOP_RECLAIM_APV(874b65a0,fcd29c3c,0,0,8ec637c8,...) at  
>>>>> VOP_RECLAIM_APV+0xa5
>>>>> vgonel(8ec637c8,0,80c77037,386,0,...) at vgonel+0x1a4
>>>>> vnlru_free(80f2a0f0,0,80c77037,300,3e8,...) at vnlru_free+0x2d5
>>>>> vnlru_proc(0,fcd29d38,80c652bc,33e,871932a8,...) at vnlru_proc 
>>>>> +0x80
>>>>> fork_exit(8090d960,0,fcd29d38) at fork_exit+0xb8
>>>>> fork_trampoline() at fork_trampoline+0x8
>> [snip]
>>>
>>> I see a similar problem on two SMP boxes (is your SMP?), but in my  
>>> case,
>>> it seems not to be ZFS related although I also use ZFS as /home  
>>> filesystem
>>
>> In this case this does seem to be caused by ZFS.
>>> From the backtrace we see that _sx_xlock() is called on bogus  
>>> struct sx pointer
>> (0x3c) and this is caused by dmu_buf_update_user() called with NULL  
>> first argument
>> (dmu_buf_t). Which means that znode_t z_dbuf was NULL - this could  
>> have been
>> caught by ASSERT in zfs_znode_dmu_fini if it were enabled.
>>
>> If you have the crash dump, then it would be interesting to examine  
>> znode_t
>> structure ('zp' argument) in zfs_znode_dmu_fini.
>>
>> P.S. I see that zfs_inactive checks for z_dbuf being NULL and there  
>> is the
>> following comment:
>> /*
>> * The fs has been unmounted, or we did a
>> * suspend/resume and this file no longer exists.
>> */
>> Maybe zfs_freebsd_reclaim should do the same?
>
> Yes, you might be right.
>
> Could you guys, who can reproduce it, try this patch:
>
> 	http://people.freebsd.org/~pjd/patches/zfs_vnops.c.2.patch
OFF TOPIC:
Due to similarities in the backtrace between this and a panic I've  
been seeing on exporting after zfs recv (see http://lists.freebsd.org/pipermail/freebsd-current/2009-July/009105.html 
  and also http://lists.freebsd.org/pipermail/freebsd-current/2009-July/009174.html 
  for a panics-every-time script) I tried this patch. Unfortunately, I  
still get the same panic (from vgonel() and up, it's the same, except  
for my typo in the linked email.)
Just thought I should point it out. Except for temporary storage  
problems when moving my data to ZFS, this panic is the last hurdle in  
not using FreeBSD for me. :/

Regards,
Thomas 



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?F4F82B3E-C119-40EF-9AA4-937052876D1E>