From owner-freebsd-fs@FreeBSD.ORG Sun Aug 2 09:30:00 2009 Return-Path: Delivered-To: freebsd-fs@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id C90E41065670; Sun, 2 Aug 2009 09:30:00 +0000 (UTC) (envelope-from pjd@garage.freebsd.pl) Received: from mail.garage.freebsd.pl (chello087206049004.chello.pl [87.206.49.4]) by mx1.freebsd.org (Postfix) with ESMTP id 29E978FC17; Sun, 2 Aug 2009 09:29:59 +0000 (UTC) (envelope-from pjd@garage.freebsd.pl) Received: by mail.garage.freebsd.pl (Postfix, from userid 65534) id BFEE645CAC; Sun, 2 Aug 2009 11:29:57 +0200 (CEST) Received: from localhost (abib242.neoplus.adsl.tpnet.pl [83.7.117.242]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mail.garage.freebsd.pl (Postfix) with ESMTP id 25DD145C98; Sun, 2 Aug 2009 11:29:52 +0200 (CEST) Date: Sun, 2 Aug 2009 11:30:16 +0200 From: Pawel Jakub Dawidek To: Juergen Unger Message-ID: <20090802093016.GB3071@garage.freebsd.pl> References: <20090727072503.GA52309@jpru.ffm.jpru.de> <4A6E06E6.9030300@mail.zedat.fu-berlin.de> <4A6EC9E2.5070200@icyb.net.ua> <20090729084723.GD1586@garage.freebsd.pl> <20090802092714.GA5813@jpru.ffm.jpru.de> Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="3MwIy2ne0vdjdPXF" Content-Disposition: inline In-Reply-To: <20090802092714.GA5813@jpru.ffm.jpru.de> User-Agent: Mutt/1.4.2.3i X-PGP-Key-URL: http://people.freebsd.org/~pjd/pjd.asc X-OS: FreeBSD 8.0-CURRENT i386 X-Spam-Checker-Version: SpamAssassin 3.0.4 (2005-06-05) on mail.garage.freebsd.pl X-Spam-Level: ** X-Spam-Status: No, score=2.5 required=4.5 tests=BAYES_00,RCVD_IN_SORBS_DUL, RCVD_IN_XBL autolearn=no version=3.0.4 Cc: freebsd-fs@FreeBSD.org, "O. Hartmann" , freebsd-current@FreeBSD.org, spambox@haruhiism.net, Andriy Gapon Subject: Re: zfs: Fatal trap 12: page fault while in kernel mode X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 02 Aug 2009 09:30:01 -0000 --3MwIy2ne0vdjdPXF Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Sun, Aug 02, 2009 at 11:27:14AM +0200, Juergen Unger wrote: > Hi Pawel, >=20 > On Wed, Jul 29, 2009 at 10:47:23AM +0200, Pawel Jakub Dawidek wrote: > > On Tue, Jul 28, 2009 at 12:50:26PM +0300, Andriy Gapon wrote: > > > on 27/07/2009 22:58 O. Hartmann said the following: > > > > Juergen Unger wrote: > > > [snip] > > > >>> _sx_xlock(3c,0,874aa28d,70f,8ae9a9f8,...) at _sx_xlock+0x43 > > > >>> dmu_buf_update_user(0,8ae9a9f8,0,0,0,...) at dmu_buf_update_user+= 0x35 > > > >>> zfs_znode_dmu_fini(8ae9a9f8,874b312d,1114,110b,879ab000,...) at z= fs_znode_dmu_f3 > > > >>> zfs_freebsd_reclaim(fcd29c3c,1,0,8ec63754,fcd29c60,...) at zfs_fr= eebsd_reclaim+0 > > > >>> VOP_RECLAIM_APV(874b65a0,fcd29c3c,0,0,8ec637c8,...) at VOP_RECLAI= M_APV+0xa5 > > > >>> vgonel(8ec637c8,0,80c77037,386,0,...) at vgonel+0x1a4 > > > >>> vnlru_free(80f2a0f0,0,80c77037,300,3e8,...) at vnlru_free+0x2d5 > > > >>> vnlru_proc(0,fcd29d38,80c652bc,33e,871932a8,...) at vnlru_proc+0x= 80 > > > >>> fork_exit(8090d960,0,fcd29d38) at fork_exit+0xb8 > > > >>> fork_trampoline() at fork_trampoline+0x8 > >[snip] > > > P.S. I see that zfs_inactive checks for z_dbuf being NULL and there i= s the > > > following comment: > > > /* > > > * The fs has been unmounted, or we did a > > > * suspend/resume and this file no longer exists. > > > */ > > > Maybe zfs_freebsd_reclaim should do the same? > >=20 > > Yes, you might be right. > >=20 > > Could you guys, who can reproduce it, try this patch: > >=20 > > http://people.freebsd.org/~pjd/patches/zfs_vnops.c.2.patch >=20 > I tried the patch, restarted the whole thing yesterday morning > and after less then 24 hours and approximately 3215 zfs-receive > jobs it do not crashes anymore, but the last started zfs-receive > jobs is hanging, cannot be killed, even not with -9. Even other > zfs commands are hanging and cannot be killed, while zpool commands > seems to be not affected. >=20 > root 86397 0.0 0.0 3920 1308 ?? D 3:18AM 0:00.29 zfs receive= -Fv zzzz/203 > root 5001 0.0 0.0 3920 1208 0 D+ 10:45AM 0:00.00 zfs list -t= snapshot > root 5477 0.0 0.0 3920 1240 3 D+ 11:08AM 0:00.00 zfs list >=20 > also the sync command I tried to execute hangs forever: >=20 > root 5457 0.0 0.0 1528 492 2- D+ 11:05AM 0:00.04 sync >=20 > Other parts of the system which do not have something todo with zfs > are still working well. I will leave the machine running in this > state, is there something I can do to retrieve other usefull information > for you? If you can break into debugger and send me 'show alltrace' for starters. --=20 Pawel Jakub Dawidek http://www.wheel.pl pjd@FreeBSD.org http://www.FreeBSD.org FreeBSD committer Am I Evil? Yes, I Am! --3MwIy2ne0vdjdPXF Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.4 (FreeBSD) iD8DBQFKdVyoForvXbEpPzQRAi8dAKCEpGhqHjJvGVa3Bw0iyvNiElA7MQCg2HvH CIaaGbnB7hW+rlFIBoNdCgQ= =IyTt -----END PGP SIGNATURE----- --3MwIy2ne0vdjdPXF--