From: Luke Marsden
To: Martin Matuska
Cc: freebsd-fs@freebsd.org, tech@hybrid-logic.co.uk
Subject: Re: ZFS hang in production on 8.2-RELEASE
Date: Mon, 29 Aug 2011 23:02:29 +0100
Organization: Hybrid Logic
Message-ID: <1314655349.7898.53.camel@pow>
In-Reply-To: <4E5BFC6F.5080507@FreeBSD.org>
References: <1314646728.7898.44.camel@pow> <4E5BFC6F.5080507@FreeBSD.org>

On Mon, 2011-08-29 at 22:54 +0200, Martin Matuska wrote:
> >> procstat -kk on the zfskern process gives:
> >>
> >> PID TID COMM TDNAME
> >> KSTACK
> >> 5 100045 zfskern arc_reclaim_thre mi_switch+0x176
> >> sleepq_timedwait+0x42 _cv_timedwait+0x134 arc_reclaim_thread+0x2a9
> >> fork_exit+0x118 fork_trampoline+0xe
> >> 5 100046 zfskern l2arc_feed_threa mi_switch+0x176
> >> sleepq_timedwait+0x42 _cv_timedwait+0x134 l2arc_feed_thread+0x1ce
> >> fork_exit+0x118 fork_trampoline+0xe
> >> 5 100098 zfskern txg_thread_enter mi_switch+0x176
> >> sleepq_wait+0x42 _cv_wait+0x129 txg_thread_wait+0x79 txg_quiesce_thread+0xb5
> >> fork_exit+0x118 fork_trampoline+0xe
> >> 5 100099 zfskern txg_thread_enter mi_switch+0x176
> >> sleepq_timedwait+0x42 _cv_timedwait+0x134 txg_thread_wait+0x3c
> >> txg_sync_thread+0x365 fork_exit+0x118 fork_trampoline+0xe
> >>
> >> Any ideas on what might be causing this?
> > It sounds like the bug Martin Matuska has recently fixed in FreeBSD
> > and reported upstream to Illumos:
> > https://www.illumos.org/issues/1313
> >
> > The fix has been MFC'ed to 8-STABLE r224647 on Aug 4th.
> >
> > --Artem
> No, I think this is more likely fixed by pjd's bugfix in r224791 (MFC'ed
> to stable/8 as r225100).
>
> The corresponding patch is:
> http://people.freebsd.org/~pjd/patches/zfsdev_state_lock.patch
>

Great, thanks! Will this patch apply to ZFS v15? We can't upgrade to
v28 yet.

-- 
Best Regards,
Luke Marsden
CTO, Hybrid Logic Ltd.

Web: http://www.hybrid-cluster.com/
Hybrid Web Cluster - cloud web hosting

Mobile: +1-415-449-1165 (US) / +447791750420 (UK)