Date: Tue, 9 Mar 2010 13:57:07 +0100 From: Borja Marcos <borjam@sarenet.es> To: Pawel Jakub Dawidek <pjd@FreeBSD.org> Cc: FreeBSD Stable <freebsd-stable@freebsd.org>, Stefan Bethke <stb@lassitu.de> Subject: Re: Many processes stuck in zfs Message-ID: <EC9BC6B4-8D0E-4FE3-852F-0E3A24569D33@sarenet.es> In-Reply-To: <20100309122954.GE3155@garage.freebsd.pl> References: <864468D4-DCE9-493B-9280-00E5FAB2A05C@lassitu.de> <20100309122954.GE3155@garage.freebsd.pl>
next in thread | previous in thread | raw e-mail | index | archive | help
On Mar 9, 2010, at 1:29 PM, Pawel Jakub Dawidek wrote: > On Tue, Mar 09, 2010 at 10:15:53AM +0100, Stefan Bethke wrote: >> Over the past couple of months, I've more or less regularly observed = machines having more and more processes stuck in the zfs wchan. The = processes never recover from that, and trying to reboot only gets the = entire system stuck, without any console messages. I can enter the = debugger, and I have saved a couple of dumps. >>=20 >> The situation seems to be triggered by zfs receive'ing snapshots from = the sister machine (both synchronize their active ZFS filesystems to = each other, using zfs send and zfs receive). It appears it's the = receiving causing trouble. >>=20 >> Both machines run 8-stable from mid-February, with a single-disk ZFS = pool, with ARC limited to 512M, prefetch and ZIL disabled via = loader.conf. >>=20 >> What should I be looking at to further diagnose? >=20 > What kind of hardware do you have there? There is 3-way deadlock I've = a > fix for which would be hard to trigger on single or dual core = machines. >=20 > Feel free to try the fix: >=20 > http://people.freebsd.org/~pjd/patches/zfs_3way_deadlock.patch Maybe related to the deadlock I reported when I was receiving an = incremental snapshot while the target dataset was being read? Borja.
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?EC9BC6B4-8D0E-4FE3-852F-0E3A24569D33>