From owner-freebsd-fs@FreeBSD.ORG Sun Oct 10 20:00:33 2010 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E0C02106564A for ; Sun, 10 Oct 2010 20:00:33 +0000 (UTC) (envelope-from wjw@digiware.nl) Received: from mail.digiware.nl (mail.ip6.digiware.nl [IPv6:2001:4cb8:1:106::2]) by mx1.freebsd.org (Postfix) with ESMTP id 7ACCB8FC12 for ; Sun, 10 Oct 2010 20:00:33 +0000 (UTC) Received: from localhost (localhost.digiware.nl [127.0.0.1]) by mail.digiware.nl (Postfix) with ESMTP id D057A153437; Sun, 10 Oct 2010 22:00:31 +0200 (CEST) X-Virus-Scanned: amavisd-new at digiware.nl Received: from mail.digiware.nl ([127.0.0.1]) by localhost (rack1.digiware.nl [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id Otr5-Z+b5Tqg; Sun, 10 Oct 2010 22:00:20 +0200 (CEST) Received: from [192.168.10.215] (unknown [192.168.10.215]) by mail.digiware.nl (Postfix) with ESMTP id 5E725153433; Sun, 10 Oct 2010 22:00:20 +0200 (CEST) References: <4CB1DD0F.6000209@digiware.nl> <20101010193415.GA93540@icarus.home.lan> Message-Id: From: Willem Jan Withagen To: Jeremy Chadwick In-Reply-To: <20101010193415.GA93540@icarus.home.lan> Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable X-Mailer: iPad Mail (7B405) Mime-Version: 1.0 (iPad Mail 7B405) Date: Sun, 10 Oct 2010 22:06:05 +0200 Cc: "freebsd-fs@freebsd.org" Subject: Re: ZFS freeze/livelock X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 10 Oct 2010 20:00:34 -0000 Op 10 okt. 2010 om 21:34 heeft Jeremy Chadwick = het volgende geschreven: > On Sun, Oct 10, 2010 at 05:34:39PM +0200, Willem Jan Withagen wrote: >> Just had my FreeBSD freeze on me with what I would think is sort of >> an livelock.... >>=20 >> While I was receiving zfs snapshots on my data pool. >>=20 >> Top and systat just kept running, >> but anything getting near a shell (and perhaps disk-io) ended up in: >>=20 >> root@zfs.digiware.nl# gpart create -s gpt da6 >> load: 0.00 cmd: csh 12393 [zfsvfs->z_teardown_inactive_lock] 26.12r >> 0.00u 0.00s 0% 2480k >> load: 0.10 cmd: csh 12393 [zfsvfs->z_teardown_inactive_lock] 96.01r >> 0.00u 0.00s 0% 2480k >>=20 >> Trying to execute to execute shutdown -r now had no effect what so = ever. >> Neither did the three-finger salute. >> (Well at least not in 60 sec I was willing to wait.) >>=20 >> Only way out of this situation was hard-reset. And I do have to >> admit I like ZFS for the speed it recovers after unexpected reboot. >>=20 >> To bad there was no alt-ctrl-backspace escape to debugger compiled >> in. I'll do that with the next kernel, just in case. >>=20 >> So the only data point I can give is the ^T output above. >=20 > We don't know what FreeBSD version you're using (specifically uname -a > output, since build date matters), but if it's RELENG_8 with ZFS v15, > you might check out this thread (be sure to read Kai and I's diagnoses Sorry about that. I'm running Stable on this box, as of last tuesday so thats v15, but = the disks are still at v14.=20 >=20 > http://lists.freebsd.org/pipermail/freebsd-fs/2010-October/009687.html I'll check It out. > I'm in the process of moving all of my machines, including my home > server, over to gmirror. (Home machine started showing signs of = serious > ZFS performance degredation; mutt doing a stat() on 24 files and > directories total taking literally 0.4 seconds on a dual-core machine. > Makes no sense, doesn't happen with UFS2, I'm done.) Well, all new things require time, hard work and diligent testing. It is = no different than any new serious component added. Be It the migration = to real multi processor, or giant removal. Both went through phases of better and not so better stability. Had we all given up, then there would not have been the current state of = freebsd. So i understand your feelings, but then i'm not running It on super = essential servers and not giving up so easily. --WjW =20=