From: Johan Ström
To: Jeremy Chadwick
Cc: freebsd-fs@freebsd.org, freebsd-stable@freebsd.org
Subject: Re: ZFS deadlock
Date: Tue, 8 Apr 2008 09:37:57 +0200
In-Reply-To: <20080408073200.GA32128@eos.sc1.parodius.com>

On Apr 8, 2008, at 9:32 AM, Jeremy Chadwick wrote:

> On Tue, Apr 08, 2008 at 08:17:38AM +0200, Johan
> Ström wrote:
>> Hello
>>
>> A box of mine running RELENG_7_0 and ZFS over a couple of disks (6
>> disks, 3 mirrors) seems to have gotten stuck. From Ctrl-T:
>>
>> load: 0.50 cmd: zsh 40188 [zfs:&buf_hash_table.ht_locks[i].ht_lock] 0.02u 0.04s 0% 3404k
>> load: 0.43 cmd: zsh 40188 [zfs:&buf_hash_table.ht_locks[i].ht_lock] 0.02u 0.04s 0% 3404k
>> load: 0.10 cmd: zsh 40188 [zfs:&buf_hash_table.ht_locks[i].ht_lock] 0.02u 0.04s 0% 3404k
>> load: 0.10 cmd: zsh 40188 [zfs:&buf_hash_table.ht_locks[i].ht_lock] 0.02u 0.04s 0% 3404k
>> load: 0.11 cmd: zsh 40188 [zfs:&buf_hash_table.ht_locks[i].ht_lock] 0.02u 0.04s 0% 3404k
>>
>> Worked for a while then that stopped working too (was over ssh). When
>> trying a local login I only got
>>
>> load: 0.09 cmd: login 1611 [zfs] 0.00u 0.00s 0% 208k
>>
>> I found one post like this earlier (by Xin LI), but nobody seemed to
>> have replied...
>> In my current conf, I think my kmem/kmem_max is at 512MB (not sure
>> though, since I edited my file yesterday for next reboot), with 2G of
>> system RAM. Normally I'd run kmem(max) 1G (with arcsize of 512M;
>> currently it is at default), but since I just got back to 2G total mem
>> after some hardware problems I've been running at those lows (1G total
>> is kind of tight with ZFS).
>>
>> Well, just wanted to report... The box is not totally dead yet, i.e. I
>> can still do Ctrl-T on console, but that's it. I don't really know what
>> more I can do, so... I don't have KDB/DDB.
>> I'll wait another hour or so before I hard reboot it, unless it
>> "unlocks" or if anyone has any suggestions.
>
> I don't think there are any suggestions left to give. Many people,
> including myself, have experienced this kind of problem. It's
> well-documented both on my Common Issues page and the official FreeBSD
> ZFS Wiki.

Ah..
I guess I was just too restrictive with the googling on
"zfs:&buf_hash_table.ht_locks[i].ht_lock".

> ZFS is still considered highly experimental, so if your data is at all
> important to you, perform backups or switch to another filesystem
> provider.

That I am aware of. Thanks.
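
For anyone collecting similar reports, the Ctrl-T samples quoted above can be compared mechanically. The following is only a rough sketch in Python: the field layout (load average, command, PID, wait channel in brackets, user and system CPU time, CPU percentage, resident size) is inferred from the lines in this thread, and the names `STATUS_RE`, `parse_status` and `looks_stuck` are made up for illustration. It treats a process as probably stuck when successive samples show the same PID and wait channel with no CPU time accumulated.

```python
import re

# Assumed layout of a Ctrl-T status line, inferred from the samples in this
# thread. The greedy ".*" lets the wait channel itself contain brackets,
# as in "ht_locks[i]".
STATUS_RE = re.compile(
    r"load:\s*(?P<load>[\d.]+)\s+"
    r"cmd:\s*(?P<cmd>\S+)\s+(?P<pid>\d+)\s+"
    r"\[(?P<wchan>.*)\]\s+"
    r"(?P<user>[\d.]+)u\s+(?P<sys>[\d.]+)s\s+"
    r"(?P<cpu>\d+)%\s+(?P<rss>\d+)k"
)


def parse_status(line):
    """Return the fields of one status line as a dict, or None if no match."""
    m = STATUS_RE.search(line)
    if m is None:
        return None
    d = m.groupdict()
    return {
        "load": float(d["load"]),
        "cmd": d["cmd"],
        "pid": int(d["pid"]),
        "wchan": d["wchan"],
        "user": float(d["user"]),
        "sys": float(d["sys"]),
        "cpu": int(d["cpu"]),
        "rss_kb": int(d["rss"]),
    }


def looks_stuck(lines):
    """True if at least two samples share PID, wait channel and CPU times."""
    samples = [s for s in map(parse_status, lines) if s is not None]
    if len(samples) < 2:
        return False

    def key(s):
        return (s["pid"], s["wchan"], s["user"], s["sys"])

    first = key(samples[0])
    return all(key(s) == first for s in samples[1:])


if __name__ == "__main__":
    samples = [
        "load: 0.50 cmd: zsh 40188 [zfs:&buf_hash_table.ht_locks[i].ht_lock] 0.02u 0.04s 0% 3404k",
        "load: 0.43 cmd: zsh 40188 [zfs:&buf_hash_table.ht_locks[i].ht_lock] 0.02u 0.04s 0% 3404k",
    ]
    print(parse_status(samples[0])["wchan"], looks_stuck(samples))
```

This is a heuristic only: a process that is legitimately asleep on a lock for a short time gives the same picture, so it is the repetition over several minutes, as in the report above, that points to a deadlock.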