From owner-freebsd-current@FreeBSD.ORG Thu Nov 1 01:04:15 2007 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id E203816A421 for ; Thu, 1 Nov 2007 01:04:15 +0000 (UTC) (envelope-from mcdouga9@egr.msu.edu) Received: from mx.egr.msu.edu (surfnturf.egr.msu.edu [35.9.37.164]) by mx1.freebsd.org (Postfix) with ESMTP id AB24413C494 for ; Thu, 1 Nov 2007 01:04:15 +0000 (UTC) (envelope-from mcdouga9@egr.msu.edu) Received: from localhost (localhost.egr.msu.edu [127.0.0.1]) by mx.egr.msu.edu (Postfix) with ESMTP id 39E902EB862; Wed, 31 Oct 2007 16:17:32 -0400 (EDT) X-Virus-Scanned: amavisd-new at egr.msu.edu Received: from mx.egr.msu.edu ([127.0.0.1]) by localhost (surfnturf.egr.msu.edu [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id ZYzCfeFLEA-I; Wed, 31 Oct 2007 16:17:32 -0400 (EDT) Received: from localhost (daemon.egr.msu.edu [35.9.44.65]) by mx.egr.msu.edu (Postfix) with ESMTP id 11E862EB82C; Wed, 31 Oct 2007 16:17:32 -0400 (EDT) Received: by localhost (Postfix, from userid 21281) id 0DB4C33C3D; Wed, 31 Oct 2007 16:17:32 -0400 (EDT) Date: Wed, 31 Oct 2007 16:17:32 -0400 From: Adam McDougall To: Pawel Jakub Dawidek Message-ID: <20071031201731.GA53377@egr.msu.edu> References: <20071029024356.GR3612@egr.msu.edu> <20071031095428.GH8068@garage.freebsd.pl> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20071031095428.GH8068@garage.freebsd.pl> User-Agent: Mutt/1.5.16 (2007-06-09) Cc: freebsd-current@freebsd.org Subject: Re: zfs stuck, cannot do any I/O, processes in Disk Wait X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 01 Nov 2007 01:04:16 -0000 On Wed, Oct 31, 2007 at 10:54:28AM +0100, Pawel Jakub Dawidek wrote: On Sun, Oct 28, 2007 at 10:43:56PM -0400, Adam McDougall wrote: > I think I have had this happen at least once before, but someone else > rebooted the system before I could see it. I have a server with a > number of zfs filesystems mounted from a raidz, but it won't transfer > any data. I'm not sure why its stuck. It is running 7.0-PRERELEASE > Wed Oct 17 and I'm pretty sure it is WITHOUT vm_kern.c.2.patch. The > system is amd64 and I have not seen a kmem panic since I raised kmem > to 1.5G. > > I logged in to scp a file off of zfs, was able to ls -l to see the file > but the scp hung before transferring any bytes. Now I cannot do a ls -l > in that directory, /z. I noticed several days worth of rsync processes > stuck in disk wait, it must have been in this state for several days. > I have no urgent need to reboot this system, its more important to try > to get a permanent fix. Please let me know what other information I > can provide. I'm afraid you need to do some debugging on your own. I'd start from compiling INVARIANTS, INVARIANT_SUPPORT and WITNESS in. You may want to take a look at developers handbook to see how to obtain some informations from DDB. Thank you, this is useful guidance for me and I will do so. -- Pawel Jakub Dawidek http://www.wheel.pl pjd@FreeBSD.org http://www.FreeBSD.org FreeBSD committer Am I Evil? Yes, I Am!