From owner-freebsd-fs@FreeBSD.ORG Sun Apr 22 03:39:09 2007 Return-Path: X-Original-To: freebsd-fs@freebsd.org Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 9121C16A401 for ; Sun, 22 Apr 2007 03:39:09 +0000 (UTC) (envelope-from anderson@freebsd.org) Received: from mh2.centtech.com (moat3.centtech.com [64.129.166.50]) by mx1.freebsd.org (Postfix) with ESMTP id 6585A13C465 for ; Sun, 22 Apr 2007 03:39:09 +0000 (UTC) (envelope-from anderson@freebsd.org) Received: from neutrino.centtech.com (andersonbox1.centtech.com [192.168.42.21]) by mh2.centtech.com (8.13.8/8.13.8) with ESMTP id l3M3d3lD018231; Sat, 21 Apr 2007 22:39:05 -0500 (CDT) (envelope-from anderson@freebsd.org) Message-ID: <462AD8D7.3040009@freebsd.org> Date: Sat, 21 Apr 2007 22:39:03 -0500 From: Eric Anderson User-Agent: Thunderbird 2.0.0.0 (X11/20070420) MIME-Version: 1.0 To: Wes Morgan References: <20070421223552.F969@volatile.chemikals.org> In-Reply-To: <20070421223552.F969@volatile.chemikals.org> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Virus-Scanned: ClamAV 0.88.4/3146/Sat Apr 21 14:39:19 2007 on mh2.centtech.com X-Virus-Status: Clean X-Spam-Status: No, score=-1.3 required=8.0 tests=AWL,BAYES_05 autolearn=ham version=3.1.6 X-Spam-Checker-Version: SpamAssassin 3.1.6 (2006-10-03) on mh2.centtech.com Cc: freebsd-fs@freebsd.org Subject: Re: apparent filesystem-related hangs X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 22 Apr 2007 03:39:09 -0000 On 04/21/07 21:51, Wes Morgan wrote: > I have several filesystems built on top of a gconcat volume consisting of > 2 300gb and 1 500gb drive. The /usr partition constitutes the bulk of it. > The volume is sitting around 95% full, and twice I've had the system > become "hung" in what I believe are filesystem operations. It seems to > crop up when rtorrent tries to create files that would fill up the system, > but of course it doesn't actually reserve it until it is used. Any running > process remains responsive until it needs to access the disk. > > The system was running a fairly recent 6.2-stable, March 29, but I've > since updated to the most recent -stable. > > If anyone can give me some pointers as to how to tell exactly where these > processes are hanging, I can try to reproduce it... But it takes a while > to fsck the volume so I don't want to have to try too many things. If it's > not a known problem I can then submit a PR. Output of a 'ps -auxl' would help a lot, since it will tell you what each process is doing. Also, if it's really hung (does a 'df' or 'mount' return?), then having DDB in your kernel config, and a valid dump device set up would allow you to get a dump (at the debugger prompt, do 'call doadump'). Once you have that, you can debug it much better while your system is back up doing other things. I have a similar issue on one of my file servers, however I know it was caused by file system damage that fsck cannot fix. My choice is to live with it, or dump/restore to a newly newfs'ed file system. That's no fault of UFS of course (this was caused by disk issues). Eric