From owner-freebsd-stable@FreeBSD.ORG Fri May 20 14:26:01 2005
Message-ID: <08dc01c55d47$d7697100$37cba1cd@emerytelcom.com>
From: "Elliot Finley"
To: "Peter Jeremy"
cc: freebsd-stable@freebsd.org
cc: sos@freebsd.org
Date: Fri, 20 May 2005 08:25:58 -0600
Organization: Emery Telcom
Subject: Re: 5.4-RC2 freezing - ATA related?

From: "Peter Jeremy"
> On Wed, 2005-May-18 06:43:37 -0600, Elliot Finley wrote:
> >Had the system lock up again.  This is with the new ATA mkIII patches on
> >http://people.freebsd.org/~sos/ATA.
> >
> >I didn't get the crashdump (forgot to set dumpdev), but I did get 'ps' and
> >'show lockedvnods' output from DDB.  The output is in the form of
> >screenshots combined into a single .pdf which can be accessed here
> >http://www.efinley.com/Binder1.pdf
>
> That shows a deadlock-to-root in your /dev/ar0s1a (presumably root)
> filesystem.  The perl process (pid 487) has an exclusive lock on
> the FS mountpoint - this is blocking 130 other processes.  Pid 487
> is itself waiting on another filesystem lock (you can't determine
> the actual lock tree without more poking around kernel memory).
>
> The vnode locks are held by processes:
>  PID  name        waiting on
>  487  perl        [ufs c3c1c1b4]
>   57  syncer      [snaplk c535f500] (holds 2 locks)
>  476  perl        [ufs c87e4f1c]
>  489  perl        [snaplk c535f500] (holds 2 locks)
> 3337  mksnap_ffs  [getblk d77656f4]
>
> Looking through the process list, cron has started a "dump -L" which
> is trying to create a filesystem snapshot.  That has wedged on
> "getblk" (trying to perform physical disk I/O) and is probably the
> root of your problem.  Nothing else is waiting on physical I/O.
>
> I'd say that your first guess was right:  This is a bug in the ATA
> code and is probably a job for sos.

I took the -L option off of my dump command in my daily dump script.  I've
gone two days without locking up which is unusual.  I think that may be what
was tickling the bug that was locking me up.

Thanks for the analysis Peter.

Elliot