From owner-freebsd-stable@FreeBSD.ORG Tue May 31 23:47:22 2005 Return-Path: X-Original-To: stable@freebsd.org Delivered-To: freebsd-stable@FreeBSD.ORG Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id CC7FB16A41C for ; Tue, 31 May 2005 23:47:22 +0000 (GMT) (envelope-from steve@Watt.COM) Received: from wattres.watt.com (wattres.watt.com [66.93.133.130]) by mx1.FreeBSD.org (Postfix) with ESMTP id 7590043D1D for ; Tue, 31 May 2005 23:47:22 +0000 (GMT) (envelope-from steve@Watt.COM) Received: from wattres.watt.com (localhost.watt.com [127.0.0.1]) by wattres.watt.com (8.13.3/8.13.3) with ESMTP id j4VNlKm8064966; Tue, 31 May 2005 16:47:20 -0700 (PDT) (envelope-from steve@wattres.watt.com) Received: (from steve@localhost) by wattres.watt.com (8.13.3/8.13.3/Submit) id j4VNlKgF064965; Tue, 31 May 2005 16:47:20 -0700 (PDT) (envelope-from steve) Message-Id: <200505312347.j4VNlKgF064965@wattres.watt.com> X-Newsgroups: local.freebsd-stable In-Reply-To: <08dc01c55d47$d7697100$37cba1cd@emerytelcom.com> References: <001801c55a14$609720d0$37cba1cd@emerytelcom.com> <20050516195859.GA59189@server.vk2pj.dyndns.org> <042501c55ba7$360fac30$37cba1cd@emerytelcom.com> <20050518194356.GA2129@cirb503493.alcatel.com.au> Organization: Watt Consultants From: steve@Watt.COM (Steve Watt) Date: Tue, 31 May 2005 16:47:19 -0700 X-Mailer: Mail User's Shell (7.2.6 beta(5) 10/07/98) To: efinleywork@efinley.com X-Archived: 1117583240.762325069@wattres.Watt.COM X-Virus-Scanned: ClamAV 0.85.1/902/Tue May 31 14:14:31 2005 on wattres.Watt.COM X-Virus-Status: Clean Cc: stable@freebsd.org Subject: Re: 5.4-RC2 freezing - ATA related? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 31 May 2005 23:47:22 -0000 In <08dc01c55d47$d7697100$37cba1cd@emerytelcom.com>, efinleywork@efinley.com writes: >From: "Peter Jeremy" >> On Wed, 2005-May-18 06:43:37 -0600, Elliot Finley wrote: >> >Had the system lock up again. This is with the new ATA mkIII patches on >> >http://people.freebsd.org/~sos/ATA. >> > >> >I didn't get the crashdump (forgot to set dumpdev), but I did get 'ps' >and >> >'show lockedvnods' output from DDB. The output is in the form of >> >screenshots combined into a single .pdf which can be accessed here >> >http://www.efinley.com/Binder1.pdf >> >> That shows a deadlock-to-root in your /dev/ar0s1a (presumably root) >> filesystem. The perl process (pid 487) has an exclusive lock on >> the FS mountpoint - this is blocking 130 other processes. Pid 487 >> is itself waiting on another filesystem lock (you can't determine >> the actual lock tree without more poking around kernel memory). >> >> The vnode locks are held by processes: >> PID name waiting on >> 487 perl [ufs c3c1c1b4] >> 57 syncer [snaplk c535f500] (holds 2 locks) >> 476 perl [ufs c87e4f1c] >> 489 perl [snaplk c535f500] (holds 2 locks) >> 3337 mksnap_ffs [getblk d77656f4] >> >> Looking through the process list, cron has started a "dump -L" which >> is trying to create a filesystem snapshot. That has wedged on >> "getblk" (trying to perform physical disk I/O) and is probably the >> root of your problem. Nothing else is waiting on physical I/O. >> >> I'd say that your first guess was right: This is a bug in the ATA >> code and is probably a job for sos. > >I took the -L option off of my dump command in my daily dump script. I've >gone two days without locking up which is unusual. I think that may be what >was tickling the bug that was locking me up. This is a filesystem lock problem, not an ATA driver problem. I analyzed it, and posted the results to -hackers last week, with the subject "snapshots and innds". The problem is that there is an invariant being broken in msync() -- Kirk describes it fully in his reply to my message. -- Steve Watt KD6GGD PP-ASEL-IA ICBM: 121W 56' 57.8" / 37N 20' 14.9" Internet: steve @ Watt.COM Whois: SW32 Free time? There's no such thing. It just comes in varying prices...