From owner-freebsd-stable@FreeBSD.ORG Sun Apr 30 21:59:48 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 6BDAB16A46B for ; Sun, 30 Apr 2006 21:59:48 +0000 (UTC) (envelope-from brad-fbsd-stable@duttonbros.com) Received: from uno.mnl.com (uno.mnl.com [63.97.246.49]) by mx1.FreeBSD.org (Postfix) with SMTP id E9D1643D68 for ; Sun, 30 Apr 2006 21:59:41 +0000 (GMT) (envelope-from brad-fbsd-stable@duttonbros.com) Received: (qmail 77563 invoked by uid 85); 30 Apr 2006 21:59:39 -0000 Received: from 127.0.0.1 by uno (envelope-from , uid 89) with qmail-scanner-1.25 (spamassassin: 2.55. Clear:RC:1(127.0.0.1):. Processed in 0.060665 secs); 30 Apr 2006 21:59:39 -0000 Received: from unknown (HELO uno.mnl.com) (127.0.0.1) by localhost with SMTP; 30 Apr 2006 21:59:39 -0000 Received: from 67.169.82.217 (SquirrelMail authenticated user bdutton) by uno.mnl.com with HTTP; Sun, 30 Apr 2006 14:59:39 -0700 (PDT) Message-ID: <60907.67.169.82.217.1146434379.squirrel@uno.mnl.com> In-Reply-To: <57229.67.169.82.217.1146336755.squirrel@uno.mnl.com> References: <62791.67.169.82.217.1145819073.squirrel@uno.mnl.com> <20060427065534.GI17000@garage.freebsd.pl> <20060427173550.GA22635@garage.freebsd.pl> <54892.67.169.82.217.1146176385.squirrel@uno.mnl.com> <65294.67.169.82.217.1146195422.squirrel@uno.mnl.com> <57229.67.169.82.217.1146336755.squirrel@uno.mnl.com> Date: Sun, 30 Apr 2006 14:59:39 -0700 (PDT) From: "Bradley W. Dutton" To: "Pawel Jakub Dawidek" User-Agent: SquirrelMail/1.4.6 MIME-Version: 1.0 Content-Type: text/plain;charset=iso-8859-1 Content-Transfer-Encoding: 8bit X-Priority: 3 (Normal) Importance: Normal Cc: freebsd-stable@freebsd.org Subject: Re: 6.1 prerelease graid3 livelock? X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: brad-fbsd-stable@duttonbros.com List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 30 Apr 2006 21:59:48 -0000 I can reproduce the panic I experienced before. When in single user mode if I try to mount a raid3 array that isn't complete I get the following error: panic: Lock (sx) GEOM topology locked @ /usr/src/sys/geom/raid3/g_raid3.c:775. The full error and alltrace is here: http://duttonbros.com/freebsd/ddb.log As a workaround I would boot a pre March 20 kernel, rebuild the array, then boot the new kernel again. Thanks, Brad > It looks like the second patch fixed the problem. The box has been up for > just over a day and a half without any problems. > > Thanks, > Brad > > >> Well I don't know what was going on earlier but I reverted to good >> kernel, >> synced my raid arrays (no longer degraded from the panics), then booted >> a >> kernel with the second patch applied, this time no problems so far. I'll >> let you know how things go after the box is running for a while. >> >> Thanks, >> Brad >> >> >>> Ok, I had already installed and booted the first patch. I then rebuilt >>> the >>> kernel with the second patch. Trying to reboot from the first patch to >>> the >>> second resulted in a crash/panic on shutdown. I didn't capture the >>> output >>> from this. Once I booted the second patch the machine panics in the >>> boot >>> process, in short: >>> /dev/raid3t/moviesf: clean,o 190615 free (19p11 frags, 23588 oblocks, >>> 0.8% >>> fralogy locked @ /usr/src/sys/geom/raid3/g_raid3.c:773. >>> KDB: enter: panic >>> [thread pid 35 tid 100030 ] >>> Stopped at kdb_enter+0x30: leave >>> >>> The text copied from the serial console was a little garbled, it did >>> say >>> something like: >>> sx lock, geom topology locked... >>> >>> I did an "alltrace" at that point which I'll send seperately. >>> >>> Thanks, >>> Brad >>> >>> >>>> On Thu, Apr 27, 2006 at 08:55:35AM +0200, Pawel Jakub Dawidek wrote: >>>> +> On Sun, Apr 23, 2006 at 12:04:33PM -0700, Bradley W. Dutton wrote: >>>> +> +> Hi, >>>> +> +> >>>> +> +> I'm experiencing a sort of livelock on a 6.1 prerelease box. It >>>> appears >>>> +> +> all of the IO related activity hangs but the box continues to >>>> do >>>> +> +> routing/NAT/etc for internet access from my other boxes. I can >>>> usually >>>> +> +> get the lockup to occur within about 12 hours of booting. >>>> +> >>>> +> Ok, I think I found it. Could you try this patch: >>>> +> >>>> +> http://people.freebsd.org/~pjd/patches/g_raid3.c.4.patch >>>> >>>> markus@ reported the livelock is still there, so please try this patch >>>> instead: >>>> >>>> http://people.freebsd.org/~pjd/patches/g_raid3.c.5.patch >>>