From owner-freebsd-geom@FreeBSD.ORG Sat Jan 13 17:43:25 2007 Return-Path: X-Original-To: freebsd-geom@freebsd.org Delivered-To: freebsd-geom@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id CE9EA16A403 for ; Sat, 13 Jan 2007 17:43:25 +0000 (UTC) (envelope-from cyberleo@cyberleo.net) Received: from pizzabox.cyberleo.net (alpha.cyberleo.net [198.145.45.10]) by mx1.freebsd.org (Postfix) with ESMTP id 8C7C813C44B for ; Sat, 13 Jan 2007 17:43:23 +0000 (UTC) (envelope-from cyberleo@cyberleo.net) Received: (qmail 60039 invoked from network); 13 Jan 2007 17:43:22 -0000 Received: from adsl-69-212-1-127.dsl.chcgil.ameritech.net (HELO ?172.16.44.14?) (cyberleo@cyberleo.net@69.212.1.127) by alpha.cyberleo.net with ESMTPA; 13 Jan 2007 17:43:22 -0000 Message-ID: <45A91A0F.8070701@cyberleo.net> Date: Sat, 13 Jan 2007 11:42:39 -0600 From: CyberLeo Kitsana User-Agent: Thunderbird 1.5 (X11/20051201) MIME-Version: 1.0 To: "R. B. Riddick" References: <835894.31143.qm@web30309.mail.mud.yahoo.com> In-Reply-To: <835894.31143.qm@web30309.mail.mud.yahoo.com> Content-Type: text/plain; charset=iso-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: FreeBSD Geom Subject: Re: geom_raid5 livelock? X-BeenThere: freebsd-geom@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: GEOM-specific discussions and implementations List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 13 Jan 2007 17:43:25 -0000 R. B. Riddick wrote: > --- CyberLeo Kitsana wrote: >> ... > Possibly there happens an ENOMEM error, which would explain the repititions... > > A further debug line (it is again at debug level 2) before the current line > 1165 (inside the inbed==children IF but in the end of it): > G_RAID5_LOGREQ(bp, "[ready err%d cmp%jd]", obp->bio_error, obp->bio_completed); > Or turn on "bootverbose"... > > For better data safety (e. g. in case of a power loss), I would recommend to > reduce kern.geom.raid5.wdt to 0 or 1 (the lower the safer). > > For less memory consumtion I would use lower values for .maxmem and for > .maxwql... Good morning! http://home.cyberleo.net/cyberleo/workspace/Zip/graid5-all2.log http://home.cyberleo.net/cyberleo/workspace/Zip/graid5-all2-2.log I'm not sure what error 5 is, but it looks ominous. The first log shows two seconds of the first test, where only ad2s2 was showing up. The second log is after a restart of everything, and ad0s2, ad2s2, and ad6s2 show up, indicating that this most likely isn't just a drive, bus, or controller failure. The machine is on a UPS, so power loss isn't too much of an issue. What other impacts would reducing kern.geom.raid5.wdt have? -- Fuzzy love, -CyberLeo Technical Administrator CyberLeo.Net Webhosting http://www.CyberLeo.Net Furry Peace! - http://www.fur.com/peace/