From owner-freebsd-current Sun Aug 29 23:24:53 1999 Delivered-To: freebsd-current@freebsd.org Received: from allegro.lemis.com (allegro.lemis.com [192.109.197.134]) by hub.freebsd.org (Postfix) with ESMTP id 095AE15276 for ; Sun, 29 Aug 1999 23:24:45 -0700 (PDT) (envelope-from grog@freebie.lemis.com) Received: from freebie.lemis.com (freebie.lemis.com [192.109.197.137]) by allegro.lemis.com (8.9.1/8.9.0) with ESMTP id PAA27207; Mon, 30 Aug 1999 15:52:25 +0930 (CST) Received: (from grog@localhost) by freebie.lemis.com (8.9.3/8.9.0) id PAA76783; Mon, 30 Aug 1999 15:52:24 +0930 (CST) Date: Mon, 30 Aug 1999 15:52:24 +0930 From: Greg Lehey To: Matthew Dillon Cc: Parag Patel , Bernd Walter , Mike Smith , freebsd-current@FreeBSD.ORG Subject: Re: 4.0-CURRENT SMP crash with vinum raid-5 and softupdates Message-ID: <19990830155224.C13904@freebie.lemis.com> References: <80967.935993388@pinhead.parag.codegen.com> <199908300615.XAA10147@apollo.backplane.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 0.95.4i In-Reply-To: <199908300615.XAA10147@apollo.backplane.com>; from Matthew Dillon on Sun, Aug 29, 1999 at 11:15:47PM -0700 WWW-Home-Page: http://www.lemis.com/~grog X-PGP-Fingerprint: 6B 7B C3 8C 61 CD 54 AF 13 24 52 F8 6D A4 95 EF Organization: LEMIS, PO Box 460, Echunga SA 5153, Australia Phone: +61-8-8388-8286 Fax: +61-8-8388-8725 Mobile: +61-41-739-7062 Sender: owner-freebsd-current@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On Sunday, 29 August 1999 at 23:15:47 -0700, Matthew Dillon wrote: >> Greg's debugging this on the machine here at the moment, which is >> crashing the same way in what appears to be the same place. >> >> If anyone else wants to take a crack it, the magic vinum debug number >> needed is 328 and not 324. >> >> So far Greg's discovered that the field is correctly set a little while >> before the crash, and then becomes NULL for no apparent reason. Looks >> like a really nasty bug. >> >> The nice thing is this loaner machine has no real job to do besides >> crashing, so it can crash as often as required. :) > > I looked at the vinum code a little. I would look for possible situations > where the bp may be re-issued as an I/O before actually completing a prior > I/O. This could cause the iodone field to be cleared in the middle of > an I/O. I don't see anything specific in the code because I don't really > understand it yet. That's the best possibility that I can come up with. Well, I don't do things like that. The only place I reissue a command is in the iodone routine (complete_rqe or complete_raid5_write; the latter in this case). Greg -- See complete headers for address, home page and phone numbers finger grog@lemis.com for PGP public key To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message