From owner-freebsd-fs Mon Nov 15 12: 6:50 1999 Delivered-To: freebsd-fs@freebsd.org Received: from mail.du.gtn.com (mail.du.gtn.com [194.77.9.57]) by hub.freebsd.org (Postfix) with ESMTP id 07F1C150A7 for ; Mon, 15 Nov 1999 12:06:41 -0800 (PST) (envelope-from ticso@mail.cicely.de) Received: from mail.cicely.de (cicely.de [194.231.9.142]) by mail.du.gtn.com (8.9.3/8.9.3) with ESMTP id UAA28447; Mon, 15 Nov 1999 20:59:40 +0100 (MET) Received: (from ticso@localhost) by mail.cicely.de (8.9.0/8.9.0) id VAA06307; Mon, 15 Nov 1999 21:06:08 +0100 (CET) Date: Mon, 15 Nov 1999 21:06:08 +0100 From: Bernd Walter To: Greg Lehey Cc: Bernd Walter , Mattias Pantzare , freebsd-fs@FreeBSD.ORG Subject: Re: RAID-5 and failure Message-ID: <19991115210607.A6252@cicely7.cicely.de> References: <199911061716.SAA20783@zed.ludd.luth.se> <19991106183316.A9420@cicely7.cicely.de> <19991113213325.57908@mojave.sitaranetworks.com> <19991115203828.B5417@cicely7.cicely.de> <19991115145200.09633@mojave.sitaranetworks.com> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii X-Mailer: Mutt 1.0pre3i In-Reply-To: <19991115145200.09633@mojave.sitaranetworks.com> Sender: owner-freebsd-fs@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org On Mon, Nov 15, 1999 at 02:52:00PM -0500, Greg Lehey wrote: > On Monday, 15 November 1999 at 20:38:28 +0100, Bernd Walter wrote: > > On Sat, Nov 13, 1999 at 09:33:25PM -0500, Greg Lehey wrote: > >> > >> 4. The system crashes after writing the first data block for a RAID-5 > >> stripe and before writing the last data block. > >> > >> When the system comes up, both data and parity are inconsistent. > >> > >> 5. The system crashes after writing the last data block for a RAID-5 > >> stripe and before writing the last parity block. > >> > >> When the system comes up, data is consistent, and parity is > >> inconsistent. > >> > >> There are a number of ways of dealing with situations 4 and 5. The > >> real problem is that they only occur when the system crashes, so > >> whatever recovery information is required must be stored in > >> non-volatile storage. Some systems do include a NOVRAM for this kind > >> of information, but in general purpose systems the only possibility is > >> to write the information to disk, which would make the inherently slow > >> RAID-5 write even slower. My attitude here is that RAID-5 writes are > >> comparatively infrequent, and so are crashes. In the case of (5), you > >> could rebuild parity after a crash. In the case of (4), I have no > >> good answer. Suggestions welcome. > > > > Case 4 is not that different from case 5 as any differences should be > > handled by the FS using the volume. > > The problem is that in case 4 you don't have anything to go by. You > don't know which data are inconsistent unless you keep a log. The FS > using the volume has followed the kernel into the eternal bit bucket. > Of course - but that may happen with R0 too and even it may be possible with a single disk. The FS should realy be able to handle this case as it knows that there is an outstanding write operation. -- B.Walter COSMO-Project http://www.cosmo-project.de ticso@cicely.de Usergroup info@cosmo-project.de To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-fs" in the body of the message