From owner-freebsd-hackers Tue Mar 3 12:03:20 1998 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id MAA19846 for freebsd-hackers-outgoing; Tue, 3 Mar 1998 12:03:20 -0800 (PST) (envelope-from owner-freebsd-hackers@FreeBSD.ORG) Received: from sendero.simon-shapiro.org (sendero-fxp0.Simon-Shapiro.ORG [206.190.148.34]) by hub.freebsd.org (8.8.8/8.8.8) with SMTP id MAA19830 for ; Tue, 3 Mar 1998 12:03:03 -0800 (PST) (envelope-from shimon@sendero-fxp0.simon-shapiro.org) Received: (qmail 15459 invoked by uid 1000); 3 Mar 1998 20:10:08 -0000 Message-ID: X-Mailer: XFMail 1.3-alpha-021598 [p0] on FreeBSD X-Priority: 3 (Normal) Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 8bit MIME-Version: 1.0 In-Reply-To: <19980303191755.14264@freebie.lemis.com> Date: Tue, 03 Mar 1998 12:10:08 -0800 (PST) Reply-To: shimon@simon-shapiro.org Organization: The Simon Shapiro Foundation From: Simon Shapiro To: Greg Lehey Subject: Re: SCSI Bus redundancy... Cc: hackers@FreeBSD.ORG, blkirk@float.eli.net, jdn@acp.qiv.com, tlambert@primenet.com, sbabkin@dcn.att.com, Wilko Bulte Sender: owner-freebsd-hackers@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On 03-Mar-98 Greg Lehey wrote: ... > Obviously there are a number of problems. But in fact it's not as > difficult as it sounds. There's a problem with RAID 5 anyway if > there's, say, a power failure during a write. After bringing it back > up again, you can recognize that there's a parity error, but where? This sounds like an uncomitted transaction. Quite easy to arrange for a rollback at boot time. You keep such gems in NVRAM, for example, or in a known place on the disks, or you implement a journal, or... ... > Vinum offers another alternative: attach a second plex with the same > data, maybe only a few megabytes at a time. During the time this area > of the volume is being updated, the plex supplies a backup in case of > failure. When the region is left, the plex is detached and reattached > at the next point in the array. If anything goes down, the correct > data will be in the auxiliary plex. Concatenation with journaling. Could be done. > Does that make sense? I'll try to formulate it more clearly if > anybody has difficulty with the concepts. The only problem I have here, is the assumption that the O/S will do all that. Not only it consumes much CPU, I/O bus, memory bandwidth, etc., but O/S crashes are the number one cause of failure in any modern computer. Putting all this logic there is asking for it to crash frequently, and run under load all the time. I think that the RAID logic should be outside the VPU/O/S proper, just like CRC checking is not done in the CPU anymode, and since SCSI and IDE, so is data separation, PLL detection loops, etc. If your data is so important to you, spend few dollars to get it done in a predictable and reliable manner. BTW, Since 1984 or so, I NEVER lost data due to disk failure. I lost a LOT of data due to O/S failures, and some data due to bugs in RAID logic. Although I do not belive Seagate's claim for 1,000,000 hours MTBF, I think the realized MTBF will far exceed any FreeBSD uptime. This discussion is very enlightening none the less. ---------- Sincerely Yours, Simon Shapiro Shimon@Simon-Shapiro.ORG Voice: 503.799.2313 To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-hackers" in the body of the message