From owner-freebsd-questions Thu Feb 13 02:48:33 1997 Return-Path: Received: (from root@localhost) by freefall.freebsd.org (8.8.5/8.8.5) id CAA03075 for questions-outgoing; Thu, 13 Feb 1997 02:48:33 -0800 (PST) Received: from hda.hda.com (ip42-max1-fitch.ziplink.net [199.232.245.42]) by freefall.freebsd.org (8.8.5/8.8.5) with SMTP id CAA03069 for ; Thu, 13 Feb 1997 02:48:28 -0800 (PST) Received: (from dufault@localhost) by hda.hda.com (8.6.12/8.6.12) id FAA13891; Thu, 13 Feb 1997 05:43:03 -0500 From: Peter Dufault Message-Id: <199702131043.FAA13891@hda.hda.com> Subject: Re: drive failure? In-Reply-To: from spork at "Feb 13, 97 02:54:34 am" To: spork@super-g.com (spork) Date: Thu, 13 Feb 1997 05:43:02 -0500 (EST) Cc: shovey@buffnet.net, questions@freebsd.org X-Mailer: ELM [version 2.4ME+ PL25 (25)] MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit Sender: owner-questions@freebsd.org X-Loop: FreeBSD.org Precedence: bulk > Hmmm... The Buslogic verify wants to "destroy all data on the disk". I > don't think I want that. Anyone have any ideas on this or know of the > utility mentioned? 1. Verify that AWRE and ARRE are set in mode page 1. Use the mode page editor in scsi(8). This will permit the drive to reallocate on write errors and RECOVERED read errors. It won't do anything for unrecovered read errors such as you have logged. 2. See that you have POST ERROR turned off in that same mode page to turn off the RECOVERED ERROR messages. I suspect the driver is treating that as an error instead of an informational message, though I haven't verified that. 3. Now you must write something to that bad spot on the disk and the drive will reallocate the sector. Note that the Adaptec utility can't properly synthesize data that it can't read either, though it may have a sophisticated recovery technique, and so the suggestion to turn the Adaptec utility loose on the disk may or may not be a good idea. The info number in the error message is the disk block that can't be read. If that block is on a swap partition dd the partition full of zeros when no swapping is enabled on that partition and be done with it. If the bad spot is in a file system try to figure out what to do - you have either lost part of a file or some file system metadata. Your options are restoring the partition after writing something to the block or writing arbitrary data to that block to force a reallocation and hoping for the best. One option suggested by Rod in the past is to read that block in a loop hoping at some point it will succeed and the drive will slip it - this is unlikely to work, though. Ask again if you can't either restore the partition or use "dd" to write something to that block. If you're willing and able and have your disk backed up, FIRST pick up ftp://freefall.freebsd.org/pub/dufault/scsinew2.tgz build it and run the "zones.wish" tk script. This should warn you about an improperly setup disk and display the bad spots on it. Or it may crash your system. Peter -- Peter Dufault (dufault@hda.com) Realtime Machine Control and Simulation HD Associates, Inc. Voice: 508 433 6936