From owner-freebsd-questions@FreeBSD.ORG  Mon Feb 26 05:50:06 2007
Return-Path: <owner-freebsd-questions@FreeBSD.ORG>
X-Original-To: freebsd-questions@freebsd.org
Delivered-To: freebsd-questions@freebsd.org
Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52])
	by hub.freebsd.org (Postfix) with ESMTP id 2221616A400
	for <freebsd-questions@freebsd.org>;
	Mon, 26 Feb 2007 05:50:06 +0000 (UTC)
	(envelope-from smithi@nimnet.asn.au)
Received: from gaia.nimnet.asn.au (nimbin.lnk.telstra.net [139.130.45.143])
	by mx1.freebsd.org (Postfix) with ESMTP id 43D8B13C478
	for <freebsd-questions@freebsd.org>;
	Mon, 26 Feb 2007 05:50:03 +0000 (UTC)
	(envelope-from smithi@nimnet.asn.au)
Received: from localhost (smithi@localhost)
	by gaia.nimnet.asn.au (8.8.8/8.8.8R1.5) with SMTP id QAA21193;
	Mon, 26 Feb 2007 16:49:47 +1100 (EST)
	(envelope-from smithi@nimnet.asn.au)
Date: Mon, 26 Feb 2007 16:49:46 +1100 (EST)
From: Ian Smith <smithi@nimnet.asn.au>
To: Jerry McAllister <jerrymc@msu.edu>
In-Reply-To: <20070225182126.GA54901@gizmo.acns.msu.edu>
Message-ID: <Pine.BSF.3.96.1070226153206.18258A-100000@gaia.nimnet.asn.au>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Cc: Marty Landman <martster@gmail.com>, freebsd-questions@freebsd.org
Subject: Re: input/output error on hd
X-BeenThere: freebsd-questions@freebsd.org
X-Mailman-Version: 2.1.5
Precedence: list
List-Id: User questions <freebsd-questions.freebsd.org>
List-Unsubscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-questions>, 
	<mailto:freebsd-questions-request@freebsd.org?subject=unsubscribe>
List-Archive: <http://lists.freebsd.org/pipermail/freebsd-questions>
List-Post: <mailto:freebsd-questions@freebsd.org>
List-Help: <mailto:freebsd-questions-request@freebsd.org?subject=help>
List-Subscribe: <http://lists.freebsd.org/mailman/listinfo/freebsd-questions>, 
	<mailto:freebsd-questions-request@freebsd.org?subject=subscribe>
X-List-Received-Date: Mon, 26 Feb 2007 05:50:06 -0000

On Sun, 25 Feb 2007, Jerry McAllister wrote:
 > On Sun, Feb 25, 2007 at 10:38:01AM -0500, Marty Landman wrote:
 > 
 > > On 2/24/07, Jerry McAllister <jerrymc@msu.edu> wrote:
 > > >
 > > >
 > > >Well, I think you are past any label problems now and on to the
 > > >bad sectors and/or finding superbocks.    Although I am not quite clear
 > > >from above what resulted in the 'INCOMPLETE LABEL...'  message.
 > > 
 > > 
 > > Ok then, can you point me to somewhere that can learn about superblocks and
 > > give me an idea of what to do next? 
 > 
 > Most of what I figured out several years ago (and have essentially
 > forgotten since) came from the handbook and something I found by
 > searching the web that gave the layout of blocks and chains.  I would
 > have to go back searching again.

Failing someone who actually knows what they're talking about re UFS
structures chipping in to this discussion, all I can offer in addition
is what my own exploration of manuals and a bit of googling turned up .. 

Firstly, Marty, you should run dumpfs(8) on your ad1s1a.  With the -m
switch, this produces a single line suitable for feeding into newfs with
all parameters, and is probably worth saving for all slices in case of
any subsequent emergencies.  I've just done that for mine, anyway, along
with fdisk and boot0cfg -v output, and bsdlabel output for UFS slices. 

Without the -m switch, feed the output to a file, or less, as it's very
voluminous.  For a 240GB drive, it'll likely be huge.  However the data
at the head is probably what's needed, though I can't make much of it.

This post by Ian Dowse explains how to compute where the superblocks
are, for a quoted example dumpfs: http://noc.caravan.ru/faq/SBLOCK.html

Note however that Ian is talking about UFS1 (where the superblock offset
was 32) but if you consult fsck_ffs(8) you'll see (under -b) that for
UFS2, which you almost certainly would have used, it's at 140 .. I
gather that's the offset from the start of each cylinder group?


 > >   Also assuming my bad sectors really are
 > > totally bad, wouldn't fsck allow me to mark them as unusable and move on?
 > 
 > No, fsck does not do that.   Marking blocks bad happend below the
 > level of the OS - generally in the disk controller itself.   It remaps
 > sectors until it runs out of spares and when it runs out, it starts
 > reporting unrecoverable errors.   This is not even reported to the OS
 > until it runs out of spares.
 > 
 > The only thing you can do with those bad sectors is to try and figure 
 > out if any of them are superblocks.  If they are, you can probably
 > rebuild it from other superblock clones.   If it is not, it is probably
 > lost data.  In that case try to overwrite the bad sector.  If that 
 > works, then the sector itself is OK, but the data that was there is
 > gone.   If it doesn't work, then it is bad and there is a good chance
 > that more than data got nuked in the power failure - eg, it damaged
 > the disk or controller in some way.

Seeing if fsck_ffs will use any discovered alternate superblocks would
be the first step, and if so, whether that helps to get it mounted.  I'd
certainly be careful to mount it read-only before trying data recovery! 

Since Marty has already been bravely using dd :) rewriting those sectors
should be easy enough, bearing in mind the apparent off-by-one numbering
difference between the sectors dd found bad and those fsck reported bad.

 > But, the next thing seems to be learning about how to follow the file
 > chains and how to find and read and write superblocks.  Alternatively
 > you can decide it isn't worth the effort to recover and try and write
 > over the drive completely - just totally trash it - and see if those
 > bad sectors will write.  If you did that, then you would have to rebuild 
 > the slice and partition table and do a newfs before you could again
 > use the drive and everything previously on it would be lost.

Well if a dd rewriting those specific contiguous sectors failed, I doubt
that newfs would do any better, so the dd is definitely worth a try, but
I wouldn't write anything further to the fs until all else has failed.

 > Good luck.

I can only echo that, again.

 > Maybe someone who has some experience in tracking file chains can
 > respond and give you more helpp than can I.

Ditto for that .. I'm now very thoroughly out of my depth here, though
I've learned a few new things through the exercise. 

Maybe mailing Ian Dowse with circumstances and the dumpfs head might be
worth a try, Marty?  See the website committers' page for his address.

Cheers, Ian