Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 30 May 2011 03:33:49 -0700
From:      Jeremy Chadwick <freebsd@jdc.parodius.com>
To:        Olaf Seibert <O.Seibert@cs.ru.nl>
Cc:        freebsd-stable@freebsd.org
Subject:   Re: ZFS I/O errors
Message-ID:  <20110530103349.GA73825@icarus.home.lan>
In-Reply-To: <20110530101051.GA49825@twoquid.cs.ru.nl>
References:  <20110530093546.GX6733@twoquid.cs.ru.nl> <20110530101051.GA49825@twoquid.cs.ru.nl>

next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, May 30, 2011 at 12:10:51PM +0200, Olaf Seibert wrote:
> On Mon 30 May 2011 at 11:35:46 +0200, Olaf Seibert wrote:
> > How do I identify which files it is listing here?

On Solaris anyway, "zpool status -v" is supposed to show this.
Occasionally it shows something like <0xXX>:<something>, rather than a
path/filename, and on FreeBSD it appears to behave the same.  Per your
own output:

        tank/vol-fourquid-1:<0x0>
        tank/vol-fourquid-1:<0xc8190da>

I'm not sure why this didn't actually map to a filename on the system
however.  I've never quite understood what the hexadecimal values shown
represent (I have ideas but it'd be useful to know what they meant).

> The nighly 'find' has found the files for me. It is actually a bunch of
> directories, that were likely in use when the crash occurred.
> 
> They give an "interesting" error when you try to ls them:
> 
> find: /tank/vol-fourquid-1/evadh/CLEF-IP11/PARSED_CORPUS/EP/000000/45/97:       Illegal byte sequence

Yes, and this is what error 86 was in the very first line of your kernel
output:

May 30 10:38:28 fourquid root: ZFS: zpool I/O failure, zpool=tank error=86

$ perror 86
Illegal byte sequence

> The file system is compressed, that may be the reason it can identify
> "illegal byte sequence"s.
> 
> It isn't even possible to rm -r the directories, or even to mv them...
> 
> (fortunately the standard trick works: move the parent directory
> instead, create a new one in its old place, and move the old, good,
> contents back).
> 
> but now I seem to be left with some directories (elsewhere) that I still
> can't remove...

I sincerely hope your "zpool scrub" addresses these problems.  Otherwise
I hope you have backup and can recreate the pool (zpool destroy, etc.).

Try running without compression and see if that improves things.

It's important to note that the I/O errors shown happened on "random
disks" (meaning more than just one device).  What you didn't disclose is
what the disks are attached to.  "camcontrol devlist -v" would have been
a good start, followed by any details of controller/driver/etc. you're
using.  Possibly it's an underlying (silent) storage driver bug.

Finally, and leaving the most important point for last: you didn't state
what FreeBSD version you're using, and ceased to provide uname -a output
(to see kernel build date, etc.).  It matters greatly.

-- 
| Jeremy Chadwick                                   jdc@parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                  Mountain View, CA, USA |
| Making life hard for others since 1977.               PGP 4BD6C0CB |




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20110530103349.GA73825>