Skip site navigation (1)Skip section navigation (2)
Date:      Sun, 10 Mar 2013 02:45:57 +0900
From:      Stephane LAPIE <stephane.lapie@darkbsd.org>
To:        freebsd-fs@freebsd.org
Subject:   Re: Panic loop on ZFS with 9.1-RELEASE
Message-ID:  <513B7555.1010701@darkbsd.org>
In-Reply-To: <513B6E1E.6080805@darkbsd.org>
References:  <513B58B6.2090903@darkbsd.org> <513B6E1E.6080805@darkbsd.org>

next in thread | previous in thread | raw e-mail | index | archive | help
This is an OpenPGP/MIME signed message (RFC 2440 and 3156)
--------------enigBBCFCDBBE6DBB23636CFC8BA
Content-Type: text/plain; charset=UTF-8
Content-Transfer-Encoding: quoted-printable

Pinpoint analysis of the zpool on the broken vdev gives the following
information :

# zdb -AAA -e -mm prana 1 33

Metaslabs:
    vdev          1
    metaslabs   145   offset                spacemap          free    =20
    ---------------   -------------------   ---------------   -----------=
--
    metaslab     33   offset  21000000000   spacemap    303   free    11.=
9G
WARNING: zfs: allocating allocated segment(offset=3D2335563722752 size=3D=
1024)

Assertion failed: sm->sm_space =3D=3D space (0x2f927f400 =3D=3D 0x2f927f8=
00),
file
/usr/storage/tech/eirei-no-za.yomi.darkbsd.org/usr/src/cddl/lib/libzpool/=
=2E./../../sys/cddl/contrib/opensolaris/uts/common/fs/zfs/space_map.c,
line 353.
pid 51 (zdb), uid 0: exited on signal 6 (core dumped)
Abort trap (core dumped)

Just in case, root vdev 1 is made of the following devices :
children[1]:
    type: 'raidz'
    id: 1
    guid: 1078755695237588414
    nparity: 1
    metaslab_array: 175
    metaslab_shift: 36
    ashift: 9
    asize: 10001970626560
    is_log: 0
    children[0]:
        type: 'disk'
        id: 0
        guid: 12900041001921590764
        path: '/dev/da10'
        phys_path: '/dev/da10'
        whole_disk: 0
        DTL: 4127
    children[1]:
        type: 'disk'
        id: 1
        guid: 7211789756938666186
        path: '/dev/da3'
        phys_path: '/dev/da3'
        whole_disk: 1
        DTL: 4119
    children[2]:
        type: 'disk'
        id: 2
        guid: 12094368820342087236
        path: '/dev/da5'
        phys_path: '/dev/da5'
        whole_disk: 1
        DTL: 212
    children[3]:
        type: 'disk'
        id: 3
        guid: 6868867539761908697
        path: '/dev/da4'
        phys_path: '/dev/da4'
        whole_disk: 0
        DTL: 4173
    children[4]:
        type: 'disk'
        id: 4
        guid: 3091570768700552191
        path: '/dev/da6'
        phys_path: '/dev/da6'
        whole_disk: 0
        DTL: 4182

At this point I am nearly considering ripping these out and zpool
importing while ignoring missing devices... :/

On 03/10/2013 02:15 AM, Stephane LAPIE wrote:
> Posting a quick update.
>
> I ran a "zdb -emm" command to figure out what was going on, and it blew=

> up in my face with an abort trap here :
> - vdev 0 has 145 metaslabs, which are cleared without any problems.
> - vdev 1 has 145 metaslabs, but fails in the middle :
> metaslab     32   offset  20000000000   spacemap    289   free    1.64G=

>                   segments      19509   maxsize   41.7M   freepct    2%=

> metaslab     33   offset  21000000000   spacemap    303   free    11.9G=

> error: zfs: allocating allocated segment(offset=3D2335563722752 size=3D=
1024)
> Abort trap(core dumped)
>
> Converting offset 2335563722752 from earlier kernel panic messages give=
s
> : 21fca723000, which matches the broken metaslab found by zdb.
>
> Is there anything I can do at this point, using zdb?
> It just sounds surrealistic I have ONE broken metaslab (seemingly?) and=

> that I can't recover anything...
>
> Cheers,
>
> On 03/10/2013 12:43 AM, Stephane LAPIE wrote:
>> Hello list,
>>
>> I currently am faced with a sudden death case I can't understand at al=
l,
>> and I would be very appreciating of any explanation or assistance :(
>>
>> Here is my current kernel version :
>> FreeBSD  9.1-STABLE FreeBSD 9.1-STABLE #5 r245055: Thu Jan 17 13:12:59=

>> JST 2013
>> darksoul@eirei-no-za.yomi.darkbsd.org:/usr/obj/usr/storage/tech/eirei-=
no-za.yomi.darkbsd.org/usr/src/sys/DARK-2012KERN=20
>> amd64
>> (Kernel is basically a lightened GENERIC kernel without VESA options a=
nd
>> unneeded controllers removed)
>>
>> The pool is a set of 3x raidz1 (5 drives), + 2 cache devices + mirrore=
d
>> transaction log
>>
>> Booting and trying to import the pool is met with :
>> Solaris(panic): zfs: panic: allocating allocated
>> segment(offset=3D2335563722752 size=3D1024)
>>
>> Booting single mode on my emergency flash card with a base OS and zpoo=
l
>> import -o readonly=3Don is met with :
>> panic: solaris assert: zio->io_type !=3D ZIO_TYPE_WRITE ||
>> spa_writeable(spa), file:
>> /usr/storage/tech/eirei-no-za.yomi.darkbsd.org/usr/src/sys/modules/zfs=
/../../cddl/contrib/opensolaris/uts/common/fs/zfs/zio.c,
>> line: 2461
>>
>> I tried zpool import -F -f, zpool import -F -f -m after removing the
>> mirrored transaction log devices, but after 40s of trying to import, i=
t
>> just blows up.
>>
>> I am currently running "zdb -emm" as per the procedure suggested here =
:
>> http://simplex.swordsaint.net/?p=3D199 if only to get some debug infor=
mation.
>>
>> Thanks in advance for your time.
>>
>> Cheers,
>>
>>
>> --=20
>> Stephane LAPIE, EPITA SRS, Promo 2005
>> "Even when they have digital readouts, I can't understand them."
>> --MegaTokyo
>
> --=20
> Stephane LAPIE, EPITA SRS, Promo 2005
> "Even when they have digital readouts, I can't understand them."
> --MegaTokyo


--------------enigBBCFCDBBE6DBB23636CFC8BA
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: OpenPGP digital signature
Content-Disposition: attachment; filename="signature.asc"

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.11 (GNU/Linux)
Comment: Using GnuPG with undefined - http://www.enigmail.net/

iF4EAREIAAYFAlE7dVUACgkQDJ4OK7D3FWQVbgD/ajAVkm/QkciZuNmMT11EkSDV
3semGzseAYsYopdpI40A/0u9MYMeUjVK1r8nUaNQXBkCQhxE760uIAgJMn+Hp3w4
=Z06k
-----END PGP SIGNATURE-----

--------------enigBBCFCDBBE6DBB23636CFC8BA--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?513B7555.1010701>