Date: Fri, 30 May 2008 20:22:05 +0200 From: Fabian Keil <freebsd-listen@fabiankeil.de> To: freebsd-current@freebsd.org Subject: panic: solaris assert: vdev_config_sync(rvd, txg) == 0, file: /usr/src/sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/spa.c, line: 3014 Message-ID: <20080530202205.4b79fecd@fabiankeil.de>
next in thread | raw e-mail | index | archive | help
--Sig_/BaiOXDnb5h_oQO_qvW64AHN Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: quoted-printable A few days ago I used fdisk -p, modified two slice types in the output and used it as fdisk "config file" with the intention to merely change the slice types on disk. As an extra service, fdisk "adjusted" the size of the last slice (ad0s3) for me, thus the last sectors of ad0s3f became unreachable and geli could no longer read the meta information. ad0s3f.eli is part of the following ZFS pool: fk@TP51 ~ $sudo zpool status tank pool: tank state: ONLINE scrub: scrub completed with 0 errors on Wed May 28 22:05:33 2008 config: NAME STATE READ WRITE CKSUM tank ONLINE 0 0 0 ad0s3f.eli ONLINE 0 0 0 ad0s2.eli ONLINE 0 0 0 errors: No known data errors After fdisk's "adjustment" ad0s2.eli was still available, while ad0s3f.eli wasn't. This reproducible caused the following panic a few seconds after loading the zfs module: Unread portion of the kernel message buffer: panic: solaris assert: vdev_config_sync(rvd, txg) =3D=3D 0, file: /usr/src/= sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/spa.c, lin= e: 3014 cpuid =3D 0 KDB: enter: panic panic: from debugger cpuid =3D 0 Uptime: 59s Physical memory: 998 MB Dumping 85 MB: 70 54 38 22 6 [...] (kgdb) where #0 doadump () at pcpu.h:196 #1 0xc05c2446 in boot (howto=3D260) at /usr/src/sys/kern/kern_shutdown.c:4= 18 #2 0xc05c2673 in panic (fmt=3DVariable "fmt" is not available. ) at /usr/src/sys/kern/kern_shutdown.c:572 #3 0xc04ab827 in db_panic (addr=3DCould not find the frame base for "db_pa= nic". ) at /usr/src/sys/ddb/db_command.c:446 #4 0xc04ac1dc in db_command (last_cmdp=3D0xc08d4190, cmd_table=3D0x0, dopa= ger=3D1) at /usr/src/sys/ddb/db_command.c:413 #5 0xc04ac2ea in db_command_loop () at /usr/src/sys/ddb/db_command.c:466 #6 0xc04adadd in db_trap (type=3D3, code=3D0) at /usr/src/sys/ddb/db_main.= c:228 #7 0xc05e97e6 in kdb_trap (type=3D3, code=3D0, tf=3D0xf3b10b24) at /usr/sr= c/sys/kern/subr_kdb.c:534 #8 0xc08192eb in trap (frame=3D0xf3b10b24) at /usr/src/sys/i386/i386/trap.= c:683 #9 0xc07feddb in calltrap () at /usr/src/sys/i386/i386/exception.s:165 #10 0xc05e996a in kdb_enter (why=3D0xc085d1da "panic", msg=3D0xc085d1da "pa= nic") at cpufunc.h:60 #11 0xc05c265c in panic (fmt=3D0xc552b214 "solaris assert: %s, file: %s, li= ne: %d") at /usr/src/sys/kern/kern_shutdown.c:556 #12 0xc54edb91 in spa_sync (spa=3DVariable "spa" is not available. ) at /usr/src/sys/modules/zfs/../../cddl/contrib/opensolaris/uts/common/fs/= zfs/spa.c:3014 #13 0xc54f4aca in txg_sync_thread (arg=3D0xc4c56400) at /usr/src/sys/module= s/zfs/../../cddl/contrib/opensolaris/uts/common/fs/zfs/txg.c:331 #14 0xc05a63e4 in fork_exit (callout=3D0xc54f48e0 <txg_sync_thread>, arg=3D= 0xc4c56400, frame=3D0xf3b10d38) at /usr/src/sys/kern/kern_fork.c:812 #15 0xc07fee50 in fork_trampoline () at /usr/src/sys/i386/i386/exception.s:= 270 With both pool members unavailable the panic didn't occur: fk@TP51 ~ $sudo zpool status pool: tank state: FAULTED status: One or more devices could not be used because the label is missing= =20 or invalid. There are insufficient replicas for the pool to contin= ue functioning. action: Destroy and re-create the pool from a backup source. see: http://www.sun.com/msg/ZFS-8000-5E scrub: none requested config: NAME STATE READ WRITE CKSUM tank FAULTED 0 0 0 corrupted data ad0s3f UNAVAIL 0 0 0 corrupted data ad0s2 UNAVAIL 0 0 0 corrupted data After disabling the corrupted pool, I was also unable to import any other: [My notes are incomplete, but I think I just used "zpool export tank" here.] fk@TP51 ~ $sudo zpool status no pools available=20 fk@TP51 ~ $sudo zpool import sv120 Assertion failed: ((null)), function fd =3D=3D 0, file /usr/src/cddl/lib/li= bzfs/../../../cddl/contrib/opensolaris/lib/libzfs/common/libzfs_import.c, l= ine 771. Abort trap: 6 (core dumped) I'm using FreeBSD 8.0-CURRENT #0: Tue May 27 21:38:01 CEST 2008 fk@TP51.local:/usr/obj/usr/src/sys/THINKPAD i386. Should I file a PR about this (the ZFS part)? Given that a fdisk hack with the offending "adjustment" code removed was able to get the whole ad0s3f back, I'm also wondering if it wouldn't make sense to provide fdisk with a "no adjustments, please" option? Fabian --Sig_/BaiOXDnb5h_oQO_qvW64AHN Content-Type: application/pgp-signature; name=signature.asc Content-Disposition: attachment; filename=signature.asc -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.9 (FreeBSD) iEYEARECAAYFAkhARc0ACgkQBYqIVf93VJ0wpQCgr41yXEDU3UvU4SOAQuwABsmu cboAoKWAtQlIUJ+h7V88lZShAksoEApI =UnIz -----END PGP SIGNATURE----- --Sig_/BaiOXDnb5h_oQO_qvW64AHN--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20080530202205.4b79fecd>