From owner-freebsd-geom@FreeBSD.ORG Mon Mar 21 11:06:57 2011
Date: Mon, 21 Mar 2011 11:06:56 GMT
Message-Id: <201103211106.p2LB6uSJ085978@freefall.freebsd.org>
From: FreeBSD bugmaster
To: freebsd-geom@FreeBSD.org
Subject: Current problem reports assigned to freebsd-geom@FreeBSD.org

Note: to view an individual PR, use:
  http://www.freebsd.org/cgi/query-pr.cgi?pr=(number).

The following is a listing of current problems submitted by FreeBSD users.
These represent problem reports covering all versions including
experimental development code and obsolete releases.

S Tracker      Resp.      Description
--------------------------------------------------------------------------------
o kern/154226  geom       [geom] GEOM label does not change when you modify them
o kern/152609  geom       [geli] geli onetime on gzero panics
o kern/150858  geom       [geom] [geom_label] [patch] glabel(8) is not compatibl
o kern/150626  geom       [geom] [gjournal] gjournal(8) destroys label
o kern/150555  geom       [geom] gjournal unusable on GPT partitions
o kern/150334  geom       [geom] [udf] [patch] geom label does not support UDF
o kern/149762  geom       volume labels with rogue characters
o bin/149215   geom       [panic] [geom_part] gpart(8): Delete linux's slice via
o kern/147667  geom       [gmirror] Booting with one component of a gmirror, the
o kern/145818  geom       [geom] geom_stat_open showing cached information for n
o kern/145042  geom       [geom] System stops booting after printing message "GE
o kern/144905  geom       [geom][geom_part] panic in gpart_ctlreq when unpluggin
o kern/143455  geom       gstripe(8) in RELENG_8 (31st Jan 2010) broken
o kern/142563  geom       [geom] [hang] ioctl freeze in zpool
o kern/141740  geom       [geom] gjournal(8): g_journal_destroy concurrent error
o kern/140352  geom       [geom] gjournal + glabel not working
o kern/135898  geom       [geom] Severe filesystem corruption - large files or l
o kern/134922  geom       [gmirror] [panic] kernel panic when use fdisk on disk
o kern/134113  geom       [geli] Problem setting secondary GELI key
o kern/133931  geom       [geli] [request] intentionally wrong password to destr
o bin/132845   geom       [geom] [patch] ggated(8) does not close files opened a
o kern/132273  geom       glabel(8): [patch] failing on journaled partition
o kern/131353  geom       [geom] gjournal(8) kernel lock
o kern/129674  geom       [geom] gjournal root did not mount on boot
o kern/129645  geom       gjournal(8): GEOM_JOURNAL causes system to fail to boo
o kern/129245  geom       [geom] gcache is more suitable for suffix based provid
f kern/128276  geom       [gmirror] machine lock up when gmirror module is used
o kern/127420  geom       [geom] [gjournal] [panic] Journal overflow on gmirrore
o kern/124973  geom       [gjournal] [patch] boot order affects geom_journal con
o kern/124969  geom       gvinum(8): gvinum raid5 plex does not detect missing s
o kern/123962  geom       [panic] [gjournal] gjournal (455Gb data, 8Gb journal),
o kern/123122  geom       [geom] GEOM / gjournal kernel lock
o kern/122738  geom       [geom] gmirror list "losts consumers" after gmirror de
o kern/122067  geom       [geom] [panic] Geom crashed during boot
o kern/121364  geom       [gmirror] Removing all providers create a "zombie" mir
o kern/120091  geom       [geom] [geli] [gjournal] geli does not prompt for pass
o kern/115856  geom       [geli] ZFS thought it was degraded when it should have
o kern/115547  geom       [geom] [patch] [request] let GEOM Eli get password fro
o kern/114532  geom       [geom] GEOM_MIRROR shows up in kldstat even if compile
f kern/113957  geom       [gmirror] gmirror is intermittently reporting a degrad
o kern/113837  geom       [geom] unable to access 1024 sector size storage
o kern/113419  geom       [geom] geom fox multipathing not failing back
o kern/107707  geom       [geom] [patch] [request] add new class geom_xbox360 to
o kern/94632   geom       [geom] Kernel output resets input while GELI asks for
o kern/90582   geom       [geom] [panic] Restore cause panic string (ffs_blkfree
o bin/90093    geom       fdisk(8) incapable of altering in-core geometry
f kern/88601   geom       [geli] geli cause kernel panic under heavy disk usage
o kern/87544   geom       [gbde] mmaping large files on a gbde filesystem deadlo
o bin/86388    geom       [geom] [geom_part] periodic(8) daily should backup gpa
o kern/84556   geom       [geom] [panic] GBDE-encrypted swap causes panic at shu
o kern/79251   geom       [2TB] newfs fails on 2.6TB gbde device
o kern/79035   geom       [vinum] gvinum unable to create a striped set of mirro
o bin/78131    geom       gbde(8) "destroy" not working.

53 problems total.

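
The PR lookup URL given in the note at the top of this listing can be filled in from the shell. This is only a sketch: the function name pr_url is made up for illustration, and only the URL pattern comes from the message itself.

```shell
# pr_url is a hypothetical helper name, not a FreeBSD tool.
# It prints the query-pr.cgi URL for the PR number passed as $1.
pr_url() {
    printf 'http://www.freebsd.org/cgi/query-pr.cgi?pr=%s\n' "$1"
}

# Example: the first PR in the listing (kern/154226).
pr_url 154226
```

Only the numeric part of the PR identifier is passed here, matching the "(number)" placeholder in the note.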
From owner-freebsd-geom@FreeBSD.ORG Tue Mar 22 16:17:42 2011
From: Mickaël Canévet
To: freebsd-bugs@freebsd.org
Cc: freebsd-geom@freebsd.org
Date: Tue, 22 Mar 2011 16:48:11 +0100
Message-ID: <1300808893.2530.1.camel@pc286.embl.fr>
In-Reply-To: <20110322124635.GA1618@in-addr.com>
References: <1300791194.2566.37.camel@pc286.embl.fr> <20110322124635.GA1618@in-addr.com>
Subject: Re: "Fatal double fault" panic

Hi,

I found that /etc/periodic/security/100.chksetuid does a find on the
whole filesystem every night. I have a lot of files (around 40 million),
so maybe that is the origin of my crash.

The thing is that my redundant production NAS has crashed, but my backup
server, which is not redundant (no HAST layer) and has more files
(65 million), has never crashed. So maybe the problem comes from geom.

In the meantime, I will disable this check in /etc/periodic.conf.

Cheers,
Mickaël

On Tue, 2011-03-22 at 08:46 -0400, Gary Palmer wrote:
> On Tue, Mar 22, 2011 at 11:53:14AM +0100, Mickaël Canévet wrote:
> > Hi,
> >
> > I have a redundant NAS made of FreeBSD + HAST + ZFS and 24TB of disks.
> >
> > This morning my primary node crashed around 4:20am.
> >
> > On the console I can see:
> >
> > Fatal double fault
> > rip = 0xffffffff805e78b8
> > rsp = 0xffffff8485d43fc0
> > rbp = 0xffffff8485d44010
> > cpuid = 1; apic id = 12
> > panic: double fault
> > cpuid = 1
> > KDB: stack backstrace:
> > #0 0xffffffff805f4e0e at kdb_backtrace+0x5e
> > #1 0xffffffff805c2d07 at panic+0x187
> > #2 0xffffffff808ac366 at dblfault_handler+0x96
> > #3 0xffffffff808950bd at Xdblfault+0xad
> > Uptime: 4d14h7m5s
> > Cannot dump. Device not defined or unavailable.
> >
> > The only thing I can see on my munin graphs is a strange IO activity
> > (disk and network over my HAST link) that starts at 3am every morning
> > and last about 1 hour and a half (and so until crash this morning). I
> > double checked my scheduled scripts and I do not do anything at that
> > time. So I suspect a system script to be responsible of this activity.
> > I'm not sure that this IO activity results in the crash, but that the
> > only track I have.
>
> 3am is when the scripts in /etc/periodic/daily fire
>
> # grep daily /etc/crontab
> # Perform daily/weekly/monthly maintenance.
> 1 3 * * * root periodic daily
>
>
> Regards,
>
> Gary
>
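
Disabling the setuid check mentioned in the reply could look like the following override in /etc/periodic.conf. This is a sketch, not a confirmed fix for the panic: the variable name is an assumption based on the FreeBSD 8.x defaults in /etc/defaults/periodic.conf, and other releases may name the knob differently, so check the defaults file on the system in question.

```shell
# /etc/periodic.conf (local overrides of /etc/defaults/periodic.conf)
# Assumption: FreeBSD 8.x variable name controlling 100.chksetuid.
# "NO" skips the nightly find over all local filesystems.
daily_status_security_chksetuid_enable="NO"
```

The file uses sh variable syntax, so the override takes effect the next time periodic runs the security scripts; no service restart is involved.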