Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 22 Mar 2011 16:48:11 +0100
From:      =?ISO-8859-1?Q?Micka=EBl_Can=E9vet?= <canevet@embl.fr>
To:        freebsd-bugs@freebsd.org
Cc:        freebsd-geom@freebsd.org
Subject:   Re: "Fatal double fault" panic
Message-ID:  <1300808893.2530.1.camel@pc286.embl.fr>
In-Reply-To: <20110322124635.GA1618@in-addr.com>
References:  <1300791194.2566.37.camel@pc286.embl.fr> <20110322124635.GA1618@in-addr.com>

next in thread | previous in thread | raw e-mail | index | archive | help

--=-Xxa6XAJ1UXFPXHhS5d9G
Content-Type: text/plain; charset="UTF-8"
Content-Transfer-Encoding: quoted-printable

Hi,=20

I found that /etc/periodic/security/100.chksetuid does a find on the
whole filesystem every night.

I have a lot of files (around 40 millions), maybe it's the origin of my
crash. The thing is that my redundant production NAS has crashed, but my
backup server that is not redundant (no HAST layer) and has more files
(65 millions) never crashed. So maybe the problem comes from geom.

In the mean time, I will disable this check in /etc/periodic.conf.

Cheers,
Micka=C3=ABl

On Tue, 2011-03-22 at 08:46 -0400, Gary Palmer wrote:=20
> On Tue, Mar 22, 2011 at 11:53:14AM +0100, Micka?l Can?vet wrote:
> > Hi,
> >=20
> > I have a redundant NAS made of FreeBSD + HAST + ZFS and 24TB of disks.
> >=20
> > This morning my primary node crashed around 4:20am.
> >=20
> > On the console I can see:
> >=20
> > Fatal double fault
> > rip =3D 0xffffffff805e78b8
> > rsp =3D 0xffffff8485d43fc0
> > rbp =3D 0xffffff8485d44010
> > cpuid =3D 1; apic id =3D 12
> > panic: double fault
> > cpuid =3D 1
> > KDB: stack backstrace:
> > #0 0xffffffff805f4e0e at kdb_backtrace+0x5e
> > #1 0xffffffff805c2d07 at panic+0x187
> > #2 0xffffffff808ac366 at dblfault_handler+0x96
> > #3 0xffffffff808950bd at Xdblfault+0xad
> > Uptime: 4d14h7m5s
> > Cannot sump, Device not defined or unavailable.
> >=20
> > The only thing I can see on my munin graphs is a strange IO activity
> > (disk and network over my HAST link) that starts at 3am every morning
> > and last about 1 hour and a half (and so until crash this morning). I
> > double checked my scheduled scripts and I do not do anything at that
> > time. So I suspect a system script to be responsible of this activity.
> > I'm not sure that this IO activity results in the crash, but that the
> > only track I have.
>=20
> 3am is when the scripts in /etc/periodic/daily fire
>=20
> # grep daily /etc/crontab
> # Perform daily/weekly/monthly maintenance.
> 1       3       *       *       *       root    periodic daily
>=20
>=20
> Regards,
>=20
> Gary
>=20



--=-Xxa6XAJ1UXFPXHhS5d9G
Content-Type: application/pgp-signature; name="signature.asc"
Content-Description: This is a digitally signed message part

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v1.4.11 (GNU/Linux)

iEYEABECAAYFAk2IxKsACgkQZjBmN5Hi/YbxSgCfV1bKqGSFmhShgDR9FnrGZtUL
8iIAnimtZp4YlThyDyKJ97dOCmZ2X3y4
=NJrL
-----END PGP SIGNATURE-----

--=-Xxa6XAJ1UXFPXHhS5d9G--




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?1300808893.2530.1.camel>