Date: Thu, 5 Oct 2006 11:30:27 +0300 From: Kostik Belousov <kostikbel@gmail.com> To: Vivek Khera <vivek@khera.org> Cc: stable@freebsd.org Subject: Re: ffs snapshot lockup Message-ID: <20061005083027.GK89654@deviant.kiev.zoral.com.ua> In-Reply-To: <40CE3CF0-49D2-4335-A0B8-34B5251E9E19@khera.org> References: <917B087C-5E13-4D7F-94FA-95CB0E5C1884@khera.org> <20060922190328.GA64849@xor.obsecurity.org> <555B84D2-520F-44D6-84D6-CF9CE7EE47C7@khera.org> <20060922203654.GA65693@xor.obsecurity.org> <847DD3A5-D5DD-4D3E-B755-64B13D1DA506@khera.org> <20061003084315.GA89654@deviant.kiev.zoral.com.ua> <40CE3CF0-49D2-4335-A0B8-34B5251E9E19@khera.org>
next in thread | previous in thread | raw e-mail | index | archive | help
--+Hr//EUsa8//ouuB Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Wed, Oct 04, 2006 at 05:16:53PM -0400, Vivek Khera wrote: >=20 > On Oct 3, 2006, at 4:43 AM, Kostik Belousov wrote: >=20 > >>Details are posted at http://vivek.khera.org/scratch/crashlogs/ > >> > >>I have the crashdumps available to a kernel hacker upon request (i'd > >>rather not make them generally available to the public...) > >> > >It seems that you have snapshotted fs exported by nfsd ? At least, =20 > >18a is > >definitely the case. I have the patch (for current) that shall fix =20 > >the issue. > >In fact, you need two patches: >=20 > As per advice of Kris Kenneway, I turned off the software watchdog to =20 > rule out that as my problem. Then I ran a level 3 dump. Dump of root =20 > fs went fine, then it proceeded to do /usr. After a few minutes it =20 > locked up. Typescript 20 at the above URL shows the debugging info =20 > from the break into debugger of the locked up system. Since /usr was =20 > locked, nobody could log in at all. >=20 > The network load was minimal at the time. I had everyone log out and =20 > close mail etc. >=20 What were the symptoms of locked system ? Could you log in on console, or do something at the shell prompt on console ? Also, did the system respond to the pings ? Fs-related deadlocks (as well as stalled disk io) usually do not prevent lowest levels of the isr/network stack from working. Again, I do not see the fs deadlock per se in the supplied script. Dump does disk io, it seems that nfsd tries to serve some request. Sshd looks to be ready to accept connections. If console is available, but ping responses not arrive, this is definitely network card problem. --+Hr//EUsa8//ouuB Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.5 (FreeBSD) iD8DBQFFJMKjC3+MBN1Mb4gRAqFmAKCucL3cM5rG0NRnf62VisTTomK/xACbBVeb cxFiuvG1eCxhPMPaLmWX+tg= =opbq -----END PGP SIGNATURE----- --+Hr//EUsa8//ouuB--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20061005083027.GK89654>