Date: Fri, 6 Oct 2006 13:57:14 -0400
From: Kris Kennaway <kris@obsecurity.org>
To: Kostik Belousov <kostikbel@gmail.com>
Cc: Vivek Khera <vivek@khera.org>, stable@freebsd.org
Subject: Re: ffs snapshot lockup
Message-ID: <20061006175714.GA15880@xor.obsecurity.org>
In-Reply-To: <20061006073950.GD26993@deviant.kiev.zoral.com.ua>
References: <917B087C-5E13-4D7F-94FA-95CB0E5C1884@khera.org>
 <20060922190328.GA64849@xor.obsecurity.org>
 <555B84D2-520F-44D6-84D6-CF9CE7EE47C7@khera.org>
 <20060922203654.GA65693@xor.obsecurity.org>
 <847DD3A5-D5DD-4D3E-B755-64B13D1DA506@khera.org>
 <20061003084315.GA89654@deviant.kiev.zoral.com.ua>
 <40CE3CF0-49D2-4335-A0B8-34B5251E9E19@khera.org>
 <20061005083027.GK89654@deviant.kiev.zoral.com.ua>
 <5178C89F-B645-4A82-A7C9-FC09D458FE30@khera.org>
 <20061006073950.GD26993@deviant.kiev.zoral.com.ua>

On Fri, Oct 06, 2006 at 10:39:50AM +0300, Kostik Belousov wrote:
> On Thu, Oct 05, 2006 at 10:01:07AM -0400, Vivek Khera wrote:
> >
> > On Oct 5, 2006, at 4:30 AM, Kostik Belousov wrote:
> >
> > >>
> > >>The network load was minimal at the time. I had everyone log out and
> > >>close mail etc.
> > >>
> > >
> > >What were the symptoms of locked system ? Could you log in on
> > >console, or
> > >do something at the shell prompt on console ?
> >
> > Console was non-responsive. This time dump locked doing /usr so
> > pretty much anything you try to run will block. When the lockup
> > happens when dump is running on my home dir (/u/yertle1) partition,
> > as long as you don't need that partition you can log in and run any
> > programs you like. I have a service account whose home dir is in
> > /var and was able to login that time to that account. No such luck
> > this time since any activity pretty much uses /usr.
> >
> > Ping was responding (our monitoring didn't complain it was down).
> >
> > The only thing I could do was break to debugger on the console.
> >
> This is very strange. You 3 instances of getty where just reading the
> tty input, and all suspectible processes (like sshd) are waiting on net
> events. No processes are blocked on the fs. One nfsd is serving the request,
> and dump is active.

To repeat something I said earlier: when creating a snapshot (which
dump -L does, for example), the entire system may become unresponsive
until the snapshot completes, which can take many minutes.

How long are you waiting before pronouncing the system deadlocked?
What does ^T on the console (e.g. when trying to log in) show you?

Kris