Date: Wed, 19 Jul 2006 17:43:05 +0300 From: Kostik Belousov <kostikbel@gmail.com> To: User Freebsd <freebsd@hub.org> Cc: freebsd-stable@freebsd.org, Robert Watson <rwatson@freebsd.org> Subject: Re: file system deadlock - the whole story? Message-ID: <20060719144305.GM1464@deviant.kiev.zoral.com.ua> In-Reply-To: <20060719112208.Y1799@ganymede.hub.org> References: <20060705100403.Y80381@fledge.watson.org> <cone.1152136419.991036.72616.1000@zoraida.natserv.net> <20060705234514.I70011@fledge.watson.org> <20060715000351.U1799@ganymede.hub.org> <20060715035308.GJ32624@deviant.kiev.zoral.com.ua> <20060718074804.W1799@ganymede.hub.org> <20060719112424.GK1464@deviant.kiev.zoral.com.ua> <20060719082627.H1799@ganymede.hub.org> <20060719151327.H5132@fledge.watson.org> <20060719112208.Y1799@ganymede.hub.org>
next in thread | previous in thread | raw e-mail | index | archive | help
--j3zO+32zXj6UcJCE Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Wed, Jul 19, 2006 at 11:23:21AM -0300, User Freebsd wrote: > On Wed, 19 Jul 2006, Robert Watson wrote: >=20 > > > >On Wed, 19 Jul 2006, User Freebsd wrote: > > > >>Also note that under FreeBSD 4.x, all three of these machines were pret= ty=20 > >>much my more solid machines, with even more vServers running on them th= en=20 > >>I'm able to run with 6.x ... once I got rid of using unionfs, stability= =20 > >>skyrocketed :( > >> > >>Hrmmmm ... but, your 'controller driver' comment ... that is one common= =20 > >>thing amongst all three servers ... they are all running the iir driver= =20 > >>... not sure the *exact* controller, but pluto (older Dual-PIII) shows = it=20 > >>as: > > > >Yes, this was going to be my next question -- if you're seeing wedges=20 > >under load and there's a common controller in use, maybe we're looking a= t=20 > >a driver bug. Bugs of those sort typically look a lot like what you=20 > >describe: an I/O is "lost" and so eveything that depends on the I/O wedg= es=20 > >waiting for it, leading to a lot of processes hanging around waiting for= =20 > >vnode locks, etc. >=20 > 'k, but how do we debug *that*? :( If it was one, I'd suspect hardware= =20 > ... but *three*, and only acting up *after* upgrading to FreeBSD 6.x, and= =20 > only acting up under load ... Obvious step would be to replace controller by some different kind. --j3zO+32zXj6UcJCE Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.4 (FreeBSD) iD8DBQFEvkT5C3+MBN1Mb4gRAkNDAKCZORLzP9p4pyUCwPjj///jfwGC7ACg5UCP bGGFe0/owbW1Z5J1eN26Gbs= =2gY6 -----END PGP SIGNATURE----- --j3zO+32zXj6UcJCE--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20060719144305.GM1464>