From: Kostik Belousov
To: User Freebsd
Cc: freebsd-stable@freebsd.org, Robert Watson
Date: Wed, 19 Jul 2006 17:43:05 +0300
Subject: Re: file system deadlock - the whole story?
Message-ID: <20060719144305.GM1464@deviant.kiev.zoral.com.ua>
In-Reply-To: <20060719112208.Y1799@ganymede.hub.org>
List-Id: Production branch of FreeBSD source code

On Wed, Jul 19, 2006 at 11:23:21AM -0300, User Freebsd wrote:
> On Wed, 19 Jul 2006, Robert Watson wrote:
>
> >
> >On Wed, 19 Jul 2006, User Freebsd wrote:
> >
> >>Also note that under FreeBSD 4.x, all three of these machines were pretty
> >>much my more solid machines, with even more vServers running on them than
> >>I'm able to run with 6.x ... once I got rid of using unionfs, stability
> >>skyrocketed :(
> >>
> >>Hrmmmm ... but, your 'controller driver' comment ... that is one common
> >>thing amongst all three servers ... they are all running the iir driver
> >>... not sure the *exact* controller, but pluto (older Dual-PIII) shows it
> >>as:
> >
> >Yes, this was going to be my next question -- if you're seeing wedges
> >under load and there's a common controller in use, maybe we're looking at
> >a driver bug.  Bugs of that sort typically look a lot like what you
> >describe: an I/O is "lost" and so everything that depends on the I/O wedges
> >waiting for it, leading to a lot of processes hanging around waiting for
> >vnode locks, etc.
>
> 'k, but how do we debug *that*? :(  If it was one, I'd suspect hardware
> ... but *three*, and only acting up *after* upgrading to FreeBSD 6.x, and
> only acting up under load ...

The obvious step would be to replace the controller with a different kind.