From owner-freebsd-current@FreeBSD.ORG Mon Nov 30 08:13:45 2009 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1A444106566B for ; Mon, 30 Nov 2009 08:13:45 +0000 (UTC) (envelope-from peterjeremy@acm.org) Received: from mail34.syd.optusnet.com.au (mail34.syd.optusnet.com.au [211.29.133.218]) by mx1.freebsd.org (Postfix) with ESMTP id 97C0D8FC08 for ; Mon, 30 Nov 2009 08:13:44 +0000 (UTC) Received: from server.vk2pj.dyndns.org (c122-106-232-83.belrs3.nsw.optusnet.com.au [122.106.232.83]) by mail34.syd.optusnet.com.au (8.13.1/8.13.1) with ESMTP id nAU8DWlO030130 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Mon, 30 Nov 2009 19:13:34 +1100 Received: from server.vk2pj.dyndns.org (localhost.vk2pj.dyndns.org [127.0.0.1]) by server.vk2pj.dyndns.org (8.14.3/8.14.3) with ESMTP id nAU8DVuc002326; Mon, 30 Nov 2009 19:13:31 +1100 (EST) (envelope-from peter@server.vk2pj.dyndns.org) Received: (from peter@localhost) by server.vk2pj.dyndns.org (8.14.3/8.14.3/Submit) id nAU8DUor002325; Mon, 30 Nov 2009 19:13:30 +1100 (EST) (envelope-from peter) Date: Mon, 30 Nov 2009 19:13:30 +1100 From: Peter Jeremy To: Thomas Backman Message-ID: <20091130081330.GA2202@server.vk2pj.dyndns.org> References: <20091128212226.GA9841@server.vk2pj.dyndns.org> <3ABF47F1-86EC-4CF2-9D42-86344D0F455B@exscape.org> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="T4sUOijqQbZv57TR" Content-Disposition: inline In-Reply-To: <3ABF47F1-86EC-4CF2-9D42-86344D0F455B@exscape.org> X-PGP-Key: http://members.optusnet.com.au/peterjeremy/pubkey.asc User-Agent: Mutt/1.5.20 (2009-06-14) Cc: freebsd-current@freebsd.org Subject: Re: Non-responsive 8.0-RC1 X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 30 Nov 2009 08:13:45 -0000 --T4sUOijqQbZv57TR Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 2009-Nov-29 08:56:55 +0100, Thomas Backman wrote: > >On Nov 28, 2009, at 10:22 PM, Peter Jeremy wrote: > >> My main server is running 8.0/amd64 from between RC1 and RC2 and I've >> recently had a couple of long-duration hangs on it during which time >> processes doing I/O will stop responding. I forgot to mention that I checked SMART state on the disks and also did a 'zpool scrub' after the first occurrence - no problems showed up. It actually "hung" again just after I sent the original mail. This time I managed to get console access and could check the kernel state. This showed that a number of processes were blocked on ZFS locks. The most commonly reported state was 'tx->tx_quiesce_done_cv)'. It had been up for about 30 days before I noticed any problems and seems to have been getting more obvious so it is also possible that it's related to uptime - either a resource leak somewhere (though there was nothing obvious) or memory fragmentation. >Hmm, I know there was some fix to the scheduler re: thread priority, >and it wouldn't surprise me if it was after your revision. After looking around in the kernel, I'm now confident that it's not a priority-inversion issue as the BOINC processes all appeared to be running normally and not holding locks. >My advice would be to upgrade to -RELEASE if possible. If not, at >least check whether your build should be affected. I have updated to a recent 8-stable and will see what happens. --=20 Peter Jeremy --T4sUOijqQbZv57TR Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.13 (FreeBSD) iEYEARECAAYFAksTfqoACgkQ/opHv/APuIdDiQCeMYxNFM0rgtiJUjt9hKnsC9U/ khMAn3omYgPFukvzSo4XEWISEinxBAAL =R42y -----END PGP SIGNATURE----- --T4sUOijqQbZv57TR--