Date: Sun, 29 Nov 2009 08:22:26 +1100
From: Peter Jeremy
To: freebsd-current@freebsd.org
Subject: Non-responsive 8.0-RC1
Message-ID: <20091128212226.GA9841@server.vk2pj.dyndns.org>
List-Id: Discussions about the use of FreeBSD-current

My main server is running 8.0/amd64 from between RC1 and RC2 and I've
recently had a couple of long-duration hangs on it, during which
processes doing I/O stop responding.

The first time, it stopped responding for about 25 minutes and then
spontaneously corrected itself. I was logged in remotely the whole
time and Ctrl-T was responding throughout (claiming the process was
'runnable'). I tried starting a second session - which got as far as
reporting the SSH banner I have configured and then did nothing. The
second time lasted about 5 minutes.

I can't find anything in any log files or dmesg. 'vmstat -m' output
looks sensible. Unfortunately, I didn't have access to the console on
either occasion.

The system is a dual-core Athlon with the base OS (root/usr/var) on
UFS and the remaining filesystems on ZFS. It's running SCHED_ULE. It
runs a pair of BOINC processes in the background. The first time, it
should have been otherwise unused apart from a mairix (mail indexing
tool) process that I'd just started. The second time, it would have
been running a buildkernel.

Based on it managing to report the ssh banner (which is stored in
/etc) but not getting to a shell prompt (my home directory is on ZFS),
my initial suspicion was ZFS, but it occurs to me that it could be a
priority-inversion problem with the BOINC processes.

Can anyone suggest where to go looking for a cause?

-- 
Peter Jeremy