From owner-freebsd-current@FreeBSD.ORG Wed Jun 20 16:18:12 2007 Return-Path: X-Original-To: Current@FreeBSD.org Delivered-To: freebsd-current@FreeBSD.ORG Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 3AAC216A4CC for ; Wed, 20 Jun 2007 16:18:12 +0000 (UTC) (envelope-from marcus@FreeBSD.org) Received: from creme-brulee.marcuscom.com (creme-brulee.marcuscom.com [24.172.16.118]) by mx1.freebsd.org (Postfix) with ESMTP id 5D90F13C534 for ; Wed, 20 Jun 2007 16:18:01 +0000 (UTC) (envelope-from marcus@FreeBSD.org) Received: from [IPv6:2001:470:1f00:2464::4] (shumai.marcuscom.com [IPv6:2001:470:1f00:2464::4]) by creme-brulee.marcuscom.com (8.14.1/8.14.1) with ESMTP id l5KGIp97011740; Wed, 20 Jun 2007 12:18:51 -0400 (EDT) (envelope-from marcus@FreeBSD.org) From: Joe Marcus Clarke To: Kris Kennaway In-Reply-To: <20070620160306.GA74674@rot26.obsecurity.org> References: <1182354823.6504.23.camel@shumai.marcuscom.com> <20070620160306.GA74674@rot26.obsecurity.org> Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="=-7jQwtNdqI5Hzw+ksxLKN" Organization: FreeBSD, Inc. Date: Wed, 20 Jun 2007 12:17:54 -0400 Message-Id: <1182356274.6504.30.camel@shumai.marcuscom.com> Mime-Version: 1.0 X-Mailer: Evolution 2.10.2 FreeBSD GNOME Team Port X-Spam-Status: No, score=-2.6 required=5.0 tests=BAYES_00,NO_RELAYS autolearn=no version=3.2.0 X-Spam-Checker-Version: SpamAssassin 3.2.0 (2007-05-01) on creme-brulee.marcuscom.com Cc: Current@FreeBSD.org Subject: Re: ZFS and deadlock with {nullfs,NFS} X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 20 Jun 2007 16:18:12 -0000 --=-7jQwtNdqI5Hzw+ksxLKN Content-Type: text/plain Content-Transfer-Encoding: quoted-printable On Wed, 2007-06-20 at 12:03 -0400, Kris Kennaway wrote: > On Wed, Jun 20, 2007 at 11:53:43AM -0400, Joe Marcus Clarke wrote: > > I've resurrected by amd64 Tinderbox with a ZFS base, and I've been > > seeing a 100% reproducible deadlock when I use it with either localhost > > NFS or nullfs. When this occurs, the CPU is 100% idle, but I can no > > longer connect via SSH, and the box will only reboot from the debugger. > > I know there are some tuning bits I can tweak, but all I've run across > > is for memory consumption. Any pointers would be helpful. I'm also at > > the debugger, so if there is anything I can do to help troubleshoot why > > this is happening, please let me know. =20 > >=20 > > This box is -CURRENT as of June 19, 2007. It has a GENERIC kernel minu= s > > devices I do not have (i.e. SMP kernel). I am currently using nullfs > > for the Tinderbox. The process that most regularly locks up is mtree. > > Here is the trace: >=20 > > A full process list from the debugger can be found at > > http://www.marcuscom.com/downloads/cobbler_proc.txt . >=20 > 404 at the moment, but look for processes involving zil* in the > backtrace. I had to disable zil (vfs.zfs.zil_disable=3D1 tunable) to > prevent low-memory deadlocks on my machines. Since then it's been > fine. Fixed, sorry. >=20 > You may also wish to use my patches (see the archives) to improve > performance and low-memory behaviour. Thanks for the advice. I'll check. I didn't think low memory since it didn't look like I was using much. Even now with the box locked, I have 1035 MB free with no swap in use (this box has 2 GB total). Joe --=20 Joe Marcus Clarke FreeBSD GNOME Team :: gnome@FreeBSD.org FreeNode / #freebsd-gnome http://www.FreeBSD.org/gnome --=-7jQwtNdqI5Hzw+ksxLKN Content-Type: application/pgp-signature; name=signature.asc Content-Description: This is a digitally signed message part -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.7 (FreeBSD) iD8DBQBGeVMxb2iPiv4Uz4cRAlOqAKCXNQ87JvyoD9B7QW7P8b6gPTv4JACgiukS u+MotSjBQlsWSnzjEKnc06k= =732j -----END PGP SIGNATURE----- --=-7jQwtNdqI5Hzw+ksxLKN--