From owner-freebsd-current@FreeBSD.ORG Wed Sep 5 14:19:23 2007 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id AAAC116A418 for ; Wed, 5 Sep 2007 14:19:23 +0000 (UTC) (envelope-from pjd@garage.freebsd.pl) Received: from mail.garage.freebsd.pl (arm132.internetdsl.tpnet.pl [83.17.198.132]) by mx1.freebsd.org (Postfix) with ESMTP id 4A20D13C45E for ; Wed, 5 Sep 2007 14:19:23 +0000 (UTC) (envelope-from pjd@garage.freebsd.pl) Received: by mail.garage.freebsd.pl (Postfix, from userid 65534) id 69F7A45E98; Wed, 5 Sep 2007 16:19:21 +0200 (CEST) Received: from localhost (pjd.wheel.pl [10.0.1.1]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mail.garage.freebsd.pl (Postfix) with ESMTP id 49A6A45E90; Wed, 5 Sep 2007 16:19:13 +0200 (CEST) Date: Wed, 5 Sep 2007 16:17:59 +0200 From: Pawel Jakub Dawidek To: Kenneth Vestergaard Schmidt Message-ID: <20070905141759.GJ12013@garage.freebsd.pl> References: Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="EEx6GiKZGZ1wKUra" Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2.3i X-PGP-Key-URL: http://people.freebsd.org/~pjd/pjd.asc X-OS: FreeBSD 7.0-CURRENT i386 X-Spam-Checker-Version: SpamAssassin 3.0.4 (2005-06-05) on mail.garage.freebsd.pl X-Spam-Level: X-Spam-Status: No, score=-5.9 required=3.0 tests=ALL_TRUSTED,BAYES_00 autolearn=ham version=3.0.4 Cc: freebsd-current@freebsd.org Subject: Re: Unkillable and runaway processes X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 05 Sep 2007 14:19:23 -0000 --EEx6GiKZGZ1wKUra Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Tue, Sep 04, 2007 at 03:08:20PM +0200, Kenneth Vestergaard Schmidt wrote: > Hello. >=20 > Our ZFS testbed is experiencing some weird problems with rsync. We run a > nightly backup of about 1.6 TB data (that's how much is stored, not how > much is transferred), but after the initial sync I haven't been able to > get the machine through one full cycle. >=20 > After many hours of rsyncing data from 50+ machines, suddenly one > rsync-process will hang, spinning on the CPU. >=20 > It switches state between CPU0, CPU1, RUN and 'zfs:(&', but doesn't > really do anything. It can't be killed, and you can't reboot the machine > - it'll get past syncing disks, but won't shutdown or reboot. >=20 > I can't do an 'ls' in the directory that rsync is running on - it'll > just hang, too. >=20 > The machine is running current from August 29th. >=20 > I could use some pointers on what to do - is there some way I can debug > this better, maybe give some better info? Try disabling ZIL. This looks like a bug was already reported by Kris. This was already reported to OpenSolaris. --=20 Pawel Jakub Dawidek http://www.wheel.pl pjd@FreeBSD.org http://www.FreeBSD.org FreeBSD committer Am I Evil? Yes, I Am! --EEx6GiKZGZ1wKUra Content-Type: application/pgp-signature Content-Disposition: inline -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.4 (FreeBSD) iD8DBQFG3rqXForvXbEpPzQRAn58AJ0eoBSM3HrN1eVBpIX13DqV+kYjXQCcCO3J /n1Zb7ziZA0fAc1mK/iUe5Q= =DZa/ -----END PGP SIGNATURE----- --EEx6GiKZGZ1wKUra--