From owner-freebsd-fs@FreeBSD.ORG Wed Mar 6 11:46:32 2013 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.FreeBSD.org [8.8.178.115]) by hub.freebsd.org (Postfix) with ESMTP id AFE174B4 for ; Wed, 6 Mar 2013 11:46:32 +0000 (UTC) (envelope-from freebsd-fs@m.gmane.org) Received: from plane.gmane.org (plane.gmane.org [80.91.229.3]) by mx1.freebsd.org (Postfix) with ESMTP id 4BCE3399 for ; Wed, 6 Mar 2013 11:46:31 +0000 (UTC) Received: from list by plane.gmane.org with local (Exim 4.69) (envelope-from ) id 1UDCo7-00038q-A8 for freebsd-fs@freebsd.org; Wed, 06 Mar 2013 12:46:51 +0100 Received: from lara.cc.fer.hr ([161.53.72.113]) by main.gmane.org with esmtp (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 06 Mar 2013 12:46:51 +0100 Received: from ivoras by lara.cc.fer.hr with local (Gmexim 0.1 (Debian)) id 1AlnuQ-0007hv-00 for ; Wed, 06 Mar 2013 12:46:51 +0100 X-Injected-Via-Gmane: http://gmane.org/ To: freebsd-fs@freebsd.org From: Ivan Voras Subject: ZFS deadlock (?) Date: Wed, 06 Mar 2013 12:46:11 +0100 Lines: 49 Message-ID: Mime-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha1; protocol="application/pgp-signature"; boundary="------------enig725D25E923B3C6A3C5A88E96" X-Complaints-To: usenet@ger.gmane.org X-Gmane-NNTP-Posting-Host: lara.cc.fer.hr User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:14.0) Gecko/20120812 Thunderbird/14.0 X-Enigmail-Version: 1.4.3 X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 06 Mar 2013 11:46:32 -0000 This is an OpenPGP/MIME signed message (RFC 2440 and 3156) --------------enig725D25E923B3C6A3C5A88E96 Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: quoted-printable Maybe it's just me, but I really still have a hard time trusting ZFS. Here's what happened today in a machine with FreeNAS 8.3: 41526 root 1 45 0 17856K 1860K tx->tx 0 0:00 0.00% /sbin/zfs create -o compression=3Dlzjb -V 100g store/test 41641 root 1 44 0 17856K 1860K tx->tx 0 0:00 0.00% /sbin/zfs create -o compression=3Dlzjb -V 200g store/bla 41492 root 1 45 0 17856K 1860K tx->tx 2 0:00 0.00% /sbin/zfs create -o compression=3Dlzjb -V 100g store/test 41392 root 1 44 0 17856K 1860K tx->tx 0 0:00 0.00% /sbin/zfs create -o compression=3Dlzjb -V 100G store/test These processes were started from the web GUI and are apparently deadlocked or not proceeding because of some other reason. I tried creating the "test" volume first (the earliest PID), and when it didn't finish in a few minutes I cancelled it from the web GUI and started again. Apparently, all of these processes are in some kind of deadlock state. No IO is happening on the machine. An attempt to do a "zpool scrub store" resulted in zpool also being (dead)locked, in a different place: [ctrl-t] load: 0.00 cmd: zpool 41724 [scl->scl_cv)] 37.22r 0.00u 0.00s 0% 1696k Now, all IO on the file system is (dead) locked and cannot proceed. There is nothing significant in /var/log/messages. --------------enig725D25E923B3C6A3C5A88E96 Content-Type: application/pgp-signature; name="signature.asc" Content-Description: OpenPGP digital signature Content-Disposition: attachment; filename="signature.asc" -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.19 (FreeBSD) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iEYEARECAAYFAlE3LIsACgkQ/QjVBj3/HSw0RQCeOlWSrQs/hLfEWvz733H3+zZ2 Nd8AnAlMM8FNn7n+5OLlmkzbBB+mttvo =sozH -----END PGP SIGNATURE----- --------------enig725D25E923B3C6A3C5A88E96--