From: Stefan Bethke
Date: Tue, 9 Mar 2010 10:15:53 +0100
To: FreeBSD Stable
Subject: Many processes stuck in zfs
Message-Id: <864468D4-DCE9-493B-9280-00E5FAB2A05C@lassitu.de>

Over the past couple of months, I've more or less regularly observed
machines having more and more processes stuck in the zfs wchan. The
processes never recover from that, and trying to reboot only gets the
entire system stuck, without any console messages. I can enter the
debugger, and I have saved a couple of dumps.

The situation seems to be triggered by zfs receive'ing snapshots from
the sister machine (both synchronize their active ZFS filesystems to
each other, using zfs send and zfs receive). It appears it's the
receiving causing trouble.

Both machines run 8-stable from mid-February, with a single-disk ZFS
pool, with ARC limited to 512M, prefetch and ZIL disabled via
loader.conf.

What should I be looking at to further diagnose?

Thanks,
Stefan

-- 
Stefan Bethke Fon +49 151 14070811
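
As a possible first diagnostic step for the symptom described above, the sketch below lists processes whose wait channel is "zfs" by parsing `ps` output. It is only an illustration, not part of the original report: the function names `parse_ps` and `stuck_processes` are hypothetical, and it assumes `ps -axo pid,state,wchan,command` prints one header row followed by whitespace-separated columns.

```python
import subprocess


def parse_ps(text, wchan="zfs"):
    """Return (pid, state, command) tuples for processes sleeping on `wchan`.

    `text` is expected to be the output of `ps -axo pid,state,wchan,command`:
    a header row, then one process per line.
    """
    stuck = []
    for line in text.splitlines()[1:]:  # skip the header row
        # Split into at most four fields so the command keeps its arguments.
        fields = line.split(None, 3)
        if len(fields) < 4:
            continue  # malformed or truncated line
        pid, state, chan, command = fields
        if chan == wchan:
            stuck.append((int(pid), state, command.strip()))
    return stuck


def stuck_processes(wchan="zfs"):
    """Run ps and return the processes currently waiting on `wchan`."""
    out = subprocess.run(
        ["ps", "-axo", "pid,state,wchan,command"],
        capture_output=True,
        text=True,
        check=True,
    ).stdout
    return parse_ps(out, wchan)
```

Calling `stuck_processes()` at intervals would show whether the set of affected processes grows over time. On FreeBSD, `procstat -kk <pid>` can then print the kernel stack of an individual stuck process, which is usually more informative than the wait channel name alone.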