From owner-freebsd-fs@FreeBSD.ORG Sat Oct 20 15:49:23 2012 Return-Path: Delivered-To: freebsd-fs@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [69.147.83.52]) by hub.freebsd.org (Postfix) with ESMTP id 3277B13D; Sat, 20 Oct 2012 15:49:23 +0000 (UTC) (envelope-from freebsd@penx.com) Received: from btw.pki2.com (btw.pki2.com [IPv6:2001:470:a:6fd::2]) by mx1.freebsd.org (Postfix) with ESMTP id 8FE708FC0A; Sat, 20 Oct 2012 15:49:22 +0000 (UTC) Received: from [127.0.0.1] (localhost [127.0.0.1]) by btw.pki2.com (8.14.5/8.14.5) with ESMTP id q9KFnHXG055574; Sat, 20 Oct 2012 08:49:17 -0700 (PDT) (envelope-from freebsd@penx.com) Subject: Re: ZFS hang status update From: Dennis Glatting To: Andriy Gapon In-Reply-To: <50825598.3070505@FreeBSD.org> References: <1350698905.86715.33.camel@btw.pki2.com> <1350711509.86715.59.camel@btw.pki2.com> <50825598.3070505@FreeBSD.org> Content-Type: multipart/mixed; boundary="=-T3qbQpcvQzfHgpDvLbi3" Date: Sat, 20 Oct 2012 08:49:17 -0700 Message-ID: <1350748157.88577.15.camel@btw.pki2.com> Mime-Version: 1.0 X-Mailer: Evolution 2.32.1 FreeBSD GNOME Team Port X-yoursite-MailScanner-Information: Dennis Glatting X-yoursite-MailScanner-ID: q9KFnHXG055574 X-yoursite-MailScanner: Found to be clean X-MailScanner-From: freebsd@penx.com X-Content-Filtered-By: Mailman/MimeDel 2.1.14 Cc: freebsd-fs@freebsd.org X-BeenThere: freebsd-fs@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Filesystems List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 20 Oct 2012 15:49:23 -0000 --=-T3qbQpcvQzfHgpDvLbi3 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: 7bit On Sat, 2012-10-20 at 10:41 +0300, Andriy Gapon wrote: > on 20/10/2012 08:38 Dennis Glatting said the following: > > This is da0 (the cache --SSD) on which camcontrol hanged. It is on the > > same controller. > > Hmm, hanging camcontrol is a bad sign. It would be interesting to get procstat > -k information just for the hanging camcontrol process. > Also, is it possible to eliminate this disk from the configuration? > Attached (I hope but also found here: http://www.pki2.com/zfs_stats_efficiency-week.png) is a munin graph of "ZFS ARC Efficiency - by week" for the server I have been talking about. You will notice before each crash the prefetch efficiency is going down. This graph is for daily graph. http://www.pki2.com/zfs_stats_efficiency-day.png --=-T3qbQpcvQzfHgpDvLbi3--