From: Willem Jan Withagen
Organization: Digiware Management b.v.
Date: Tue, 07 Jul 2015 11:31:00 +0200
Message-ID: <559B9C54.9060903@digiware.nl>
To: Bob Friesenhahn, Steve Wills
CC: fs@freebsd.org
Subject: Re: This diskfailure should not panic a system, but just disconnect disk from ZFS
References: <5585767B.4000206@digiware.nl> <20150620221431.GB26416@mouf.net>

On 21-6-2015 23:01, Bob Friesenhahn wrote:
> On Sat, 20 Jun 2015, Steve Wills wrote:
>>> rev=0x00 hdr=0x00
>>> vendor = 'Areca Technology Corp.'
>>> device = 'ARC-1120 8-Port PCI-X to SATA RAID Controller'
>>> class = mass storage
>>> subclass = RAID
>>
>> You may be hitting the zfs deadman panic, which is triggered when the
>> controller hangs. This can in some cases be caused by disks that die
>> in unusual
>> ways.
>
> Notice that the RAID controller is a PCI-X device (shared parallel, not
> dedicated serial like PCIe). The whole PCI backplane could have hung.

I had this panic problem a while ago, and since then it has recurred
quite a few times. This time, however, I was working on the system and
noticed it right away, so I went into the basement and checked the box.

The console is not really dead:
 - I can switch terminals but cannot log in.
 - I can ping the box but cannot ssh into it.
 - I cannot break into the kernel.

No disk I/O is shown at all. None of the LEDs flash for at least
30 seconds. Only the reset button gets me back to normal.

That suggests much more strongly that something is really hung.

The question is: how can I debug this?
Breaking into the kernel (ctl-del-esc) does not seem to work...

I am also considering getting an Areca controller for PCIe instead, but
that means shelling out another $250, and that just to get JBODs.

--WjW