From owner-freebsd-stable@FreeBSD.ORG Wed Feb 1 13:40:24 2012 Return-Path: Delivered-To: stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 6326F106566C for ; Wed, 1 Feb 2012 13:40:24 +0000 (UTC) (envelope-from wjw@digiware.nl) Received: from mail.digiware.nl (mail.ip6.digiware.nl [IPv6:2001:4cb8:1:106::2]) by mx1.freebsd.org (Postfix) with ESMTP id 9C3638FC18 for ; Wed, 1 Feb 2012 13:40:23 +0000 (UTC) Received: from rack1.digiware.nl (localhost.digiware.nl [127.0.0.1]) by mail.digiware.nl (Postfix) with ESMTP id 54AF8153433 for ; Wed, 1 Feb 2012 14:40:21 +0100 (CET) X-Virus-Scanned: amavisd-new at digiware.nl Received: from mail.digiware.nl ([127.0.0.1]) by rack1.digiware.nl (rack1.digiware.nl [127.0.0.1]) (amavisd-new, port 10024) with ESMTP id EQCoyKnlJ-on for ; Wed, 1 Feb 2012 14:40:20 +0100 (CET) Received: from [192.168.10.67] (opteron [192.168.10.67]) by mail.digiware.nl (Postfix) with ESMTP id 62DD415343B for ; Wed, 1 Feb 2012 14:40:20 +0100 (CET) Message-ID: <4F2940C1.10901@digiware.nl> Date: Wed, 01 Feb 2012 14:40:17 +0100 From: Willem Jan Withagen User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:9.0) Gecko/20111222 Thunderbird/9.0.1 MIME-Version: 1.0 To: "stable@freebsd.org" Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Cc: Subject: Troube with SSD X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 01 Feb 2012 13:40:24 -0000 Hi, I have this ZFS server up for about 27 days, and about 3 weeks ago (was not really paying attention) it turns out it lost its SSD that I'm using for log and cache. There is also a poor and lonely memory stick for log. So the box did not really suffer file loss. system is running: FreeBSD zfs.digiware.nl 8.2-STABLE FreeBSD 8.2-STABLE #58: Thu Nov 17 09:43:46 CET 2011 root@zfs.digiware.nl:/home/obj/usr/src/src8/src/sys/ZFS amd64 more info like dmesg, pciconf, kernconf, zpool iostat at: http://www.tegenbosch28.nl/FreeBSD/systems/ZFS/ But it is weird to just lose a SSD from the bus. And it has happened before. And you can see that AHCI really banged on the frontdoor... The device is a Corsair 60Gb Force GT. And thusfar I have not found any suggestions that that serie of devices is prone to doing this. It was a real dead device, the only way to get it back: powercycle the device by pulling it, and stick it back then camcontrol rescan I've now upgrade it to a 120Gb Corsair, to see if that has the same problem. Other FreeBSD-ers have like problems? Regards, --WjW Jan 7 10:04:24 zfs kernel: ahcich3: Timeout on slot 27 port 0 Jan 7 10:04:24 zfs kernel: ahcich3: is 00000000 cs 20000000 ss 38000000 rs 38000000 tfd c0 serr 00000000 cmd 0004dd17 Jan 7 10:04:56 zfs kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) Jan 7 10:05:26 zfs kernel: ahcich3: Timeout on slot 29 port 0 Jan 7 10:05:26 zfs kernel: ahcich3: is 00000000 cs 20000000 ss 00000000 rs 20000000 tfd 80 serr 00000000 cmd 0004dd17 Jan 7 10:05:57 zfs kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) Jan 7 10:06:27 zfs kernel: ahcich3: Timeout on slot 29 port 0 Jan 7 10:06:27 zfs kernel: ahcich3: is 00000000 cs 20000000 ss 00000000 rs 20000000 tfd 80 serr 00000000 cmd 0004dd17 Jan 7 10:06:27 zfs kernel: (ada2:ahcich3:0:0:0): lost device Jan 7 10:06:58 zfs kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) Jan 7 10:07:28 zfs kernel: ahcich3: Timeout on slot 29 port 0 Jan 7 10:07:28 zfs kernel: ahcich3: is 00000000 cs e0000000 ss e0000000 rs e0000000 tfd 80 serr 00000000 cmd 0004dd17 Jan 7 10:08:16 zfs kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) Jan 7 10:08:16 zfs kernel: ahcich3: Poll timeout on slot 31 port 0 Jan 7 10:08:16 zfs kernel: ahcich3: is 00000000 cs 80000000 ss 00000000 rs 80000000 tfd 80 serr 00000000 cmd 0004df17 Jan 7 10:08:46 zfs kernel: ahcich3: Timeout on slot 31 port 0 Jan 7 10:08:46 zfs kernel: ahcich3: is 00000000 cs 80000000 ss 00000000 rs 80000000 tfd 80 serr 00000000 cmd 0004df17 Jan 7 10:08:48 zfs kernel: (ada2:ahcich3:0:0:0): removing device entry Jan 7 10:09:33 zfs kernel: ahcich3: AHCI reset: device not ready after 31000ms (tfd = 00000080) Jan 7 10:09:33 zfs kernel: ahcich3: Poll timeout on slot 31 port 0 Jan 7 10:09:33 zfs kernel: ahcich3: is 00000000 cs 80000000 ss 00000000 rs 80000000 tfd 80 serr 00000000 cmd 0004df17