From owner-freebsd-stable@FreeBSD.ORG Wed Feb 1 13:51:02 2012 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id CE9C2106564A for ; Wed, 1 Feb 2012 13:51:02 +0000 (UTC) (envelope-from peter.maloney@brockmann-consult.de) Received: from moutng.kundenserver.de (moutng.kundenserver.de [212.227.126.171]) by mx1.freebsd.org (Postfix) with ESMTP id 76D578FC14 for ; Wed, 1 Feb 2012 13:51:02 +0000 (UTC) Received: from [10.3.0.26] ([141.4.215.32]) by mrelayeu.kundenserver.de (node=mrbap4) with ESMTP (Nemesis) id 0MWSYU-1S47yR1rqM-00XczP; Wed, 01 Feb 2012 14:51:01 +0100 Message-ID: <4F294344.2030006@brockmann-consult.de> Date: Wed, 01 Feb 2012 14:51:00 +0100 From: Peter Maloney User-Agent: Mozilla/5.0 (X11; U; Linux x86_64; en-US; rv:1.9.2.23) Gecko/20110922 Thunderbird/3.1.15 MIME-Version: 1.0 To: freebsd-stable@freebsd.org References: <4F2940C1.10901@digiware.nl> In-Reply-To: <4F2940C1.10901@digiware.nl> X-Enigmail-Version: 1.1.2 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit X-Provags-ID: V02:K0:tqywJGU9z7hZiYWkSML1KqKAWFUgmy8nmG2m0PIa9no IXectIcAn6YFkMkQvuZx29h82Us1fYEWrWP8/AsDqfW0b+4HYP EDb0RNgPW1RZbWhAufrwRd6jS9ZTi8vL38AOmk21orESHlwZF4 8XyFkk4VLlCwccfBTYZebun7qywQLzm29Y57O80ByyCsG/xF6d k1ZjRiGkgZat/QtKlfioJ8t9L/v2TkpBJQiTb9cnF6EMkFaKxo kDS6bW/yBO9HUrX5gImWJmzZZLgstUam3Tuujt4PLIDsQzyH64 eGECt2VWuYOv570AiwGdSGDOIxAUN2ZCHKOuz2hbjoxFSAukMK VrNphGvk7VrOrAK9CE6uUSWQQNGSs1cTUkk6kr47s Subject: Re: Troube with SSD X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 01 Feb 2012 13:51:02 -0000 I had the (probably) same problem with a Crucial SSD with old firmware. With my problem, the mps driver or mpslsi driver logs the timeouts rather than AHCI. With new disk firmware, it works fine so far (about 2-3 weeks). Here is my forum thread http://forums.freebsd.org/showthread.php?t=28252 I can cause a very similar error by hot pulling the disk. After putting the disk back in, I can't use the disk until rebooting. (And I reproduced the same problem with a Seagate Green spinning disk). After the firmware upgrade, this test passes, so I put it back to work in the machine. Here is a thread with a similar problem with an OCZ Vertex 3 http://forums.freebsd.org/showthread.php?t=27128 I didn't try "camcontrol rescan". But I tried "camcontrol reset ...." which caused a kernel panic (meaning FreeBSD is most likely at least partially to blame). ;) Peter On 02/01/2012 02:40 PM, Willem Jan Withagen wrote: > Hi, > > I have this ZFS server up for about 27 days, and about 3 weeks ago (was > not really paying attention) it turns out it lost its SSD that I'm using > for log and cache. There is also a poor and lonely memory stick for log. > So the box did not really suffer file loss. > > system is running: > FreeBSD zfs.digiware.nl 8.2-STABLE FreeBSD 8.2-STABLE #58: Thu Nov 17 > 09:43:46 CET 2011 > root@zfs.digiware.nl:/home/obj/usr/src/src8/src/sys/ZFS amd64 > > more info like dmesg, pciconf, kernconf, zpool iostat at: > http://www.tegenbosch28.nl/FreeBSD/systems/ZFS/ > > But it is weird to just lose a SSD from the bus. And it has happened > before. And you can see that AHCI really banged on the frontdoor... > > The device is a Corsair 60Gb Force GT. And thusfar I have not found any > suggestions that that serie of devices is prone to doing this. > > It was a real dead device, the only way to get it back: > powercycle the device by pulling it, and stick it back > then camcontrol rescan > > I've now upgrade it to a 120Gb Corsair, to see if that has the same problem. > > Other FreeBSD-ers have like problems? > > Regards, > --WjW > > > Jan 7 10:04:24 zfs kernel: ahcich3: Timeout on slot 27 port 0 > Jan 7 10:04:24 zfs kernel: ahcich3: is 00000000 cs 20000000 ss 38000000 > rs 38000000 tfd c0 serr 00000000 cmd 0004dd17 > Jan 7 10:04:56 zfs kernel: ahcich3: AHCI reset: device not ready after > 31000ms (tfd = 00000080) > Jan 7 10:05:26 zfs kernel: ahcich3: Timeout on slot 29 port 0 > Jan 7 10:05:26 zfs kernel: ahcich3: is 00000000 cs 20000000 ss 00000000 > rs 20000000 tfd 80 serr 00000000 cmd 0004dd17 > Jan 7 10:05:57 zfs kernel: ahcich3: AHCI reset: device not ready after > 31000ms (tfd = 00000080) > Jan 7 10:06:27 zfs kernel: ahcich3: Timeout on slot 29 port 0 > Jan 7 10:06:27 zfs kernel: ahcich3: is 00000000 cs 20000000 ss 00000000 > rs 20000000 tfd 80 serr 00000000 cmd 0004dd17 > Jan 7 10:06:27 zfs kernel: (ada2:ahcich3:0:0:0): lost device > Jan 7 10:06:58 zfs kernel: ahcich3: AHCI reset: device not ready after > 31000ms (tfd = 00000080) > Jan 7 10:07:28 zfs kernel: ahcich3: Timeout on slot 29 port 0 > Jan 7 10:07:28 zfs kernel: ahcich3: is 00000000 cs e0000000 ss e0000000 > rs e0000000 tfd 80 serr 00000000 cmd 0004dd17 > Jan 7 10:08:16 zfs kernel: ahcich3: AHCI reset: device not ready after > 31000ms (tfd = 00000080) > Jan 7 10:08:16 zfs kernel: ahcich3: Poll timeout on slot 31 port 0 > Jan 7 10:08:16 zfs kernel: ahcich3: is 00000000 cs 80000000 ss 00000000 > rs 80000000 tfd 80 serr 00000000 cmd 0004df17 > Jan 7 10:08:46 zfs kernel: ahcich3: Timeout on slot 31 port 0 > Jan 7 10:08:46 zfs kernel: ahcich3: is 00000000 cs 80000000 ss 00000000 > rs 80000000 tfd 80 serr 00000000 cmd 0004df17 > Jan 7 10:08:48 zfs kernel: (ada2:ahcich3:0:0:0): removing device entry > Jan 7 10:09:33 zfs kernel: ahcich3: AHCI reset: device not ready after > 31000ms (tfd = 00000080) > Jan 7 10:09:33 zfs kernel: ahcich3: Poll timeout on slot 31 port 0 > Jan 7 10:09:33 zfs kernel: ahcich3: is 00000000 cs 80000000 ss 00000000 > rs 80000000 tfd 80 serr 00000000 cmd 0004df17 > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to "freebsd-stable-unsubscribe@freebsd.org" -- -------------------------------------------- Peter Maloney Brockmann Consult Max-Planck-Str. 2 21502 Geesthacht Germany Tel: +49 4152 889 300 Fax: +49 4152 889 333 E-mail: peter.maloney@brockmann-consult.de Internet: http://www.brockmann-consult.de --------------------------------------------