From: Dr Josef Karthauser
Date: Fri, 20 Feb 2015 10:34:44 +0000
Subject: ada drives keep timing out!
To: stable@freebsd.org
Message-Id: <064CF905-DF19-40A7-8CE8-D9FFE17913EE@tao.org.uk>

Hi there,

I reported this last year, but I’d like to revisit it as it must have a software remedy. I know that I’m not the only one to have reported the problem.

I have a ZFS pool with a number of Western Digital drives in it (WDC WD1000FYPS-01ZKB0 02.01B01). Periodically a drive times out with this error:

(ada2:ahcich2:0:0:0): Periph destroyed
(aprobe0:ahcich2:0:0:0): NOP. ACB: 00 00 00 00 00 00 00 00 00 00 00 00
(aprobe0:ahcich2:0:0:0): CAM status: ATA Status Error
(aprobe0:ahcich2:0:0:0): ATA status: d1 (BSY DRDY SERV ERR), error: 04 (ABRT )
(aprobe0:ahcich2:0:0:0): RES: d1 04 ff ff ff ff ff ff ff ff ff
(aprobe0:ahcich2:0:0:0): Error 5, Retries exhausted

and drops out of the pool. I have to reset the bus to get it to reattach:

camcontrol scan 2
camcontrol rescan 2

I have four drives and recently they detached with this frequency:

Jan 12 13:25:23 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
Jan 22 22:07:57 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
Jan 29 08:12:28 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
Jan 30 02:16:45 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
Feb  8 20:07:39 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
Feb 19 02:27:18 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
Feb 20 08:24:40 server kernel: (ada2:ahcich2:0:0:0): Periph destroyed

The box is a:

FreeBSD server 9.2-STABLE FreeBSD 9.2-STABLE #1 r253253M: Mon Mar 10 22:53:08 GMT 2014

Is this likely to have been addressed in a more recent FreeBSD, or is it still an issue today, and can I work with someone to find a remedy?

Many thanks
Joe
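
P.S. In case it helps anyone compare notes, here is a rough sketch of how the detach events listed above could be tallied per drive. It is only an illustration: the function name `count_detaches` and the regular expression are my own assumptions, not part of any FreeBSD tool, and the sample text is just the seven log lines from this message.

```python
import re
from collections import Counter

# Sample lines copied from the kernel log excerpt in this message.
LOG = """\
Jan 12 13:25:23 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
Jan 22 22:07:57 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
Jan 29 08:12:28 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
Jan 30 02:16:45 server kernel: (ada3:ahcich3:0:0:0): Periph destroyed
Feb  8 20:07:39 server kernel: (ada1:ahcich1:0:0:0): Periph destroyed
Feb 19 02:27:18 server kernel: (ada0:ahcich0:0:0:0): Periph destroyed
Feb 20 08:24:40 server kernel: (ada2:ahcich2:0:0:0): Periph destroyed
"""

# Hypothetical pattern: captures the adaN device name from a
# "(adaN:ahcichM:...): Periph destroyed" kernel message.
PATTERN = re.compile(r"\((ada\d+):ahcich\d+:[\d:]+\): Periph destroyed")


def count_detaches(text):
    """Return a Counter mapping device name to its number of detach events."""
    counts = Counter()
    for line in text.splitlines():
        match = PATTERN.search(line)
        if match:
            counts[match.group(1)] += 1
    return counts


if __name__ == "__main__":
    for device, n in sorted(count_detaches(LOG).items()):
        print(f"{device}: {n}")
```

On a live system the same function could presumably be fed the contents of the system log instead of the embedded sample. The point is only that the detaches are spread across all four drives rather than concentrated on one.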