From owner-freebsd-hackers@FreeBSD.ORG Thu Apr 10 09:56:17 2014 Return-Path: Delivered-To: freebsd-hackers@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 1E9D1638 for ; Thu, 10 Apr 2014 09:56:17 +0000 (UTC) Received: from ibiza.webweaving.org (ibiza.webweaving.org [204.109.56.32]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id D1E681BAA for ; Thu, 10 Apr 2014 09:56:16 +0000 (UTC) Received: from pikmeer.webweaving.org (pikmeer.webweaving.org [178.18.23.51]) by ibiza.webweaving.org (8.14.7/8.14.7) with ESMTP id s3A9RO2N087967 for ; Thu, 10 Apr 2014 09:27:24 GMT (envelope-from dirkx@webweaving.org) Received: from [10.11.0.104] (a83-163-239-115.adsl.xs4all.nl [83.163.239.115]) (authenticated bits=0) by pikmeer.webweaving.org (8.14.7/8.14.7) with ESMTP id s3A9Qqbm017471 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NO) for ; Thu, 10 Apr 2014 09:27:23 GMT (envelope-from dirkx@webweaving.org) X-Authentication-Warning: pikmeer.webweaving.org: Host a83-163-239-115.adsl.xs4all.nl [83.163.239.115] claimed to be [10.11.0.104] From: Dirk-Willem van Gulik Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Subject: Hardware raid v.s. 'soft errors' Message-Id: Date: Thu, 10 Apr 2014 11:27:12 +0200 To: freebsd-hackers@freebsd.org Mime-Version: 1.0 (Mac OS X Mail 7.2 \(1874\)) X-Mailer: Apple Mail (2.1874) X-Greylist: Sender IP whitelisted, not delayed by milter-greylist-4.2.7 (ibiza.webweaving.org [204.109.56.32]); Thu, 10 Apr 2014 09:27:25 +0000 (UTC) X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.4.3 (pikmeer.webweaving.org [178.18.23.51]); Thu, 10 Apr 2014 09:27:24 +0000 (UTC) X-BeenThere: freebsd-hackers@freebsd.org X-Mailman-Version: 2.1.17 Precedence: list List-Id: Technical Discussions relating to FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 10 Apr 2014 09:56:17 -0000 Got two AAC hardware raid machines - slightly differently configured. = Both are (about once every other year) giving an error like g_vfs_done():aacd1p8[READ(offset=3D1757508976640, = length=3D65536)]error =3D 5 while the hardware raid is healthy - and passes all its validations. As = do the disks (we=92ve replaced disks on those machines some 4 or 5 times - with no real impact/change - still above issue every 18 months = or so) As far as I can trace this through the kernel - these errors *really* come from the ATA in the AAC - correct ? The odd thing is that it happens on two machines - with slightly = different AAC cards; and with a different upgrade history. Both are now 9.2-RELEASE-p3 - but the issue has propagated from 7.2 upward. Any suggestions as to where to look ? And specifically - is there a way = in the AAC to intercept events/errors at an even lower level ? Or should I be assuming this to be more in raid card firmware territory ? Dw=