From owner-freebsd-stable@FreeBSD.ORG Tue Feb 14 22:34:19 2012 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 98527106566B for ; Tue, 14 Feb 2012 22:34:19 +0000 (UTC) (envelope-from victor@bsdes.net) Received: from equilibrium.bsdes.net (244.Red-217-126-240.staticIP.rima-tde.net [217.126.240.244]) by mx1.freebsd.org (Postfix) with ESMTP id 37B6C8FC17 for ; Tue, 14 Feb 2012 22:34:19 +0000 (UTC) Received: by equilibrium.bsdes.net (Postfix, from userid 1001) id 9FF2339844; Tue, 14 Feb 2012 23:15:27 +0100 (CET) Date: Tue, 14 Feb 2012 23:15:27 +0100 From: Victor Balada Diaz To: Harald Schmalzbauer Message-ID: <20120214221527.GT2010@equilibrium.bsdes.net> References: <20120214091909.GP2010@equilibrium.bsdes.net> <20120214100513.GA94501@icarus.home.lan> <20120214135435.GQ2010@equilibrium.bsdes.net> <20120214141601.GA98986@icarus.home.lan> <4F3A83DE.3000200@ambtec.de> <20120214165029.GA1852@icarus.home.lan> <4F3A971F.9040407@omnilan.de> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: <4F3A971F.9040407@omnilan.de> User-Agent: Mutt/1.5.21 (2010-09-15) Cc: freebsd-stable@freebsd.org, Jeremy Chadwick , Claudius Herder Subject: Re: problems with AHCI on FreeBSD 8.2 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 14 Feb 2012 22:34:19 -0000 On Tue, Feb 14, 2012 at 06:17:19PM +0100, Harald Schmalzbauer wrote: > schrieb Jeremy Chadwick am 14.02.2012 17:50 (localtime): > > On Tue, Feb 14, 2012 at 04:55:10PM +0100, Claudius Herder wrote: > >> Hello, > >> > >> I have got a quite similar problem with AHCI on FreeBSD 8.2 and it still > >> persists on FreeBSD 9.0 release. > >> > >> Switching from ahci to ataahci resolved the problem for me too. > >> > >> I'm using gmirror for swap, system is on a zpool and the problem first > >> occurred during a zpool scrub, but it is easily reproducible with dd. > >> > >> The timeouts only occur when writing to disks, dd if=/dev/ada{0|1} > >> of=/dev/null is not an issue. > >> Sometimes I need to power off the server because after a reboot one disk > >> is still missing. > >> > >> I really would like to help in this issue, so let me know if you need > >> any more information. > > I find it interesting that, at least so far, the only people reporting > > problems of this type with the ahci.ko driver are people using Samsung > > disks. The only difference is that your models are F1s while the OPs > > are F2s. > > I saw such timeouts long ago and mav@ had a look at my postings and he > mentioned it could be a NCQ problem. > I suspected the disks firmware. > I never tracked it down further, because after replacing the Samsung (F3 > in that case) disks with hitachi ones solved all my problems and gave a > big performance kick as well (with zfs). > You can find the discussion here: > http://lists.freebsd.org/pipermail/freebsd-stable/2010-February/055374.html > You gave me a good idea: try to disable NCQ and see if that's the fault. So i went and applied the attached patch. After it, i can no longer reproduce the issue with ahci driver. I know this is not a solution because it disables NCQ at controller level instead of disk level, but at least we know for sure where the problem is. I think the solution would be to add a new quirk ADA_Q_NONCQ in sys/cam/ata/ata_da.c. Quirks infraestructure is already built, so adding a new quirk for this seems easy. Is someone interested? Do you think there is a better solution? If someone is interested i can build a patch to add ADA_Q_NONCQ quirk and add my drives to it. Regards. -- La prueba más fehaciente de que existe vida inteligente en otros planetas, es que no han intentado contactar con nosotros.