From: Julien Cigar
Organization: Belgian Biodiversity Platform
Date: Thu, 08 Dec 2011 12:23:37 +0100
To: "C. P. Ghost"
Cc: FreeBSD Mailing List
Subject: Re: AHCI timeout
Message-ID: <4EE09E39.1060309@ulb.ac.be>
References: <4EDE37A1.5030306@ulb.ac.be>

On 12/06/2011 19:25, C. P. Ghost wrote:
> On Tue, Dec 6, 2011 at 4:41 PM, Julien Cigar wrote:
>> Hello,
>>
>> I'm running 9.0-RC3 on a HP Proliant Microserver (N40L). A disk died in my
>> graid3 array and I replaced it with a new one, and now have tons of:
>>
>> ahcich3: Timeout on slot 5 port 0
>> ahcich3: is 00000000 cs 00000000 ss 00003f60 rs 00003f60 tfd 40 serr
>> 00000000 cmd 0000ed17
>
> Check the connectors, both on disk and on the controller. They're
> usually the culprit. Sometimes it is also a firmware problem, but
> I'll try to replace the cables first.

I tried two different connectors, but the problem persists. However, I
noticed that the problem only appears at high I/O rates (during a graid3
resync, for example). The machine runs Bacula and the backup job
completed successfully last night, but that job was for a remote
machine, so the write rate didn't go above 2 MB/s ...

Do you think the problem could be the firmware of the disk?

>
>> (...)
>>
>> Those are Seagate disks:
>>
>> jcigar@backup conf % sudo camcontrol devlist
>> at scbus0 target 0 lun 0 (pass0,ada0)
>> at scbus1 target 0 lun 0 (pass1,ada1)
>> at scbus2 target 0 lun 0 (pass2,ada2)
>> at scbus3 target 0 lun 0 (pass3,ada3)
>>
>> The controller is:
>>
>> ahci0@pci0:0:17:0: class=0x010601 card=0x1609103c chip=0x43911002
>> rev=0x40 hdr=0x00
>>     vendor     = 'ATI Technologies Inc'
>>     device     = 'SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode]'
>>     class      = mass storage
>>     subclass   = SATA
>>
>> jcigar@backup conf % vmstat -i
>> interrupt                          total       rate
>> irq17: ehci0 ehci1+                    2          0
>> irq18: ohci0 ohci1+                   30          0
>> irq256: bge0                       31354          4
>> irq257: ahci0                   19012658       2477
>> irq258: hpet0:t0                 4926229        641
>> irq259: hpet0:t1                 4635261        603
>> Total                           28605534       3727
>>
>>
>> Any idea what could be the cause of this ... ?
>>
>>
>> Thanks,
>> Julien
>
> -cpghost.
>

-- 
No trees were killed in the creation of this message.
However, many electrons were terribly inconvenienced.