From owner-freebsd-bugs@FreeBSD.ORG Wed Dec 3 10:41:05 2014 Return-Path: Delivered-To: freebsd-bugs@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id B38C618B for ; Wed, 3 Dec 2014 10:41:05 +0000 (UTC) Received: from kenobi.freebsd.org (kenobi.freebsd.org [IPv6:2001:1900:2254:206a::16:76]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 816C572 for ; Wed, 3 Dec 2014 10:41:05 +0000 (UTC) Received: from bugs.freebsd.org ([127.0.1.118]) by kenobi.freebsd.org (8.14.9/8.14.9) with ESMTP id sB3Af54q043568 for ; Wed, 3 Dec 2014 10:41:05 GMT (envelope-from bugzilla-noreply@freebsd.org) From: bugzilla-noreply@freebsd.org To: freebsd-bugs@FreeBSD.org Subject: [Bug 195349] CAM status: Command timeout since upgrade to 10.1 Date: Wed, 03 Dec 2014 10:41:05 +0000 X-Bugzilla-Reason: AssignedTo X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: Base System X-Bugzilla-Component: kern X-Bugzilla-Version: 10.1-RELEASE X-Bugzilla-Keywords: X-Bugzilla-Severity: Affects Some People X-Bugzilla-Who: pascal.guitierrez@gmail.com X-Bugzilla-Status: New X-Bugzilla-Priority: --- X-Bugzilla-Assigned-To: freebsd-bugs@FreeBSD.org X-Bugzilla-Target-Milestone: --- X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: 7bit X-Bugzilla-URL: https://bugs.freebsd.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: freebsd-bugs@freebsd.org X-Mailman-Version: 2.1.18-1 Precedence: list List-Id: Bug reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 03 Dec 2014 10:41:05 -0000 https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=195349 --- Comment #9 from pascal.guitierrez@gmail.com --- Dec 1 17:44:06 nas01 kernel: ahcich3: Timeout on slot 6 port 0 Dec 1 17:44:06 nas01 kernel: ahcich3: is 00000008 cs 00000000 ss 00000000 rs 00000040 tfd 50 serr 00000000 cmd 00006617 Dec 1 17:44:06 nas01 kernel: (ada3:ahcich3:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 00 80 9e 2f 40 3e 00 00 01 00 00 Dec 1 17:44:06 nas01 kernel: (ada3:ahcich3:0:0:0): CAM status: Command timeout Dec 1 17:44:06 nas01 kernel: ahcich2: (ada3:Timeout on slot 15 port 0 Dec 1 17:44:06 nas01 kernel: ahcich3:0:ahcich2: 0:is 00000008 cs 00000000 ss 00000000 rs 00008000 tfd 40 serr 00000000 cmd 00006f17 Dec 1 17:44:06 nas01 kernel: 0): Retrying command Dec 1 17:44:06 nas01 kernel: (ada2:ahcich2:0:0:0): READ_FPDMA_QUEUED. ACB: 60 80 e8 a9 2f 40 3e 00 00 00 00 00 Dec 1 17:44:06 nas01 kernel: ahcich1: (ada2:ahcich2:0:0:0): CAM status: Command timeout pciconf -vl ahci0@pci0:0:17:0: class=0x010601 card=0x1609103c chip=0x43911002 rev=0x40 hdr=0x00 vendor = 'Advanced Micro Devices [AMD] nee ATI' device = 'SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode]' class = mass storage subclass = SATA dmesg: ahci0: port 0xd000-0xd007,0xc000-0xc003,0xb000-0xb007,0xa000-0xa003,0x9000-0x900f mem 0xfe6ffc00-0xfe6fffff irq 19 at device 17.0 on pci0 ahci0: AHCI v1.20 with 4 3Gbps ports, Port Multiplier supported This issue is reproduceable by causing load (such as zpool scrub). Seems to hit the issue much more frequently with > 2 disks in the pool - we see it on our HP microservers with 4 and 5 disks. Downgrading SATA to v1 (via loader hints) and disabling NCQ (via camcontrol tags -N 1) seems to delay, not fix the inevitable timeouts. -- You are receiving this mail because: You are the assignee for the bug.