From owner-freebsd-bugs@freebsd.org Tue Sep 22 06:39:22 2020 Return-Path: Delivered-To: freebsd-bugs@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id 478443DF4FF for ; Tue, 22 Sep 2020 06:39:22 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from mailman.nyi.freebsd.org (mailman.nyi.freebsd.org [IPv6:2610:1c1:1:606c::50:13]) by mx1.freebsd.org (Postfix) with ESMTP id 4BwWqt1FRyz4b1n for ; Tue, 22 Sep 2020 06:39:22 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: by mailman.nyi.freebsd.org (Postfix) id 290DD3DF816; Tue, 22 Sep 2020 06:39:22 +0000 (UTC) Delivered-To: bugs@mailman.nyi.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.nyi.freebsd.org (Postfix) with ESMTP id 28D393DF815 for ; Tue, 22 Sep 2020 06:39:22 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from mxrelay.nyi.freebsd.org (mxrelay.nyi.freebsd.org [IPv6:2610:1c1:1:606c::19:3]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256 client-signature RSA-PSS (4096 bits) client-digest SHA256) (Client CN "mxrelay.nyi.freebsd.org", Issuer "Let's Encrypt Authority X3" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 4BwWqt0MKFz4b6D for ; Tue, 22 Sep 2020 06:39:22 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from kenobi.freebsd.org (kenobi.freebsd.org [IPv6:2610:1c1:1:606c::50:1d]) (using TLSv1.3 with cipher TLS_AES_256_GCM_SHA384 (256/256 bits) key-exchange X25519 server-signature RSA-PSS (4096 bits) server-digest SHA256) (Client did not present a certificate) by mxrelay.nyi.freebsd.org (Postfix) with ESMTPS id E53871B57A for ; Tue, 22 Sep 2020 06:39:21 +0000 (UTC) (envelope-from bugzilla-noreply@freebsd.org) Received: from kenobi.freebsd.org ([127.0.1.5]) by kenobi.freebsd.org (8.15.2/8.15.2) with ESMTP id 08M6dL4o076581 for ; Tue, 22 Sep 2020 06:39:21 GMT (envelope-from bugzilla-noreply@freebsd.org) Received: (from www@localhost) by kenobi.freebsd.org (8.15.2/8.15.2/Submit) id 08M6dLJI076580 for bugs@FreeBSD.org; Tue, 22 Sep 2020 06:39:21 GMT (envelope-from bugzilla-noreply@freebsd.org) X-Authentication-Warning: kenobi.freebsd.org: www set sender to bugzilla-noreply@freebsd.org using -f From: bugzilla-noreply@freebsd.org To: bugs@FreeBSD.org Subject: [Bug 229745] ahcich: CAM status: Command timeout Date: Tue, 22 Sep 2020 06:39:18 +0000 X-Bugzilla-Reason: AssignedTo X-Bugzilla-Type: changed X-Bugzilla-Watch-Reason: None X-Bugzilla-Product: Base System X-Bugzilla-Component: kern X-Bugzilla-Version: 11.2-STABLE X-Bugzilla-Keywords: X-Bugzilla-Severity: Affects Some People X-Bugzilla-Who: nicolas.richeton@gmail.com X-Bugzilla-Status: New X-Bugzilla-Resolution: X-Bugzilla-Priority: --- X-Bugzilla-Assigned-To: bugs@FreeBSD.org X-Bugzilla-Flags: X-Bugzilla-Changed-Fields: cc Message-ID: In-Reply-To: References: Content-Type: text/plain; charset="UTF-8" Content-Transfer-Encoding: quoted-printable X-Bugzilla-URL: https://bugs.freebsd.org/bugzilla/ Auto-Submitted: auto-generated MIME-Version: 1.0 X-BeenThere: freebsd-bugs@freebsd.org X-Mailman-Version: 2.1.33 Precedence: list List-Id: Bug reports List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 22 Sep 2020 06:39:22 -0000 https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=3D229745 Nicolas Richeton changed: What |Removed |Added ---------------------------------------------------------------------------- CC| |nicolas.richeton@gmail.com --- Comment #55 from Nicolas Richeton --- Hello,=20 On FreeBSD 11.3-RELEASE-p11 (FreeNAS) : I got the same issue with a ASMedia ASM1062 AHCI SATA controller. (2 SATA ports; PCIe x1 card on PCIe2.0 connector)=20 2 drives connected, configured as a ZFS mirror pool. I faced this issue when switching from 2 HDD to 2 SSD. It seems related to the volume of data flowing through the controller:=20 - When 2 HDD (SATA2) are used : everything is fine. - When 2 SSD (SATA3) are used : they are detected correctly, but when I sta= rt a copy/zfs scrub, I get a lot of :=20 Sep 18 16:54:21 nas (ada1:ahcich1:0:0:0): READ_FPDMA_QUEUED. ACB: 60 b0 70 = f1 cc 40 02 00 00 00 00 00 Sep 18 16:54:21 nas (ada1:ahcich1:0:0:0): CAM status: Uncorrectable parity/= CRC error Sep 18 16:54:21 nas (ada1:ahcich1:0:0:0): Retrying command Sep 18 16:54:51 nas ahcich1: Timeout on slot 5 port 0 Sep 18 16:54:51 nas ahcich1: is 00000000 cs 00000000 ss 00000020 rs 00000020 tfd 40 serr 00000000 cmd 0004c517 Sep 18 16:54:51 nas (ada1:ahcich1:0:0:0): WRITE_FPDMA_QUEUED. ACB: 61 30 d0= 6d e0 40 4f 00 00 00 00 00 Sep 18 16:54:51 nas (ada1:ahcich1:0:0:0): CAM status: Command timeout Sep 18 16:54:51 nas (ada1:ahcich1:0:0:0): Retrying command Sep 18 16:55:21 nas ahcich1: Timeout on slot 14 port 0 And sometimes :=20 Sep 13 14:10:06 nas (aprobe0:ahcich1:0:0:0): ATA_IDENTIFY. ACB: ec 00 00 00= 00 40 00 00 00 00 00 00 Sep 13 14:10:06 nas (aprobe0:ahcich1:0:0:0): CAM status: Command timeout Sep 13 14:10:06 nas (aprobe0:ahcich1:0:0:0): Retrying command In extreme cases, I loose the drive, the pool goes degraded, and I have to reboot to bring the disk back.=20 There are more messages if the io speed is high : copy through network gives some messages, scrub gives a lot of messages and progress pauses (and can result of the lost of one disk). It can also affect the other drive (ada0), but more errors on the second one (ada1) I changed the SSD =3D> same issue with SAMSUNG 860 EVO and Crucial MX500. I changed the cables =3D> same issue=20 BIOS is up to date (HP micro server Gen8) - When I plug one of the SSD on another SATA2-only port on the motherboard = (so 1 SSD SATA2 and 1 SSD SATA3 on ASMedia controller) =3D> everything is fine = when I do a scrub, probably because ZFS is waiting for the slower drive =3D> data = flow is smaller - When I do a ZFS replace : HDD + HDD->SSD (3 drives connected during the replace - 2 on ASMedia controller and 1 on motherboard) : there are no erro= rs (HDD are limiting the speed). Issues start when the pool is SSD-only, on SA= TA3.=20 Config : Sep 13 15:11:29 nas ahci0: port 0x5000-0x5007,0x5008-0x500b,0x5010-0x5017,0x5018-0x501b,0x5020-0x503f mem 0xfbff0000-0xfbff01ff irq 16 at device 0.0 on pci1 Sep 13 15:11:29 nas ahci0: AHCI v1.20 with 2 6Gbps ports, Port Multiplier supported Sep 13 15:11:29 nas ahci0: quirks=3D0xc00000 Sep 13 15:11:29 nas ahcich0: at channel 0 on ahci0 Sep 13 15:11:29 nas ahcich1: at channel 1 on ahci0 Sep 13 15:11:29 nas ahci1: port 0x1000-0x1007,0x1008-0x100b,0x1010-0x1017,0x1018-0x101b,0x1020-0x103f = mem 0xfacd0000-0xfacd07ff irq 17 at device 31.2 on pci0 Sep 13 15:11:29 nas ahci1: AHCI v1.30 with 6 6Gbps ports, Port Multiplier supported Sep 13 15:11:29 nas ahcich2: at channel 0 on ahci1 Sep 13 15:11:29 nas ahcich3: at channel 1 on ahci1 Sep 13 15:11:29 nas ahcich4: at channel 2 on ahci1 Sep 13 15:11:29 nas ahcich5: at channel 3 on ahci1 Sep 13 15:11:29 nas ahcich6: at channel 4 on ahci1 Sep 13 15:11:29 nas ahcich7: at channel 5 on ahci1 Sep 13 15:11:29 nas ahciem0: on ahci1 --=20 You are receiving this mail because: You are the assignee for the bug.=