From owner-freebsd-scsi@freebsd.org Mon Apr 25 12:17:46 2016 Return-Path: Delivered-To: freebsd-scsi@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 5023AB1B370 for ; Mon, 25 Apr 2016 12:17:46 +0000 (UTC) (envelope-from dan@langille.org) Received: from clavin1.langille.org (clavin.langille.org [162.208.116.86]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "clavin.langille.org", Issuer "StartCom Class 2 Primary Intermediate Server CA" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 2F4FB1F2E for ; Mon, 25 Apr 2016 12:17:45 +0000 (UTC) (envelope-from dan@langille.org) Received: from (clavin1.int.langille.org (clavin1.int.unixathome.org [10.4.7.7]) (Authenticated sender: hidden) with ESMTPSA id 43D373D2 for ; Mon, 25 Apr 2016 12:17:30 +0000 (UTC) From: Dan Langille Message-Id: <5EEF0794-B06E-4A72-89DA-7DCD94AE1FC6@langille.org> Mime-Version: 1.0 (Mac OS X Mail 9.3 \(3124\)) Subject: Re: terminated ioc 804b scsi 0 state c xfer 0 Date: Mon, 25 Apr 2016 08:17:30 -0400 References: <2E8752E5-76AF-4042-86D9-8C6733658A80@langille.org> To: freebsd-scsi@freebsd.org In-Reply-To: <2E8752E5-76AF-4042-86D9-8C6733658A80@langille.org> X-Mailer: Apple Mail (2.3124) Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.21 X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.21 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 25 Apr 2016 12:17:46 -0000 >=20 > On Apr 24, 2016, at 9:35 AM, Dan Langille wrote: >=20 > More of the pasted output is also at = https://gist.github.com/dlangille/1fa3135334089c6603e2ec5da946d9ae = and = added smartctl output. >=20 > I have a FreeBSD 10.2-RELEASE-p14 box in which there is an LSI SAS2008 = card. It's running a zfs root system. >=20 > This morning the system was unresponsive via ssh. Attempts to log in = at the console did not yield a password prompt. >=20 > A power cycle brought the system online. Inspecting = /var/log/messages, I found about 63,000 entries similar to those which = appear below. >=20 > zpool status of all are OK. A scrub is in progress for one pool (since = before this issue arose). da7 is in that pool. >=20 >=20 > Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 = 8d 90 c6 18 00 00 10 00 length 8192 SMID 774 terminated ioc 804b scsi 0 = state c xfer 0 > Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 = 8b d9 97 70 00 00 20 00 length 16384 SMID 614 terminated ioc 804b scsi 0 = state c xfer 0 > Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 = 8b d9 97 50 00 00 20 00 length 16384 SMID 792 terminated ioc 804b scsi 0 = state c xfer 0 > Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 = 8b d9 97 08 00 00 20 00 length 16384 SMID 974 terminated ioc 804b scsi 0 = state c xfer 0 > Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 = 8b 6f ef 50 00 00 08 00 length 4096 SMID 674 terminated ioc 804b scsi 0 = state c xfer 0 > Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): WRITE(10). CDB: 2a 00 = 8b 0f a2 48 00 00 18 00 length 12288 SMID 177 terminated ioc 804b scsi 0 = state c xfer 12288 > Apr 24 11:25:55 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 = ab 8f a1 38 00 00 08 00 length 4096 SMID 908 terminated ioc 804b scsi 0 = state c xfer 0 > Apr 24 11:25:56 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 = 8b d9 97 70 00 00 20 00 length 16384 SMID 376 terminated ioc 804b scsi 0 = state c xfer 0 > Apr 24 11:25:56 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 = 8b d9 97 50 00 00 20 00 length 16384 SMID 172 terminated ioc 804b scsi 0 = state c xfer 0 >=20 > Is this a cabling issue? The drive is a SATA device (smartctl output = in the URL above). Anyone familiar with these errors? This morning: 13410079654596185797 REMOVED 0 0 0 was /dev/da7p3 At least I know i'm looking for Serial Number: 13Q8PNBYS =46rom the logs: Apr 25 05:34:50 knew kernel: da7 at mps1 bus 0 scbus1 target 17 lun 0 Apr 25 05:34:50 knew kernel: da7: s/n = 13Q8PNBYS detached Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 d8 = 33 53 e0 00 00 08 00 length 4096 SMID 88 terminated ioc 804b scsi 0 = state c xfer 0 Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 d8 = 33 26 f8 00 00 20 00 length 16384 SMID 204 terminated ioc 804b scsi 0 = state c xfer(da7:mps1:0:17:0): READ(10). CDB: 28 00 d8 33 53 e0 00 00 08 = 00=20 Apr 25 05:34:51 knew kernel: 0 Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): CAM status: = Unconditionally Re-queue Request Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 d8 = 33 26 d8 00 00 20 00 length 16384 SMID 260 terminated ioc 804b scsi 0 = state c xfer(da7: 0 Apr 25 05:34:51 knew kernel: mps1:0: (da7:mps1:0:17:0): READ(10). = CDB: 28 00 e6 6c 42 40 00 00 10 00 length 8192 SMID 484 terminated ioc = 804b scsi 0 state c xfer 17:0 Apr 25 05:34:51 knew kernel: 0): (da7:mps1:0:17:0): WRITE(10). = CDB: 2a 00 e4 d8 2a 90 00 00 90 00 length 73728 SMID 548 terminated ioc = 804b scsi 0 state c xfeError 5, Periph was invalidated Apr 25 05:34:51 knew kernel: r 0 Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 d8 = 33 26 f8 00 00 20 00=20 Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 4d = ac ed b8 00 00 08 00 length 4096 SMID 435 terminated ioc 804b scsi 0 = state c xfer (da7:mps1:0:17:0): CAM status: Unconditionally Re-queue = Request Apr 25 05:34:51 knew kernel: 0 Apr 25 05:34:51 knew kernel: (da7:mps1: mps1:0:IOCStatus =3D 0x4b while = resetting device 0xa Apr 25 05:34:51 knew kernel: 17:mps1: 0): Unfreezing devq for target ID = 17 Apr 25 05:34:51 knew kernel: Error 5, Periph was invalidated Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 d8 = 33 26 d8 00 00 20 00=20 Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): CAM status: = Unconditionally Re-queue Request Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): Error 5, Periph was = invalidated Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 e6 = 6c 42 40 00 00 10 00=20 Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): CAM status: = Unconditionally Re-queue Request Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): Error 5, Periph was = invalidated Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): WRITE(10). CDB: 2a 00 e4 = d8 2a 90 00 00 90 00=20 Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): CAM status: = Unconditionally Re-queue Request Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): Error 5, Periph was = invalidated Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): READ(10). CDB: 28 00 4d = ac ed b8 00 00 08 00=20 Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): CAM status: = Unconditionally Re-queue Request Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): Error 5, Periph was = invalidated Apr 25 05:34:51 knew kernel: GEOM_MIRROR: Device swap: provider da7p2 = disconnected. Apr 25 05:34:51 knew devd: Executing 'logger -p kern.notice -t ZFS 'vdev = is removed, pool_guid=3D15378250086669402288 = vdev_guid=3D13410079654596185797'' Apr 25 05:34:51 knew kernel: (da7:mps1:0:17:0): Periph destroyed Apr 25 05:34:51 knew ZFS: vdev is removed, = pool_guid=3D15378250086669402288 vdev_guid=3D13410079654596185797 --=20 Dan Langille - BSDCan / PGCon dan@langille.org