From owner-freebsd-scsi@freebsd.org Thu Mar 3 17:12:52 2016
Subject: Re: mpr(4) SAS3008 Repeated Crashing
From: Scott Long
Date: Thu, 3 Mar 2016 10:09:37 -0700
To: Borja Marcos
Cc: Steven Hartland, FreeBSD-scsi
References: <56D5FDB8.8040402@freebsd.org> <56D612FA.6090909@multiplay.co.uk> <56D805FD.50500@multiplay.co.uk>

> On Mar 3, 2016, at 3:26 AM, Borja Marcos wrote:
>
>> On 03 Mar 2016, at 10:38, Steven Hartland wrote:
>>
>> We've seen HW issues before where the first thing to start triggering the problem was TRIM requests, it seems like its an afterthought in most FW's unfortunately, so one of the first things to go bad. I'm not saying this is you issue, but its something to keep in mind.
>
> Thanks :)
>
> Not trim related, it seems.
> I’ve ran the tests with kern.cam.X.delete_method set to DISABLE and I still see errors:
>
> In paranormal cases like this it would be awesome to have access to a logic analyzer… I keep dreaming of course :)
>
> Mar 3 11:12:53 clientes-ssd8 kernel: (noperiph:mpr0:0:4294967295:0): SMID 2 Aborting command 0xfffffe0000c7cab0
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): READ(10). CDB: 28 00 26 d7 d0 98 00 00 20 00 length 16384 SMID 322 terminated ioc 804b scsi 0 state c xfer 0
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): READ(10). CDB: 28 00 26 d7 d0 b8 00 00 18 00 length 12288 SMID 217 terminated ioc 804b scsi 0 state c xfer 0
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): SYNCHRONIZE CACHE(10). CDB: 35 00 00 00 00 00 00 00 00 00 length 0 SMID 205 terminated ioc 804b scsi 0 sta(da2:mpr0:0:26:0): READ(10). CDB: 28 00 26 d7 d0 18 00 00 78 00
> Mar 3 11:12:54 clientes-ssd8 kernel: te c xfer 0
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): CAM status: Command timeout
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): Retrying command
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): SYNCHRONIZE CACHE(10). CDB: 35 00 00 00 00 00 00 00 00 00
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): CAM status: SCSI Status Error
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): SCSI status: Check Condition
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): SCSI sense: UNIT ATTENTION asc:29,0 (Power on, reset, or bus device reset occurred)
> Mar 3 11:12:54 clientes-ssd8 kernel: (da2:mpr0:0:26:0): Retrying command (per sense data)

SYNC CACHE seems to have been involved this time, and while it’s sometimes a source of trouble with SATA disks, I’m very hesitant to blame it. Given the seemingly random nature of your problems, I’m not as certain anymore to rule out a fault of the disk enclosure.
This looks to be a different disk than your last report, and your statement that a sibling system exhibits no problems is very interesting. Maybe there’s an issue with the power supply, and the disks are getting under-voltage conditions periodically. If you can run smartctl against the disks, the output might be useful. Also, if you’re able, could you make sure that both this system and the one that is working well are being fed with sufficient and similar AC power? And if the power supply modules in your enclosures are swappable, maybe swap them between systems and see if the problem follows the module? If that doesn’t fix it then I’ll think of ways to provide more instrumentation.

Scott
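
For anyone reading the quoted kernel log, the READ(10) CDB bytes can be unpacked by hand to see which blocks the terminated commands were addressing. The sketch below is not part of the original thread; it assumes the standard 10-byte READ(10) layout (opcode 0x28, a 4-byte big-endian LBA, a 2-byte big-endian transfer length), and the helper name decode_read10 is made up for illustration.

```python
import struct


def decode_read10(cdb_hex: str) -> dict:
    """Decode a READ(10) CDB given as space-separated hex bytes."""
    cdb = bytes.fromhex(cdb_hex)
    if len(cdb) != 10 or cdb[0] != 0x28:
        raise ValueError("not a 10-byte READ(10) CDB")
    # Layout: opcode(1) flags(1) LBA(4, big-endian) group(1)
    #         transfer length(2, big-endian) control(1)
    _, _, lba, _, blocks, _ = struct.unpack(">BBIBHB", cdb)
    return {"lba": lba, "blocks": blocks}


if __name__ == "__main__":
    # CDBs copied from the quoted log lines for da2
    for cdb in ("28 00 26 d7 d0 98 00 00 20 00",
                "28 00 26 d7 d0 b8 00 00 18 00"):
        info = decode_read10(cdb)
        print(f"{cdb}: lba=0x{info['lba']:08x} blocks={info['blocks']}")
```

The first CDB decodes to 32 blocks, which matches the logged "length 16384" if the disk uses 512-byte logical blocks (an inference from the arithmetic, not something stated in the thread).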