From: Dustin Wenz
Date: Fri, 8 Jun 2012 16:25:31 -0500
To: freebsd-scsi@freebsd.org
Subject: Marginal disks prevent boot with mps(4)
Message-Id: <60F17E0E-EE4A-4F37-9925-055315B987B1@ebureau.com>

I just installed a build of 9.0-STABLE in order to test the changes since release.
I was hoping that some of the error-handling in mps would alter the behavior I've seen with some SATA disks (particularly, Seagate ST3000DM001 disks) connected through an LSI SAS 9201-16e HBA.

It is apparently possible for these disks to get in a state where their presence prevents the machine from booting. This problem has existed for some time, according to some archive-searching I've done, but there isn't much consensus on how to fix it.

The disks are good enough that they can be probed at startup, but some part of initialization cannot complete. This is the message I see repeated forever upon boot (the probe number does change slightly):

    (probe14:mps0:0:14:0): INQUIRY. CDB: 12 0 0 0 24 0 length 36 SMID 215 terminated ioc 804b scsi 0 state c xfer 0

There is a comment in mps_sas.c which suggests that this error is usually transient, but that seems not to be the case here. Can anyone suggest a modification that might permit booting in this state?

	- .Dustin
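
For reference, the "ioc 804b" and "state c" fields in the boot message can be split into their component bits. The following is only a rough sketch: the constant values are assumptions taken from memory of the LSI MPI 2.0 headers (mpi2.h and mpi2_init.h) and should be checked against the driver source, and the function names are made up for illustration.

```python
# Assumed values, recalled from the MPI 2.0 headers; verify before relying on them.
IOCSTATUS_FLAG_LOG_INFO_AVAILABLE = 0x8000  # high bit: extra log info is attached
IOCSTATUS_MASK = 0x7FFF                     # low 15 bits: the status code itself
IOCSTATUS_SCSI_IOC_TERMINATED = 0x004B      # command terminated by the IOC

# Assumed bit meanings of the SCSI state byte in the reply frame.
SCSI_STATE_BITS = {
    0x01: "AUTOSENSE_VALID",
    0x02: "AUTOSENSE_FAILED",
    0x04: "NO_SCSI_STATUS",
    0x08: "TERMINATED",
    0x10: "RESPONSE_INFO_VALID",
}


def decode_ioc_status(raw):
    """Return (status_code, log_info_available) for a raw IOCStatus value."""
    return raw & IOCSTATUS_MASK, bool(raw & IOCSTATUS_FLAG_LOG_INFO_AVAILABLE)


def decode_scsi_state(raw):
    """Return the names of the bits set in a raw SCSI state value, lowest bit first."""
    return [name for bit, name in sorted(SCSI_STATE_BITS.items()) if raw & bit]


if __name__ == "__main__":
    code, has_log_info = decode_ioc_status(0x804B)
    print(hex(code), has_log_info, code == IOCSTATUS_SCSI_IOC_TERMINATED)
    print(decode_scsi_state(0x0C))
```

If those values are right, the message says the IOC itself terminated the INQUIRY and no SCSI status came back from the disk, which would be consistent with the probe never completing.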