From owner-freebsd-scsi Thu Jul 1 12:55:41 1999
Delivered-To: freebsd-scsi@freebsd.org
Received: from panzer.kdm.org (panzer.kdm.org [216.160.178.169])
    by hub.freebsd.org (Postfix) with ESMTP id 15E13155EB
    for ; Thu, 1 Jul 1999 12:55:36 -0700 (PDT)
    (envelope-from ken@panzer.kdm.org)
Received: (from ken@localhost)
    by panzer.kdm.org (8.9.3/8.9.1) id NAA44126;
    Thu, 1 Jul 1999 13:54:18 -0600 (MDT)
    (envelope-from ken)
Message-Id: <199907011954.NAA44126@panzer.kdm.org>
Subject: Re: FreeBSD panics with Mylex DAC960SX
In-Reply-To: <199907011848.NAA85021@aurora.sol.net> from Joe Greco at "Jul 1, 1999 01:47:59 pm"
To: jgreco@ns.sol.net (Joe Greco)
Date: Thu, 1 Jul 1999 13:54:18 -0600 (MDT)
Cc: scsi@freebsd.org
From: "Kenneth D. Merry"
X-Mailer: ELM [version 2.4ME+ PL54 (25)]
MIME-Version: 1.0
Content-Type: multipart/mixed; boundary=ELM930858858-44026-0_
Content-Transfer-Encoding: 7bit
Sender: owner-freebsd-scsi@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.org

--ELM930858858-44026-0_
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

Joe Greco wrote...
> > There are several questions I have, which I hope can be answered with some
> > diagnostic patches I've appended.
> >
> > 1. Why does the da1 announcement just print 'A' and not the rest of the
> >    line?
> >
> > 2. Why does the camcontrol read capacity output indicate that the Mylex
> >    array is not ready, yet an open immediately after that seems to pass
> >    the read capacity by just fine?
>
> If you're talking about when I do the camcontrol after waiting a bit -
> it is because the Mylex has completed its startup routine. Remember, all
> of this is only happening when the Mylex has not yet finished its startup
> procedure, which can happen in the events:
>
> 1) Things are OK but it just isn't done yet. This is a rare case, the
>    system just boots too slow with the RAM test and the 15-second boot
>    prompt.
>    If I ESC the RAM test and hit return to boot right away,
>    from a cold power on, then I can get the crash.
>
> 2) Things are not OK; i.e. the RAID is damaged or offline. This is the
>    case where I actually first noticed the behaviour.
>
> > 3. Assuming the read capacity is returned without an error, why does the
> >    Mylex return a bogus sector size at least? (indicated by your
> >    diagnostic output from the slice code above)
> >
> > Hopefully I can at least get a clue to the answers for 1 and 2 with the
> > patches appended.
> >
> > So, Joe, could you:
> >
> > - apply Bruce's patch (so you won't panic), or just keep the one you've got
>
> Kept mine.
>
> > - apply the attached patch to scsi_da.c
>
> Done.
>
> > - boot with -v (boot kernel -v at the loader prompt)
> > - send the output from the boot
>
> Whoa... okay. Well, I just modified the kernel. The same things apply
> that were being done; i.e. .profile does a camcontrol and then tries to do a
> fsck -p. Lots of output. I'm including it all since I don't know what is
> really significant.

[ ... ]

> That's it. Let me know if you need more.

Thanks for all the work on this!

I talked to Justin for a minute, and I think we've figured out what the
problem is. It's a little more complicated than this, but the simple
explanation is that we aren't doing the right thing when a command comes
back with just a sense key and no ASC or ASCQ. It's hard to believe we
haven't run into this before, but I think that's the problem.

Try applying the attached patch to scsi_all.c. It isn't the final patch for
this problem; the solution is probably a little more complicated than this.
But hopefully this will let us know whether the problem is what we think it
is. You should be able to boot okay with this patch, although you probably
won't be able to fsck or mount the Mylex array until it's ready to run.

Ken
--
Kenneth Merry
ken@plutotech.com

--ELM930858858-44026-0_
Content-Type: text/plain; charset=US-ASCII
Content-Disposition: attachment; filename=scsi_all.c.070199
Content-Description: scsi_all.c.070199
Content-Transfer-Encoding: 7bit

==== //depot/cam/sys/cam/scsi/scsi_all.c#59 - /a/ken/perforce/cam/sys/cam/scsi/scsi_all.c ====
*** /tmp/tmp.25512.0 Thu Jul 1 13:46:02 1999
--- /a/ken/perforce/cam/sys/cam/scsi/scsi_all.c Thu Jul 1 13:44:37 1999
***************
*** 765,771 ****
   * . . . . E - ENCLOSURE SERVICES DEVICE (SES)
   * DTLPWRSOMCAE ASC ASCQ Action Description
   * ------------ ---- ---- ------ -----------------------------------*/
! /* DTLPWRSOMCAE */{SST(0x00, 0x00, SS_NEPDEF,
                          "No additional sense information") },
  /* T S */{SST(0x00, 0x01, SS_DEF,
                          "Filemark detected") },
--- 765,771 ----
   * . . . . E - ENCLOSURE SERVICES DEVICE (SES)
   * DTLPWRSOMCAE ASC ASCQ Action Description
   * ------------ ---- ---- ------ -----------------------------------*/
! /* DTLPWRSOMCAE */{SST(0x00, 0x00, SS_DEF,
                          "No additional sense information") },
  /* T S */{SST(0x00, 0x01, SS_DEF,
                          "Filemark detected") },

--ELM930858858-44026-0_--

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-scsi" in the body of the message
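
For anyone following along, here is a rough sketch of the kind of lookup
the patch above touches. It is not the CAM code. The action names
(ACT_NONE, ACT_RETRY, ACT_FAIL, ACT_KEY_DEFAULT), the table layout and the
per-key policy in key_default() are all made up for illustration, and the
actual meanings of SS_NEPDEF and SS_DEF are not spelled out in this
message. The sketch only assumes that a "default" table action defers the
decision to the sense key, so that a reply carrying a sense key with
ASC/ASCQ 0x00/0x00 is still judged by its key. The byte offsets are the
standard fixed-format sense layout (key in the low nibble of byte 2, ASC in
byte 12, ASCQ in byte 13).

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical action codes; these are NOT the SS_* values in scsi_all.c. */
enum action { ACT_NONE, ACT_RETRY, ACT_FAIL, ACT_KEY_DEFAULT };

struct asc_entry {
    uint8_t asc;
    uint8_t ascq;
    enum action act;
    const char *desc;
};

/* Two-entry stand-in for the real ASC/ASCQ table. */
static const struct asc_entry asc_table[] = {
    { 0x00, 0x00, ACT_KEY_DEFAULT, "No additional sense information" },
    { 0x00, 0x01, ACT_KEY_DEFAULT, "Filemark detected" },
};

/* Illustrative per-key policy (an assumption, not FreeBSD's). */
static enum action key_default(uint8_t key)
{
    switch (key) {
    case 0x0: return ACT_NONE;  /* NO SENSE */
    case 0x1: return ACT_NONE;  /* RECOVERED ERROR */
    case 0x2: return ACT_RETRY; /* NOT READY */
    case 0x6: return ACT_RETRY; /* UNIT ATTENTION */
    default:  return ACT_FAIL;  /* everything else */
    }
}

/* Decide what to do with a fixed-format sense buffer. */
enum action sense_action(const uint8_t *sense, size_t len)
{
    uint8_t key, asc = 0, ascq = 0;
    size_t i;

    assert(sense != NULL);
    if (len < 3)
        return ACT_FAIL;        /* too short to hold a sense key */
    key = sense[2] & 0x0f;
    if (len >= 14) {            /* ASC/ASCQ present only in longer replies */
        asc = sense[12];
        ascq = sense[13];
    }
    for (i = 0; i < sizeof(asc_table) / sizeof(asc_table[0]); i++) {
        if (asc_table[i].asc == asc && asc_table[i].ascq == ascq) {
            if (asc_table[i].act != ACT_KEY_DEFAULT)
                return asc_table[i].act;  /* entry overrides the key */
            break;                        /* entry defers to the key */
        }
    }
    return key_default(key);
}
```

With this layout, a NOT READY reply that has no ASC/ASCQ still gets the
NOT READY treatment. If the 0x00/0x00 entry instead carried a fixed
"not an error" action, the sense key would never be consulted, which is
roughly the failure described in the message.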