Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 1 Jul 1999 13:54:18 -0600 (MDT)
From:      "Kenneth D. Merry" <ken@plutotech.com>
To:        jgreco@ns.sol.net (Joe Greco)
Cc:        scsi@freebsd.org
Subject:   Re: FreeBSD panics with Mylex DAC960SX
Message-ID:  <199907011954.NAA44126@panzer.kdm.org>
In-Reply-To: <199907011848.NAA85021@aurora.sol.net> from Joe Greco at "Jul 1, 1999 01:47:59 pm"

next in thread | previous in thread | raw e-mail | index | archive | help

--ELM930858858-44026-0_
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit

Joe Greco wrote...
> > There are several questions I have, which I hope can be answered with some
> > diagnostic patches I've appended.
> > 
> > 1.  Why does the da1 announcement just print 'A' and not the rest of the
> >     line?
> > 
> > 2.  Why does the camcontrol read capacity output indicate that the Mylex
> >     array is not ready, yet an open immediately after that seems to pass
> >     the read capacity by just fine?
> 
> If you're talking about when I do the camcontrol after waiting a bit -
> it is because the Mylex has completed its startup routine.  Remember, all
> of this is only happening when the Mylex has not yet finished its startup
> procedure, which can happen in the events:
> 
> 1) Things are OK but it just isn't done yet.  This is a rare case, the
>    system just boots too slow with the RAM test and the 15-second boot
>    prompt.  If I ESC the RAM test and hit return to boot right away,
>    from a cold power on, then I can get the crash.
> 
> 2) Things are not OK; i.e. the RAID is damaged or offline.  This is the
>    case where I actually first noticed the behaviour.
> 
> > 3.  Assuming the read capacity is returned without an error, why does the
> >     Mylex return a bogus sector size at least?  (indicated by your
> >     diagnostic output from the slice code above)
> > 
> > Hopefully I can at least get a clue to the answers for 1 and 2 with the
> > patches appended.
> > 
> > So, Joe, could you:
> > 
> > - apply Bruce's patch (so you won't panic), or just keep the one you've got
> 
> Kept mine.
> 
> > - apply the attached patch to scsi_da.c
> 
> Done.
> 
> > - boot with -v (boot kernel -v at the loader prompt)
> > - send the output from the boot
> 
> Whoa...  okay.  Well, I just modified the kernel.  The same things apply
> that were being done; i.e. .profile does a camcontrol and then tries to do a
> fsck -p.  Lots of output.  I'm including it all since I don't know what is
> really significant.

[ ... ]

> That's it.  Let me know if you need more.

Thanks for all the work on this!  I talked to Justin for a minute, and I
think we've figured out what the problem is.

It's a little more complicated than this, but the simple explanation is
that we aren't doing the right thing when a command comes back with just a
sense key and no ASC or ASCQ.  It's hard to believe we haven't run into
this before, but I think that's the problem.

Try applying the attached patch to scsi_all.c.  It isn't the final patch
for this problem, the solution is probably a little more complicated than
this.  But hopefully this will let us know whether the problem is what we
think it is.

You should be able to boot okay with this patch, although you probably
won't be able to fsck or mount the Mylex array until it's ready to run.

Ken
-- 
Kenneth Merry
ken@plutotech.com

--ELM930858858-44026-0_
Content-Type: text/plain; charset=US-ASCII
Content-Disposition: attachment; filename=scsi_all.c.070199
Content-Description: scsi_all.c.070199
Content-Transfer-Encoding: 7bit

==== //depot/cam/sys/cam/scsi/scsi_all.c#59 - /a/ken/perforce/cam/sys/cam/scsi/scsi_all.c ====
*** /tmp/tmp.25512.0	Thu Jul  1 13:46:02 1999
--- /a/ken/perforce/cam/sys/cam/scsi/scsi_all.c	Thu Jul  1 13:44:37 1999
***************
*** 765,771 ****
   * .  .  .  . E - ENCLOSURE SERVICES DEVICE (SES)
   * DTLPWRSOMCAE        ASC   ASCQ  Action  Description
   * ------------        ----  ----  ------  -----------------------------------*/
! /* DTLPWRSOMCAE */{SST(0x00, 0x00, SS_NEPDEF,
  			"No additional sense information") },
  /*  T    S      */{SST(0x00, 0x01, SS_DEF,
  			"Filemark detected") },
--- 765,771 ----
   * .  .  .  . E - ENCLOSURE SERVICES DEVICE (SES)
   * DTLPWRSOMCAE        ASC   ASCQ  Action  Description
   * ------------        ----  ----  ------  -----------------------------------*/
! /* DTLPWRSOMCAE */{SST(0x00, 0x00, SS_DEF,
  			"No additional sense information") },
  /*  T    S      */{SST(0x00, 0x01, SS_DEF,
  			"Filemark detected") },

--ELM930858858-44026-0_--


To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-scsi" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?199907011954.NAA44126>