Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 18 Mar 2003 11:20:56 -0700
From:      Scott Long <scott_long@btc.adaptec.com>
To:        Matthew Reimer <mreimer@vpop.net>
Cc:        scsi@freebsd.org
Subject:   Re: Help interpreting SCSI errors
Message-ID:  <3E776388.4030607@btc.adaptec.com>
In-Reply-To: <3E77594A.7020702@vpop.net>

index | next in thread | previous in thread | raw e-mail

Matthew Reimer wrote:
> I'm trying to track down why I'm seeing this kind of error in our log 
> files:
> 
> swap_pager: indefinite wait buffer: device: #da/0x20001, blkno: 608, 
> size: 4096
> swap_pager: indefinite wait buffer: device: #da/0x20001, blkno: 7568, 
> size: 4096
> 
> We're running -stable with an Adaptec 2110S RAID controller and three 
> disks on SCSI ids 3, 4, and 5 in a RAID5 configuration.
> 
> Running "raidutil -e nonrecov d0" shows several sequences like the 
> following (separated by varying amounts of time). Every time, the 
> initial "Bad SCSI Status - Check Condition" comes from id 5.
> 
> Can anyone interpret the initial "bad scsi status" that kicks off the 
> bus reset, etc.? Whatever it is, it isn't degrading the volume, but if a 
> disk is going bad or if there's some other problem I would like to know.
> 
> Thanks in advance for any help you can offer.
> 
> Matt
> 

This is a very strange problem.  The initial sense information gives a
code of 0x0/0x0, which literally means 'no additional sense
information'.  Afterwards, the drive complains about the bus resets that
the card initiated.  So, there really isn't any useful information here.
It's possible that the drive is going bad, or that you are having
problems with cables or connectors, but it's hard to say.  Probably the
easiest thing to do would be to try swapping the drive out.

Scott



> ----
> 
> 03/13/2003  13:37:48   Level 3
> HBA=0 BUS=0 ID=5 LUN=0
> Bad SCSI Status - Check Condition
> 28 00 02 5C EE BF 00 00 20 00 00 00
> 
> 
> 03/13/2003  13:37:48   Level 3
> HBA=0 BUS=0 ID=5 LUN=0
> Request Sense
> 70 00 00 00 00 00 00 18 00 00 00 00 00 00 00 00 00 00
> No Sense
> 
> 
> 03/13/2003  13:38:24   Level 3
> Bus reset occurred on channel 0 - Command watchdog time-out caused the 
> bus to be
>  reset
> 
> 
> 03/13/2003  13:38:24   Level 3
> HBA=0 BUS=0 ID=5 LUN=0
> HBA Error - SCSI Bus Reset
> 
> 
> 03/13/2003  13:38:24   Level 3
> HBA=0 BUS=0 ID=5 LUN=0
> HBA Error - SCSI Bus Reset
> 
> 
> 03/13/2003  13:38:24   Level 3
> HBA=0 BUS=0 ID=5 LUN=0
> HBA Error - SCSI Bus Reset
> 
> 
> 03/13/2003  13:38:24   Level 3
> HBA=0 BUS=0 ID=5 LUN=0
> HBA Error - SCSI Bus Reset
> 
> 
> 03/13/2003  13:38:24   Level 3
> HBA=0 BUS=0 ID=5 LUN=0
> HBA Error - SCSI Bus Reset
> 
> 
> 03/13/2003  13:38:27   Level 3
> HBA=0 BUS=0 ID=5 LUN=0
> Bad SCSI Status - Check Condition
> 28 00 02 5C EA 7F 00 00 01 00 00 00
> 
> 
> 03/13/2003  13:38:27   Level 3
> HBA=0 BUS=0 ID=5 LUN=0
> Request Sense
> 70 00 06 00 00 00 00 18 00 00 00 00 29 02 00 00 00 00
> Unit Attention
> 
> 
> 03/13/2003  13:38:27   Level 3
> HBA=0 BUS=0 ID=4 LUN=0
> Bad SCSI Status - Check Condition
> 2A 00 02 5C E8 7F 00 00 01 00 00 00
> 
> 
> 03/13/2003  13:38:27   Level 3
> HBA=0 BUS=0 ID=4 LUN=0
> Request Sense
> 70 00 06 00 00 00 00 18 00 00 00 00 29 02 00 00 00 00
> Unit Attention
> 
> 
> 03/13/2003  13:38:27   Level 3
> HBA=0 BUS=0 ID=3 LUN=0
> Bad SCSI Status - Check Condition
> 2A 00 02 7D 8E 9F 00 00 20 00 00 00
> 
> 
> 03/13/2003  13:38:27   Level 3
> HBA=0 BUS=0 ID=3 LUN=0
> Request Sense
> 70 00 06 00 00 00 00 18 00 00 00 00 29 02 00 00 00 00
> Unit Attention
> 
> ----
> 
> 
> To Unsubscribe: send mail to majordomo@FreeBSD.org
> with "unsubscribe freebsd-scsi" in the body of the message



To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-scsi" in the body of the message



help

Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?3E776388.4030607>