From: "Matthew Emmerton" <matt@gsicomp.on.ca>
To: <freebsd-questions@freebsd.org>
Subject: Console messages from ahc0 - how serious are they?
Date: Fri, 20 Sep 2002 01:47:59 -0400


I'm running 4.4-REL on a bunch of older systems with onboard Adaptec 7870
controllers and recently I've been getting lots of console output from one
of them.  I'm pretty sure it means that one of my drives is dying (which
I've suspected for a while), but I'm just curious as to what the messages
really mean, and if I can determine the bad drive just from the messages.

The dmesg information (hardware probes):

ahc0: <Adaptec aic7870 SCSI adapter> port 0xf800-0xf8ff mem
0xffbef000-0xffbeffff irq 11 at device 11.0 on pci0
aic7870: Wide Channel A, SCSI Id=7, 16/255 SCBs
da0 at ahc0 bus 0 target 0 lun 0
da0: <SEAGATE ST32430N 0510> Fixed Direct Access SCSI-2 device
da0: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing Enabled
da0: 2049MB (4197405 512 byte sectors: 64H 32S/T 2049C)
da1 at ahc0 bus 0 target 4 lun 0
da1: <FUJITSU M2694E-512 8134> Fixed Direct Access SCSI-CCS device
da1: 3.300MB/s transfers
da1: 1033MB (2117025 512 byte sectors: 64H 32S/T 1033C)
da2 at ahc0 bus 0 target 6 lun 0
da2: <OEM DCRS04Z 0101> Fixed Direct Access SCSI-2 device
da2: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing Enabled
da2: 4340MB (8888543 512 byte sectors: 64H 32S/T 4340C)

The console error messages:

(da0:ahc0:0:0:0): BDR message in message buffer
(da0:ahc0:0:0:0): SCB 0xe - timed out
ahc0: Dumping Card State in Data-in phase, at SEQADDR 0x7a
< snip dump data >
(da0:ahc0:0:0:0): no longer in timeout, status = 34b
ahc0: Issued Channel A Bus Reset.  3 SCBs aborted

I get these kinds of errors on da0, da1 and da2.  However, I only see this
message on da1:

(da1:ahc0:0:4:0): Unexpected busfree in Data-in phase

Is this the error that triggers all the bus resets (and the subsequent
timeouts and aborts)?  Should I look at replacing da1 real soon now?
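
For what it's worth, one rough way to see which target the messages
cluster on is to tally them per device.  The Python below is only a
sketch: the names PREFIX, tally and SAMPLE and the normalisation rule
are my own assumptions, and the sample lines are just the ones quoted
above.  A high count for one device does not by itself prove that drive
is the faulty one, since a bus reset aborts commands on every target.

```python
import re
from collections import Counter

# Per-device prefix as it appears in the console output above:
# (device:controller:bus:target:lun): message
PREFIX = re.compile(r"^\((\w+):(\w+):(\d+):(\d+):(\d+)\):\s*(.*)$")


def tally(lines):
    """Count console messages per (device, message kind)."""
    counts = Counter()
    for line in lines:
        m = PREFIX.match(line.strip())
        if not m:
            # Controller-level lines ("ahc0: ...") name no target; skip them.
            continue
        device, message = m.group(1), m.group(6)
        # Replace hex SCB ids and status values with "N" so that
        # repeats of the same kind of message are grouped together.
        kind = re.sub(r"0x[0-9a-fA-F]+|= \w+", "N", message)
        counts[(device, kind)] += 1
    return counts


# Hypothetical input: the lines quoted earlier in this message.
SAMPLE = [
    "(da0:ahc0:0:0:0): BDR message in message buffer",
    "(da0:ahc0:0:0:0): SCB 0xe - timed out",
    "ahc0: Dumping Card State in Data-in phase, at SEQADDR 0x7a",
    "(da0:ahc0:0:0:0): no longer in timeout, status = 34b",
    "ahc0: Issued Channel A Bus Reset.  3 SCBs aborted",
    "(da1:ahc0:0:4:0): Unexpected busfree in Data-in phase",
]

if __name__ == "__main__":
    for (device, kind), n in sorted(tally(SAMPLE).items()):
        print(f"{device}  {n:3d}  {kind}")
```

Feeding it the full console log instead of SAMPLE would show whether
the "Unexpected busfree" lines really are confined to da1.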

--
Matt Emmerton


