From: "Matthew Emmerton"
Delivered-To: freebsd-questions@freebsd.org
Subject: Console messages from ahc0 - how serious are they?
Date: Fri, 20 Sep 2002 01:47:59 -0400

I'm running 4.4-REL on a bunch of older systems with onboard Adaptec 7870 controllers, and recently I've been getting lots of console output from one of them. I'm pretty sure it means that one of my drives is dying (which I've suspected for a while), but I'm curious what the messages really mean, and whether I can determine the bad drive just from the messages.

The dmesg information (hardware probes):

ahc0: port 0xf800-0xf8ff mem 0xffbef000-0xffbeffff irq 11 at device 11.0 on pci0
aic7870: Wide Channel A, SCSI Id=7, 16/255 SCBs
da0 at ahc0 bus 0 target 0 lun 0
da0: Fixed Direct Access SCSI-2 device
da0: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing Enabled
da0: 2049MB (4197405 512 byte sectors: 64H 32S/T 2049C)
da1 at ahc0 bus 0 target 4 lun 0
da1: Fixed Direct Access SCSI-CCS device
da1: 3.300MB/s transfers
da1: 1033MB (2117025 512 byte sectors: 64H 32S/T 1033C)
da2 at ahc0 bus 0 target 6 lun 0
da2: Fixed Direct Access SCSI-2 device
da2: 10.000MB/s transfers (10.000MHz, offset 15), Tagged Queueing Enabled
da2: 4340MB (8888543 512 byte sectors: 64H 32S/T 4340C)

The console error messages:

(da0:ahc0:0:0:0): BDR message in message buffer
(da0:ahc0:0:0:0): SCB 0xe - timed out
ahc0: Dumping Card State in Data-in phase, at SEQADDR 0x7a
< snip dump data >
(da0:ahc0:0:0:0): no longer in timeout, status = 34b
ahc0: Issued Channel A Bus Reset. 3 SCBs aborted

I get these kinds of errors on da0, da1 and da2. However, I only see this message on da1:

(da1:ahc0:0:4:0): Unexpected busfree in Data-in phase

Is this the error that triggers all the bus resets (and the subsequent timeouts and aborts)? Should I look at replacing da1 real soon now?

--
Matt Emmerton
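
On the question of telling the drives apart from the messages alone, here is a minimal sketch in Python of one way to tally the console messages per device. It assumes the console output has been saved as plain text lines with the `(daN:ahc0:bus:target:lun):` prefix shown above; the names `tally` and `PERIPH_RE` are made up for illustration, and the sample input is only the handful of lines quoted in this message.

```python
import re
from collections import Counter, defaultdict

# Matches the device-path prefix on console lines,
# e.g. "(da1:ahc0:0:4:0): Unexpected busfree in Data-in phase"
PERIPH_RE = re.compile(r"^\((\w+):(\w+):(\d+):(\d+):(\d+)\):\s*(.*)$")


def tally(lines):
    """Count message kinds per device.

    Hex values such as SCB numbers are masked so that repeats of the
    same kind of message are grouped together.
    """
    counts = defaultdict(Counter)
    for line in lines:
        m = PERIPH_RE.match(line.strip())
        if not m:
            # Controller-level lines ("ahc0: Issued Channel A Bus Reset.")
            # carry no device prefix and are skipped here.
            continue
        device, message = m.group(1), m.group(6)
        message = re.sub(r"0x[0-9a-fA-F]+", "0x?", message)
        counts[device][message] += 1
    return counts


# Sample input: lines copied from the console output quoted above.
sample = """\
(da0:ahc0:0:0:0): BDR message in message buffer
(da0:ahc0:0:0:0): SCB 0xe - timed out
ahc0: Issued Channel A Bus Reset. 3 SCBs aborted
(da1:ahc0:0:4:0): Unexpected busfree in Data-in phase
""".splitlines()

if __name__ == "__main__":
    for device, messages in sorted(tally(sample).items()):
        for message, n in messages.most_common():
            print(f"{device}: {n} x {message}")
```

A tally like this only shows which device reports which kind of message, so a message that appears on a single device (such as the busfree on da1) stands out. It does not by itself establish the cause.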