From owner-freebsd-hardware Tue Oct 13 19:28:46 1998 Return-Path: Received: (from majordom@localhost) by hub.freebsd.org (8.8.8/8.8.8) id TAA19557 for freebsd-hardware-outgoing; Tue, 13 Oct 1998 19:28:46 -0700 (PDT) (envelope-from owner-freebsd-hardware@FreeBSD.ORG) Received: from bastuba.partitur.se (bastuba.partitur.se [193.219.246.194]) by hub.freebsd.org (8.8.8/8.8.8) with ESMTP id TAA19546 for ; Tue, 13 Oct 1998 19:28:41 -0700 (PDT) (envelope-from girgen@partitur.se) Received: from partitur.se (solist.partitur.se [193.219.246.204]) by bastuba.partitur.se (8.8.8/8.8.8) with ESMTP id EAA20935 for ; Wed, 14 Oct 1998 04:28:25 +0200 (CEST) (envelope-from girgen@partitur.se) Message-ID: <36240C47.ED276B4B@partitur.se> Date: Wed, 14 Oct 1998 04:28:23 +0200 From: Palle Girgensohn Organization: Partitur X-Mailer: Mozilla 4.5b2 [en] (X11; I; SunOS 5.6 sun4u) X-Accept-Language: sv,en MIME-Version: 1.0 To: freebsd-hardware@FreeBSD.ORG Subject: pax killing scsi w/ 2940UW (ahc) Content-Type: text/plain; charset=us-ascii Content-Transfer-Encoding: 7bit Sender: owner-freebsd-hardware@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.org Hi! Yesterday I wanted to move a directory structure to a different disk on one of our servers. Suddenly, the scsi bus was reset, and reset again and again. The system did hardly accept input. After about a minute or so, with the taper going nuts (from all the resets), the system rebooted??!! Nothing similar has ever happened before, not to me! Since this was midday, with people working, it was not good. :) No problem getting it up again, though. This evening, I though I'd take antoher shot at it. Same thing happens, only this time i hit ctrl-C, and pax stops fine. I'm using an adaptec 2940UW on a pentium Pro with four disks and a taper unit. The last disk is pretty newly installed. I have compiled the kernel with these options: options AHC_SCBPAGING_ENABLE options AHC_ALLOW_MEMIO options AHC_TAGENABLE This has been working fine for more than a year, but maybe this was too much for it? Don't remember moving large quantities with pax before; I probably used tar. uname -a: FreeBSD trumpet.partitur.se 2.2.7-STABLE FreeBSD 2.2.7-STABLE #0: Mon Sep 28 16:07:19 CEST 1998 girgen@trumpet.partitur.se:/disk1/src/sys/compile/TRUMPET i386 Here's a demsg starting from the problem time: sd3(ahc0:3:0): SCB 0xa - timed out in dataout phase, SCSISIGI == 0x4 SEQADDR = 0x127 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0x2 Ordered Tag queued sd3(ahc0:3:0): SCB 0xf timedout while recovery in progress sd3(ahc0:3:0): SCB 0x7 timedout while recovery in progress sd0(ahc0:0:0): SCB 0x1 timedout while recovery in progress sd3(ahc0:3:0): SCB 0x5 timedout while recovery in progress sd3(ahc0:3:0): SCB 0xa - timed out in dataout phase, SCSISIGI == 0x4 SEQADDR = 0x127 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0x2 sd3(ahc0:3:0): abort message in message buffer sd3(ahc0:3:0): SCB 0x7 - timed out in dataout phase, SCSISIGI == 0x14 SEQADDR = 0x127 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0x2 sd3(ahc0:3:0): no longer in timeout sd3(ahc0:3:0): no longer in timeout ahc0: Issued Channel A Bus Reset. 6 SCBs aborted sd3(ahc0:3:0): UNIT ATTENTION asc:29,0 sd3(ahc0:3:0): Power on, reset, or bus device reset occurred , retries:3 sd2(ahc0:2:0): UNIT ATTENTION asc:29,0 sd2(ahc0:2:0): Power on, reset, or bus device reset occurred , retries:4 sd0(ahc0:0:0): UNIT ATTENTION asc:29,0 sd0(ahc0:0:0): Power on, reset, or bus device reset occurred field replaceable unit: 1 , retries:3 sd1(ahc0:1:0): UNIT ATTENTION asc:29,0 sd1(ahc0:1:0): Power on, reset, or bus device reset occurred , retries:4 sd2(ahc0:2:0): SCB 0xa - timed out in dataout phase, SCSISIGI == 0x4 SEQADDR = 0x127 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0x2 Ordered Tag queued sd2(ahc0:2:0): SCB 0x2 timedout while recovery in progress sd3(ahc0:3:0): SCB 0xf timedout while recovery in progress sd2(ahc0:2:0): SCB 0xa - timed out in dataout phase, SCSISIGI == 0x4 SEQADDR = 0x126 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0x2 sd3(ahc0:3:0): abort message in message buffer sd3(ahc0:3:0): SCB 0xf - timed out in dataout phase, SCSISIGI == 0x14 SEQADDR = 0x126 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0x2 sd2(ahc0:2:0): no longer in timeout sd3(ahc0:3:0): no longer in timeout ahc0: Issued Channel A Bus Reset. 4 SCBs aborted sd2(ahc0:2:0): UNIT ATTENTION asc:29,0 sd2(ahc0:2:0): Power on, reset, or bus device reset occurred , retries:3 sd3(ahc0:3:0): UNIT ATTENTION asc:29,0 sd3(ahc0:3:0): Power on, reset, or bus device reset occurred , retries:3 sd0(ahc0:0:0): UNIT ATTENTION asc:29,0 sd0(ahc0:0:0): Power on, reset, or bus device reset occurred field replaceable unit: 1 , retries:3 sd2(ahc0:2:0): SCB 0x4 - timed out in dataout phase, SCSISIGI == 0x4 SEQADDR = 0x127 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0x2 Ordered Tag queued sd2(ahc0:2:0): SCB 0xa timedout while recovery in progress sd3(ahc0:3:0): SCB 0x7 timedout while recovery in progress sd2(ahc0:2:0): SCB 0x1 timedout while recovery in progress sd0(ahc0:0:0): SCB 0xe timedout while recovery in progress sd2(ahc0:2:0): SCB 0x4 - timed out in dataout phase, SCSISIGI == 0x4 SEQADDR = 0x127 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0x2 sd3(ahc0:3:0): abort message in message buffer sd2(ahc0:2:0): SCB 0xc timedout while recovery in progress sd0(ahc0:0:0): SCB 0x8 timedout while recovery in progress sd0(ahc0:0:0): SCB 0x5 timedout while recovery in progress sd3(ahc0:3:0): SCB 0xf timedout while recovery in progress sd3(ahc0:3:0): SCB 0xd timedout while recovery in progress sd3(ahc0:3:0): SCB 0x2 timedout while recovery in progress sd0(ahc0:0:0): SCB 0x9 timedout while recovery in progress sd3(ahc0:3:0): SCB 0x7 - timed out in dataout phase, SCSISIGI == 0x14 SEQADDR = 0x126 SCSISEQ = 0x12 SSTAT0 = 0x0 SSTAT1 = 0x2 sd2(ahc0:2:0): no longer in timeout sd3(ahc0:3:0): no longer in timeout ahc0: Issued Channel A Bus Reset. 12 SCBs aborted sd2(ahc0:2:0): UNIT ATTENTION asc:29,0 sd2(ahc0:2:0): Power on, reset, or bus device reset occurred , retries:3 sd3(ahc0:3:0): UNIT ATTENTION asc:29,0 sd3(ahc0:3:0): Power on, reset, or bus device reset occurred , retries:3 sd0(ahc0:0:0): UNIT ATTENTION asc:29,0 sd0(ahc0:0:0): Power on, reset, or bus device reset occurred field replaceable unit: 1 , retries:3 sd1(ahc0:1:0): UNIT ATTENTION asc:29,0 I will try removing the tagenable option. Wonder if it'll help? What else can be the cause? /Palle To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-hardware" in the body of the message