From owner-freebsd-questions Wed Apr 21 14:50: 6 1999 Delivered-To: freebsd-questions@freebsd.org Received: from resnet.uoregon.edu (resnet.uoregon.edu [128.223.144.32]) by hub.freebsd.org (Postfix) with ESMTP id 90F7414EE3 for ; Wed, 21 Apr 1999 14:49:59 -0700 (PDT) (envelope-from dwhite@resnet.uoregon.edu) Received: from localhost (dwhite@localhost) by resnet.uoregon.edu (8.8.8/8.8.8) with ESMTP id OAA23142; Wed, 21 Apr 1999 14:47:29 -0700 (PDT) (envelope-from dwhite@resnet.uoregon.edu) Date: Wed, 21 Apr 1999 14:47:28 -0700 (PDT) From: Doug White To: Chip Marshall Cc: freebsd-questions@FreeBSD.ORG Subject: Re: SCSI Problems leading to reboot In-Reply-To: <19990421171058.A9007@hindenburg.eboai.org> Message-ID: MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII Sender: owner-freebsd-questions@FreeBSD.ORG Precedence: bulk X-Loop: FreeBSD.ORG On Wed, 21 Apr 1999, Chip Marshall wrote: > At my job we have a Usenet news server setup with an AMD K6 with two > AHA-2940UW's, one for the system drives and the other for the news > drives. It is currently running FreeBSD 2.2.2. The problem is that is > reboots unexpectedly at fairly random times. I managed to find one of > the reboots in the log, and it reads: > > ^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@ > ^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@ > ^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@ Is this in the log? The line above this vvvvvv one would be helpful. > Apr 19 18:59:57 mozart /kernel: SEQADDR = 0x4 SCSISEQ = 0x12 SSTAT0 = 0x5 > SSTAT1 = 0xa > Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): Queueing an Abort SCB > Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): Abort Message Sent > Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): SCB 0x5 - timed out in message > out phase, SCSISIGI == 0xa4 > Apr 19 18:59:57 mozart /kernel: SEQADDR = 0xa1 SCSISEQ = 0x12 SSTAT0 = > 0x5 SSTAT > 1 = 0x2 > Apr 19 18:59:57 mozart /kernel: ahc1: Issued Channel A Bus Reset. 3 SCBs aborted > Apr 19 18:59:57 mozart /kernel: Clearing bus reset > Apr 19 18:59:57 mozart /kernel: Clearing 'in-reset' flag > Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): no longer in timeout > Apr 19 18:59:57 mozart /kernel: sd11(ahc1:1:0): UNIT ATTENTION asc:29,1 > Apr 19 18:59:58 mozart /kernel: , retries:4 One device goes berzerk (sd12) and the system tries to clean it up. It looks like something has completely hosed the SCSI controller to a point where it won't recover. Check termination. It's best if you're arond when ths happens, so you can tell what falls over. A disk may be failing and eating the SCSI bus with it. > followed by ye olde reboote. > What exactly is the SCSI chain trying to do? And why should it cause > the system to reboot so unhappily? The system disks disappear, and that's all it can do. Doug White Internet: dwhite@resnet.uoregon.edu | FreeBSD: The Power to Serve http://gladstone.uoregon.edu/~dwhite | www.freebsd.org To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-questions" in the body of the message