From owner-freebsd-questions  Wed Apr 21 14:50: 6 1999
Delivered-To: freebsd-questions@freebsd.org
Received: from resnet.uoregon.edu (resnet.uoregon.edu [128.223.144.32])
	by hub.freebsd.org (Postfix) with ESMTP id 90F7414EE3
	for <freebsd-questions@FreeBSD.ORG>; Wed, 21 Apr 1999 14:49:59 -0700 (PDT)
	(envelope-from dwhite@resnet.uoregon.edu)
Received: from localhost (dwhite@localhost)
          by resnet.uoregon.edu (8.8.8/8.8.8) with ESMTP id OAA23142;
          Wed, 21 Apr 1999 14:47:29 -0700 (PDT)
          (envelope-from dwhite@resnet.uoregon.edu)
Date: Wed, 21 Apr 1999 14:47:28 -0700 (PDT)
From: Doug White <dwhite@resnet.uoregon.edu>
To: Chip Marshall <chip@jlc.net>
Cc: freebsd-questions@FreeBSD.ORG
Subject: Re: SCSI Problems leading to reboot
In-Reply-To: <19990421171058.A9007@hindenburg.eboai.org>
Message-ID: <Pine.BSF.4.03.9904211445400.27954-100000@resnet.uoregon.edu>
MIME-Version: 1.0
Content-Type: TEXT/PLAIN; charset=US-ASCII
Sender: owner-freebsd-questions@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.ORG

On Wed, 21 Apr 1999, Chip Marshall wrote:

> At my job we have a Usenet news server set up with an AMD K6 and two
> AHA-2940UW's, one for the system drives and the other for the news
> drives. It is currently running FreeBSD 2.2.2. The problem is that it
> reboots unexpectedly at fairly random times. I managed to find one of
> the reboots in the log, and it reads:
> 
> ^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@
> ^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@
> ^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@

Is this in the log?

The log line just above this one (vvvvvv) would be helpful.

> Apr 19 18:59:57 mozart /kernel: SEQADDR = 0x4 SCSISEQ = 0x12 SSTAT0 = 0x5 
> 	SSTAT1 = 0xa
> Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): Queueing an Abort SCB
> Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): Abort Message Sent
> Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): SCB 0x5 - timed out in message 
> 	out phase, SCSISIGI == 0xa4
> Apr 19 18:59:57 mozart /kernel: SEQADDR = 0xa1 SCSISEQ = 0x12 SSTAT0 = 0x5
> 	SSTAT1 = 0x2
> Apr 19 18:59:57 mozart /kernel: ahc1: Issued Channel A Bus Reset. 3 SCBs aborted
> Apr 19 18:59:57 mozart /kernel: Clearing bus reset
> Apr 19 18:59:57 mozart /kernel: Clearing 'in-reset' flag
> Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): no longer in timeout
> Apr 19 18:59:57 mozart /kernel: sd11(ahc1:1:0): UNIT ATTENTION asc:29,1 
> Apr 19 18:59:58 mozart /kernel: , retries:4

One device goes berserk (sd12) and the system tries to clean it up. It
looks like something has completely hosed the SCSI controller to the point
where it won't recover.

Check termination.  It's best if you're around when this happens, so you can
tell what falls over. A disk may be failing and taking the SCSI bus down
with it.
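
If nobody can be around when it happens, tallying the kernel messages per
device may at least hint at which disk keeps falling over. Below is a rough
sketch in Python, not a tested tool. It assumes the log lines look like the
ones quoted above ("sd12(ahc1:2:0): ..."); the function name count_trouble
and the list of "trouble" phrases are made up for illustration.

```python
import re
from collections import Counter

# Matches device tags such as "sd12(ahc1:2:0):" as seen in the quoted log.
DEVICE_TAG = re.compile(r"\b(sd\d+)\((ahc\d+):(\d+):(\d+)\):\s*(.*)")

# Phrases treated as signs of trouble. This list is an assumption based
# only on the messages quoted in this thread.
TROUBLE = ("timed out", "Abort", "UNIT ATTENTION")


def count_trouble(lines):
    """Return a Counter mapping (device, controller) to trouble-message count."""
    counts = Counter()
    for line in lines:
        # Drop NUL bytes (shown as ^@) that can end up in the log after a crash.
        line = line.replace("\x00", "")
        m = DEVICE_TAG.search(line)
        if m and any(phrase in m.group(5) for phrase in TROUBLE):
            counts[(m.group(1), m.group(2))] += 1
    return counts
```

To try it, pass in the lines of the log file, for example
count_trouble(open("/var/log/messages", errors="replace")), and look at
which (device, controller) pair has the highest count. A high count only
says where the errors were reported, not which device caused them.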

> followed by ye olde reboote.
> What exactly is the SCSI chain trying to do? And why should it cause
> the system to reboot so unhappily?

The system disks disappear out from under the kernel, and at that point
rebooting is all it can do.

Doug White                               
Internet:  dwhite@resnet.uoregon.edu    | FreeBSD: The Power to Serve
http://gladstone.uoregon.edu/~dwhite    | www.freebsd.org

