Skip site navigation (1)Skip section navigation (2)
Date:      Wed, 21 Apr 1999 14:47:28 -0700 (PDT)
From:      Doug White <dwhite@resnet.uoregon.edu>
To:        Chip Marshall <chip@jlc.net>
Cc:        freebsd-questions@FreeBSD.ORG
Subject:   Re: SCSI Problems leading to reboot
Message-ID:  <Pine.BSF.4.03.9904211445400.27954-100000@resnet.uoregon.edu>
In-Reply-To: <19990421171058.A9007@hindenburg.eboai.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, 21 Apr 1999, Chip Marshall wrote:

> At my job we have a Usenet news server setup with an AMD K6 with two
> AHA-2940UW's, one for the system drives and the other for the news
> drives. It is currently running FreeBSD 2.2.2. The problem is that is
> reboots unexpectedly at fairly random times. I managed to find one of
> the reboots in the log, and it reads:
> 
> ^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@
> ^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@
> ^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@^@

Is this in the log?

The line above this vvvvvv one would be helpful.

> Apr 19 18:59:57 mozart /kernel: SEQADDR = 0x4 SCSISEQ = 0x12 SSTAT0 = 0x5 
> 	SSTAT1 = 0xa
> Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): Queueing an Abort SCB
> Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): Abort Message Sent
> Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): SCB 0x5 - timed out in message 
> 	out phase, SCSISIGI == 0xa4
> Apr 19 18:59:57 mozart /kernel: SEQADDR = 0xa1 SCSISEQ = 0x12 SSTAT0 =
> 0x5 SSTAT
> 1 = 0x2
> Apr 19 18:59:57 mozart /kernel: ahc1: Issued Channel A Bus Reset. 3 SCBs aborted
> Apr 19 18:59:57 mozart /kernel: Clearing bus reset
> Apr 19 18:59:57 mozart /kernel: Clearing 'in-reset' flag
> Apr 19 18:59:57 mozart /kernel: sd12(ahc1:2:0): no longer in timeout
> Apr 19 18:59:57 mozart /kernel: sd11(ahc1:1:0): UNIT ATTENTION asc:29,1 
> Apr 19 18:59:58 mozart /kernel: , retries:4

One device goes berzerk (sd12) and the system tries to clean it up. It
looks like something has completely hosed the SCSI controller to a point
where it won't recover.  

Check termination.  It's best if you're arond when ths happens, so you can
tell what falls over. A disk may be failing and eating the SCSI bus with
it.

> followed by ye olde reboote.
> What exactly is the SCSI chain trying to do? And why should it cause
> the system to reboot so unhappily?

The system disks disappear, and that's all it can do.

Doug White                               
Internet:  dwhite@resnet.uoregon.edu    | FreeBSD: The Power to Serve
http://gladstone.uoregon.edu/~dwhite    | www.freebsd.org



To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-questions" in the body of the message




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?Pine.BSF.4.03.9904211445400.27954-100000>