Date: Wed, 8 Oct 1997 22:55:52 +0200 From: Stefan Esser <se@FreeBSD.ORG> To: Philippe Regnauld <regnauld@deepo.prosa.dk> Cc: freebsd-scsi@FreeBSD.ORG Subject: Re: 2.2.2 anc NCR875 failures Message-ID: <19971008225552.49139@mi.uni-koeln.de> In-Reply-To: <19971008113725.46245@deepo.prosa.dk>; from Philippe Regnauld on Wed, Oct 08, 1997 at 11:37:25AM %2B0200 References: <19971008113725.46245@deepo.prosa.dk>
next in thread | previous in thread | raw e-mail | index | archive | help
On 1997-10-08 11:37 +0200, Philippe Regnauld <regnauld@deepo.prosa.dk> wrote:
> I just got (a week ago) a new machine to run a keyserver on...
> The configuration is
>
> TX97/K6-180, NCR-875, 64MB RAM, 2 x 2.2 Atlas II UW disks.
>
> I've had the following failure three times so far, I would
> guess during some fair amount of disk i/o: (written on paper,
> I'm trying to reread myself):
>
>
> ncr0: ERROR (81:0) 8af80 (10/1b) @24:00000000
The NCR is failing on one the first instructions, and
the error code indicates that an illegal instruction
has been fetched. This was most probably caused by a
jump to the immediate operand of an instruction:
/*--------------------------< START >-----------------------*/ {
/*
** Claim to be still alive ...
*/
SCR_COPY (sizeof (((struct ncb *)0)->heartbeat)),
KVAR (KVAR_TIME_TV_SEC),
NADDR (heartbeat),
/*
** Make data structure address invalid.
** clear SIGP.
*/
SCR_LOAD_REG (dsa, 0xff),
0,
SCR_FROM_REG (ctest2),
===>>> 0,
The NCR processor tried to execute that constant 0, and
it was not recognized as a valid instruction ...
Hmmm, the (10/1b) in the error message indicate, that
synchronous transfers have been negotiated (the offset
is set to 0x10 == 16 bytes), but the clock pre-scaler
(0x1b) is not set correctly for the 53c875, it appears!
But I don't understand, how you can possibly complete a
single SCSI transfer, at twice the correct clock rate.
You did not tell, which version of the NCR driver (and
FreeBSD) that is. The pre-scaler may be correct, if you
are running the NCR driver as of FreeBSD-2.2.2 and if
the 53c875 is revision 2 or newer.
> In the two other cases, I had some other message, every
> 30 sec. or so, like "retrying block = xxxyyy". No crash,
> no reboot...
Hmmm, there is no such message anywhere in the NCR
driver.
> I had to go and manually reset the machine (off-site!) every
> time.
Sorry to hear that ...
> I tried reducing TAG number in ncrcontrol -- nada.
No, your problem is different from the QUEUE FULL
situation others are suffering from. But that may
still hurt you, if you got revision LXY4 firmware
in your Atlas II drives ...
> Help ?
Please let me know, what version of FreeBSD and the
NCR driver you are using. Booting with "-v -v" will
enable extra verbose boot message, and there will be
more information on the NCR initalization. I'd like
to know those messages.
I'm very sorry for the inconvenience. I'll try to help
you get this problem solved as quickly as possible, but
it does look like a hardware problem to me, currently.
But it may also be because of the timing loop used to
measure the NCR 875 clock frequency, which may fail on
your particular hardware, for as of now unknown reasons.
Regards, STefan
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?19971008225552.49139>
