Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 21 Jul 2003 15:44:36 -0400
From:      Chris Jackman <cjacknospamthanks@klatsch.org>
To:        freebsd-sparc64@freebsd.org
Subject:   correctable DMA error AFAR
Message-ID:  <20030721194436.GA42900@collab.or8.net>

next in thread | raw e-mail | index | archive | help


Hola senores!

Error messages:

pcib0: correctable DMA error AFAR 0x476d6140 AFSR 0x40e600003f800000
and
pcib0: correctable DMA error AFAR 0x40adbc40 AFSR 0x40c400003f800000

My e250 has locked up twice in the last few weeks with these
error messages.  The error gets repeated over and over
again on the serial console, and I can't do anything to the
box except power cycle it.  The first time it happened, I was
transferring about 10 5gig files from another machine
on the same switch.  The second time, the machine was idle.

I see the error message in /u/s/sys/sparc64/pci/psycho.c, 
in psycho_ce() at line 751.  My world and kernel are 
from ~July 10th, and I have the latest psycho.c (1.41).

How come this error is correctable, but the other functions 
around this one are all uncorrectable?  Perhaps this function 
should also panic, since my machine is unusable when this 
error occurs.  

Also, is there a way to send a break over the serial console? 
I can send it with cu using ~#, but the e250 doesn't respond
to it.  My guess is that it is a solaris feature to catch
the break signal, and drop to the Openboot Firmware.  

I'll hook up a keyboard to this machine with boot.conf
settings to get the console output to the serial port, 
and if the error happens again I'll try ctrl-alt-escape 
on the keyboard to try and get to the debugger.


Thanks!



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20030721194436.GA42900>