Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 18 Oct 2011 06:45:37 -0700
From:      Jeremy Chadwick <freebsd@jdc.parodius.com>
To:        Daniel O'Connor <doconnor@gsoft.com.au>
Cc:        Thomas Zander <thomas.e.zander@googlemail.com>, freebsd-stable <freebsd-stable@freebsd.org>
Subject:   Re: Interpreting MCA error output
Message-ID:  <20111018134537.GA16543@icarus.home.lan>
In-Reply-To: <59140016-4530-4343-9C30-CA9B16E9EEBD@gsoft.com.au>
References:  <CAFU734y3WsVFTpnGoGfbPH4vVBnoz8f=qGvYS4c%2BLya8PFQP_A@mail.gmail.com> <20111001102327.GA37434@icarus.home.lan> <CAFU734xHMugfW%2BZcO93OPqUEhJshYn-k%2B%2B3aGmcDVvGZVQ=s%2BQ@mail.gmail.com> <20111018071941.GA9482@icarus.home.lan> <087BDDD7-0B16-4415-8D98-11FA617E03F5@gsoft.com.au> <20111018113302.GA14363@icarus.home.lan> <59140016-4530-4343-9C30-CA9B16E9EEBD@gsoft.com.au>

next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, Oct 18, 2011 at 11:31:37PM +1030, Daniel O'Connor wrote:
> 
> On 18/10/2011, at 22:03, Jeremy Chadwick wrote:
> >> It would be _super_ neat if the mcelog port installed a devd rule which emailed root@ with the human readable version of an MCA exception :)
> >> 
> >> I suspect this wouldn't be too difficult to do for a JKH.
> > 
> > Why?  Most MCEs on FreeBSD will panic the machine.  I would need to go
> > through the MCA code to see what all gets handled elegantly, but I
> > imagine there isn't much.
> 
> If you see an MCA in your log file it didn't panic your machine, I think they're not uncommon.

But that's already logged in /var/log/messages.  So I guess we're at a
standstill?

> > Also, isn't devd for device removal/insertions?  This would be using
> > devd for something it isn't intended for.  I guess I have "moral
> > objections" to it.  What you're really wanting is Solaris's fmd(1m)
> > daemon, which I believe is also tied heavily into Solaris's smf(5)
> > architecture.
> 
> I always thought devd was a fairly general event notification thing.

Possibly you should read devd(8), and after that, definitely look at
devctl(4).  The latter specifically is for devices.  MCA is not a device
(or rather, it is not implemented as a device framework on FreeBSD).

Truly what you're looking for is Solaris fmd(1m).  It's a "generic event
notification thing" that handles all sorts of events, ranging from
physical device additions/removals, to ZFS pool failures, to MCEs --
and it doesn't stop there.

Porting fmd(1m) to FreeBSD would be a wonderful, yet fairly intensive,
GSoC project.  Note that just blindly porting fmd(1m) wouldn't add all
the necessary shims in other pieces to notify fmd of an event, that
would have to be coded (added) over time.

-- 
| Jeremy Chadwick                                jdc at parodius.com |
| Parodius Networking                       http://www.parodius.com/ |
| UNIX Systems Administrator                   Mountain View, CA, US |
| Making life hard for others since 1977.               PGP 4BD6C0CB |




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20111018134537.GA16543>