Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 29 Jun 2010 09:19:46 +1000
From:      Antony Mawer <lists@mawer.org>
To:        John Baldwin <jhb@freebsd.org>
Cc:        Ireneusz Pluta <ipluta@wp.pl>, freebsd-hardware@freebsd.org
Subject:   Re: System hangs during heavy sequential write to mfi device
Message-ID:  <AANLkTilKthSDRM4cee11rBIbT8MQcWogSBkeoiWdrMsi@mail.gmail.com>
In-Reply-To: <201006281409.23546.jhb@freebsd.org>
References:  <4C2499B5.3030404@wp.pl> <201006281326.08896.jhb@freebsd.org> <4C28E287.5010103@wp.pl> <201006281409.23546.jhb@freebsd.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Tue, Jun 29, 2010 at 4:09 AM, John Baldwin <jhb@freebsd.org> wrote:
> On Monday 28 June 2010 1:57:27 pm Ireneusz Pluta wrote:
>> John Baldwin pisze:
>> > On Monday 28 June 2010 12:00:06 pm Ireneusz Pluta wrote:
>> >
>> >> John Baldwin pisze:
>> >>
>> >>> On Friday 25 June 2010 4:59:57 pm Ireneusz Pluta wrote:
>> >>>
>> >>>
>> >>>> John Baldwin pisze:
>> >>>>
>> >>>>
>> >>>>> Hmmm. =A0You might have a hardware issue. =A0OTOH, you can try see=
ing if
> you
>> >>>>>
>> > have
>> >
>> >>>>> a BIOS option to disable PCIE error logging.
>> >>>>>
>> >>>>>
>> >>>> is it one of them?:
>> >>>>
>> >>>> Assert NMI on SERR
>> >>>> Assert NMI on PERR
>> >>>>
>> >>>> (pdf page 109 of: ->
>> >>>>
>> >>>>
>> >
> http://download.intel.com/support/motherboards/server/s5520hc/sb/e3952901=
3_s5520hc_s5500hcv_s5520hct_tps_r1_9.pdf)
>> >
>> >>>>
>> >>>>
>> >>> Well, that will turn off the NMIs. =A0Not sure if it will affect the=
 event
>> >>> logging, but it is worth a shot.
>> >>>
>> >>>
>> >> Per BIOS setup documentation:
>> >>
>> >> On SERR, generate an NMI and log an error.
>> >> Note: [Enabled] must be selected for the Assert NMI
>> >> on PERR setup option to be visible.
>> >>
>> >> and:
>> >>
>> >> On PERR, generate an NMI and log an error.
>> >> Note: This option is only active if the Assert NMI on
>> >> SERR option is [Enabled] selected.
>> >>
>> >> However, disabling them did not change anything.
>> >>
>> >
>> > Is it still logging errors and sending NMIs with them disabled?
>> >
>> with the options I mentioned disabled. They do not have to be the only
>> sources of NMIs, do they?
>
> Well, they should be the sources of the log messages you found in your sy=
stem
> event log. =A0There is a good chance that you have some broken hardware
> somewhere, I'm not sure how easy it is for you to debug that via swapping=
 out
> components, but the RAID controller is the first thing I would try.

You might want to try a BIOS update to see if it resolves these
problems ... I seem to remember some mention of these sorts of errors
in the change log of one of the recent Intel server board BIOSes.

-- Antony



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?AANLkTilKthSDRM4cee11rBIbT8MQcWogSBkeoiWdrMsi>