Date: Thu, 26 Jul 2007 12:11:49 +0100 (BST)
From: Robert Watson <rwatson@FreeBSD.org>
To: Kris Kennaway <kris@obsecurity.org>
Cc: arch@FreeBSD.org, Anders Nordby <anders@FreeBSD.org>, jkim@FreeBSD.org
Subject: Re: Removing NET_NEEDS_GIANT: first patch
Message-ID: <20070726120713.Q15979@fledge.watson.org>
In-Reply-To: <20070726105358.GA43979@rot26.obsecurity.org>
References: <20070724110908.T83919@fledge.watson.org> <20070726102328.GA12293@fupp.net> <20070726105358.GA43979@rot26.obsecurity.org>

On Thu, 26 Jul 2007, Kris Kennaway wrote:
>> I've used and still use debug.mpsafenet to get rid of watchdog timeout
>> problems on a lot of HP Proliant servers, particularly with the bge driver:
>>
>> Dec 21 06:42:51 videovm1 kernel: bge0: watchdog timeout -- resetting
>> Dec 21 06:42:51 videovm1 kernel: bge0: link state changed to DOWN
>> Dec 21 06:42:54 videovm1 kernel: bge0: link state changed to UP
>>
>> This problem goes away with debug.mpsafenet="0", for me.
>>
>> I can try to turn off this setting, and see how it goes. I remember there
>> was something one could do, to get more information about the watchdog
>> error, but can't remember what.
>
> Please do. There is no sense in crippling your network for the sake of an
> unre{solved,ported} driver bug.

I agree with what Kris said, only more so. :-) Masking the problem with
debug.mpsafenet means the underlying bug isn't getting fixed and instead keeps
going out in releases. This sounds most likely like a driver bug, although I
wouldn't rule out some sort of interrupt problem. It looks like jkim might be
someone to talk to about this (CC'd).

Robert N M Watson
Computer Laboratory
University of Cambridge
