Date: Thu, 01 Nov 2007 11:36:36 +0300 From: Vladimir Ivanov <wawa@yandex-team.ru> To: Jack Vogel <jfvogel@gmail.com> Cc: "freebsd-net@freebsd.org" <freebsd-net@freebsd.org>, FreeBSD Current <freebsd-current@freebsd.org>, FreeBSD Stable List <freebsd-stable@freebsd.org> Subject: Re: Proposed #ifdef change to em Message-ID: <47299014.6020207@yandex-team.ru> In-Reply-To: <2a41acea0710311728n69b5669fxb14fd382e3e072d4@mail.gmail.com> References: <2a41acea0710310935u6ed33491pcee4c6bd57d12d1a@mail.gmail.com> <4728AFCC.7020706@samsco.org> <47291716.1030904@yandex-team.ru> <2a41acea0710311728n69b5669fxb14fd382e3e072d4@mail.gmail.com>
next in thread | previous in thread | raw e-mail | index | archive | help
Hi, Jack Vogel wrote: > Vladimir, > > Your one phrase "more or less patched" invalidated the whole > data point. We are talking about code thats checked in and bound > for 6.3 :) Oops. I've got it. Maybe we talk about different kinds of watchdog. I have meant TX queue watchdogs. Yes, there is a problem with system watchdog in mainstream driver. Sometimes system stops to respond due to kernel activity for a one minute or less. Hardware watchdog can reset system this time. This issue is specific to taskq (fastintr) version of driver The fix is very simple: we've to schedule less priority to RX thread. We use PRI_MAX_KERN instead of PI_NET in Yandex' revision of driver. > > I have hundreds of machines here at Intel that DON'T have the > problem, that's why in early 20th century philosophy they realized > that verification as scientific method was ineffective, falsification > on the other hand is powerful. So if any users out there have > a problem I am trying to understand why. The only way that I > have so far reproduced something like their failure is when > FAST interrupts are enabled, THEN when I disable them on that > same machine the problem disappears. Right now I have still > not figured out why this is, I'm trying to do that as I write this. > > I am also not saying that nothing ever caused a watchdog > before FAST handling, only that as best that I can tell right now > the one repro I have on STABLE, October Snapshot, is related to it. > > Regards, > > Jack > WBR,Vladimir
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?47299014.6020207>