From owner-freebsd-net@FreeBSD.ORG Thu Nov 1 08:38:05 2007 Return-Path: Delivered-To: freebsd-net@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 91F6416A420; Thu, 1 Nov 2007 08:38:05 +0000 (UTC) (envelope-from wawa@yandex-team.ru) Received: from relanium.yandex.ru (relanium.yandex.ru [213.180.193.88]) by mx1.freebsd.org (Postfix) with ESMTP id 000BC13C4AA; Thu, 1 Nov 2007 08:38:04 +0000 (UTC) (envelope-from wawa@yandex-team.ru) Received: from [87.250.227.221] (v3-227-221.yandex.net [87.250.227.221]) by relanium.yandex.ru (8.14.1/8.14.1) with ESMTP id lA18aaLP040756 (version=TLSv1/SSLv3 cipher=DHE-RSA-AES256-SHA bits=256 verify=NO); Thu, 1 Nov 2007 11:36:37 +0300 (MSK) (envelope-from wawa@yandex-team.ru) Message-ID: <47299014.6020207@yandex-team.ru> Date: Thu, 01 Nov 2007 11:36:36 +0300 From: Vladimir Ivanov Organization: Yandex LLC User-Agent: Thunderbird 1.5.0.14pre (X11/20071023) MIME-Version: 1.0 To: Jack Vogel References: <2a41acea0710310935u6ed33491pcee4c6bd57d12d1a@mail.gmail.com> <4728AFCC.7020706@samsco.org> <47291716.1030904@yandex-team.ru> <2a41acea0710311728n69b5669fxb14fd382e3e072d4@mail.gmail.com> In-Reply-To: <2a41acea0710311728n69b5669fxb14fd382e3e072d4@mail.gmail.com> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit X-Antivirus: Dr.Web (R) for Mail Servers on relanium.yandex.ru host X-Antivirus-Code: 100000 Cc: "freebsd-net@freebsd.org" , Scott Long , FreeBSD Current , FreeBSD Stable List Subject: Re: Proposed #ifdef change to em X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 01 Nov 2007 08:38:05 -0000 Hi, Jack Vogel wrote: > Vladimir, > > Your one phrase "more or less patched" invalidated the whole > data point. We are talking about code thats checked in and bound > for 6.3 :) Oops. I've got it. Maybe we talk about different kinds of watchdog. I have meant TX queue watchdogs. Yes, there is a problem with system watchdog in mainstream driver. Sometimes system stops to respond due to kernel activity for a one minute or less. Hardware watchdog can reset system this time. This issue is specific to taskq (fastintr) version of driver The fix is very simple: we've to schedule less priority to RX thread. We use PRI_MAX_KERN instead of PI_NET in Yandex' revision of driver. > > I have hundreds of machines here at Intel that DON'T have the > problem, that's why in early 20th century philosophy they realized > that verification as scientific method was ineffective, falsification > on the other hand is powerful. So if any users out there have > a problem I am trying to understand why. The only way that I > have so far reproduced something like their failure is when > FAST interrupts are enabled, THEN when I disable them on that > same machine the problem disappears. Right now I have still > not figured out why this is, I'm trying to do that as I write this. > > I am also not saying that nothing ever caused a watchdog > before FAST handling, only that as best that I can tell right now > the one repro I have on STABLE, October Snapshot, is related to it. > > Regards, > > Jack > WBR,Vladimir