From owner-freebsd-stable@FreeBSD.ORG Sun Oct 8 21:11:22 2006 Return-Path: X-Original-To: freebsd-stable@freebsd.org Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 632F016A407 for ; Sun, 8 Oct 2006 21:11:22 +0000 (UTC) (envelope-from mime@traveller.cz) Received: from ss.eunet.cz (ss.eunet.cz [193.85.228.13]) by mx1.FreeBSD.org (Postfix) with ESMTP id E61A943D45 for ; Sun, 8 Oct 2006 21:11:21 +0000 (GMT) (envelope-from mime@traveller.cz) Received: from localhost.i.cz (ss.eunet.cz [193.85.228.13]) by ss.eunet.cz (8.13.6/8.13.6) with ESMTP id k98LBFZH046323 (version=TLSv1/SSLv3 cipher=RC4-MD5 bits=128 verify=NO); Sun, 8 Oct 2006 23:11:17 +0200 (CEST) (envelope-from mime@traveller.cz) From: Michal Mertl To: Kris Kennaway In-Reply-To: <20061006182620.GB16605@xor.obsecurity.org> References: <45244053.6030706@samsco.org> <20061005200552.GA80162@xor.obsecurity.org> <1160117675.10606.17.camel@genius.i.cz> <20061006182620.GB16605@xor.obsecurity.org> Content-Type: text/plain Date: Sun, 08 Oct 2006 23:11:12 +0200 Message-Id: <1160341872.93717.0.camel@genius.i.cz> Mime-Version: 1.0 X-Mailer: Evolution 2.6.3 FreeBSD GNOME Team Port Content-Transfer-Encoding: 7bit Cc: freebsd-stable@freebsd.org Subject: Re: em, bge, network problems survey. X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 08 Oct 2006 21:11:22 -0000 Kris Kennaway wrote: > On Fri, Oct 06, 2006 at 08:54:35AM +0200, Michal Mertl wrote: > > Kris Kennaway wrote: > > > On Wed, Oct 04, 2006 at 05:14:27PM -0600, Scott Long wrote: > > > > All, > > > > > > > > I'm seeing some patterns here with all of the network driver problem > > > > reports, but I need more information to help narrow it down further. > > > > I ask all of you who are having problems to take a minute to fill > > > > out this survey and return it to Kris Kennaway (on cc:) and myself. > > > > Thanks. > > > > > > > > 1. Are you experiencing network hangs and/or "timeout" messages on the > > > > console? If yes, please provide a _brief_ description of the problem. > > > > > > OK, next question, to all em users: > > > > > > If your em device is using a shared interrupt, and you are NOT > > > experiencing timeout problems when using this device, please let me > > > know. > > > > I haven't seen any timeout message in long time but I experience frozen > > network (and also the already reported panic when doing ifconfig down/up > > then). > > Are these details in a PR? I don't know. As I have seen somebody else reporting the same issue (even with backtrace) and the problem was believed to be understood I dind't pay much attention, sorry. > > > I have also seen strange problem which may be completely unrelated: When > > doing 'find . -ls' on SMB mounted drive - find was spitting the contents > > of the drive but never finishes. Network seemed dead but when I > > interrupted find with Ctrl-C I got the replies to the pings sent when it > > was running (e.g. thousands ms) - this looks like something was > > preventing RX to work and the packets were just queued somewhere. I > > belive I should be able to easily reproduce it. > > > > genius# vmstat -i > > interrupt total rate > > irq0: clk 43784465 1000 > > irq1: atkbd0 66248 1 > > irq5: pcm0 5877 0 > > irq8: rtc 5603682 128 > > irq9: acpi0 8820 0 > > irq11: fwohci0 em* 205749 4 > > irq12: psm0 586848 13 > > irq14: ata0 340844 7 > > irq15: ata1 61 0 > > Total 50602594 1155 > > > > I don't think I remember debug.mpsafenet tunable being mentioned in the > > threads about the problems. It prevents all the problems on my system > > (UP non-APIC system), including the SMB issue mentioned above. > > I suspect both of your problems are some unrelated issue. I'd need > root access & a test setup before I can say more though. It is possible, but your patch to change INTR_FAST to INTR_MPSAFE seems to help with both of these too. I am planning to try to reproduce the SMB temporary network lockup (with original CURRENT driver) and probably do a panic then to get a core. Michal