From owner-freebsd-stable Mon May 20 23:45:41 2002 Delivered-To: freebsd-stable@freebsd.org Received: from tinker.exit.com (tinker.exit.com [206.223.0.1]) by hub.freebsd.org (Postfix) with ESMTP id 381AC37B408 for ; Mon, 20 May 2002 23:45:36 -0700 (PDT) Received: from realtime.exit.com (realtime [206.223.0.5]) by tinker.exit.com (8.12.3/8.12.3) with ESMTP id g4L6i7vn003410; Mon, 20 May 2002 23:44:07 -0700 (PDT) (envelope-from frank@exit.com) Received: from realtime.exit.com (localhost [127.0.0.1]) by realtime.exit.com (8.12.3/8.12.2) with ESMTP id g4L6i7cp018388; Mon, 20 May 2002 23:44:07 -0700 (PDT) (envelope-from frank@realtime.exit.com) Received: (from frank@localhost) by realtime.exit.com (8.12.3/8.12.3/Submit) id g4L6i4HK018378; Mon, 20 May 2002 23:44:04 -0700 (PDT) From: Frank Mayhar Message-Id: <200205210644.g4L6i4HK018378@realtime.exit.com> Subject: Re: 4.6-RC system hangs (fxp0, smp, sym) In-Reply-To: <3CE96053.23367D00@alogis.com> To: Holger Kipp Date: Mon, 20 May 2002 23:44:04 -0700 (PDT) Cc: Pete French , stable@FreeBSD.ORG, Maildrop Reply-To: frank@exit.com Organization: Exit Consulting X-Copyright0: Copyright 2002 Frank Mayhar. All Rights Reserved. X-Copyright1: Permission granted for electronic reproduction as Usenet News or email only. X-Mailer: ELM [version 2.4ME+ PL95a (25)] MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Content-Type: text/plain; charset=US-ASCII Sender: owner-freebsd-stable@FreeBSD.ORG Precedence: bulk List-ID: List-Archive: (Web Archive) List-Help: (List Instructions) List-Subscribe: List-Unsubscribe: X-Loop: FreeBSD.ORG Holger Kipp wrote: > System hangs after some traffic on a 10Mbit link. Hang can be triggered > by running 'ping -f' against a stupid Win98-System. Hang occurs after > 50.000 to 1.400.000 packets. I'll have to investigate further to see if > this depends on other system activity... I think that heavy network traffic is also implicated in my hangs. It mostly happens at night, which is when I beat on it the hardest. No messages, no "buffer is full" errors, in fact the system is completely wedged hard, it is completely unresponsive. At some point, though, it decides to start running again. > SMP Buildworld is very stable, so no bad memory or anything. Yeah, I see no evidence of hardware trouble here, either. Given that so many of us are seeing this, it's virtually certain to be an OS error. It's strange that it's cropping up _now_, though. I don't see any recent commits in either the sym or fxp drivers, much less something that might explain this. Hmm. > After copying 141 MB from one disk to another several times (with NCR, > not SYM, I have to admit), I still got no error with disk IO. Have you > tried using NCR instead? I don't know if your chipset is supported by > NCR device driver, though. Yeah, it is; I may try that when I have a chance. > Bus reset only happens after I take the offending interface down > with 'ifconfig fxp0 down'. Using SYM driver, this might take a minute > or two. With NCR, the system unhangs almost instantly... I'm unable to do _anything_ during a hang. I usually run X and it is wedged solid. -- Frank Mayhar frank@exit.com http://www.exit.com/ Exit Consulting http://www.gpsclock.com/ To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-stable" in the body of the message