From owner-freebsd-questions@FreeBSD.ORG Sun Sep 11 09:55:45 2005 Return-Path: X-Original-To: freebsd-questions@freebsd.org Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id A1EFE16A41F for ; Sun, 11 Sep 2005 09:55:45 +0000 (GMT) (envelope-from bsd@bathnetworks.com) Received: from lmail.bathnetworks.co.uk (mail.bathnetworks.com [84.92.24.252]) by mx1.FreeBSD.org (Postfix) with ESMTP id C2D4143D46 for ; Sun, 11 Sep 2005 09:55:44 +0000 (GMT) (envelope-from bsd@bathnetworks.com) Received: (qmail 2630 invoked by uid 510); 11 Sep 2005 09:56:18 +0100 Received: from 84.92.24.252 by lmail.bathnetworks.co.uk (envelope-from , uid 508) with qmail-scanner-1.24-st-qms (clamdscan: 0.86.2/1032. spamassassin: 3.0.2. perlscan: 1.24-st-qms. Clear:RC:0(84.92.24.252):SA:0(-2.9/5.0):. Processed in 0.375004 secs); 11 Sep 2005 08:56:18 -0000 X-Spam-Status: No, hits=-2.9 required=5.0 X-Antivirus-MYDOMAIN-Mail-From: bsd@bathnetworks.com via lmail.bathnetworks.co.uk X-Antivirus-MYDOMAIN: 1.24-st-qms (Clear:RC:0(84.92.24.252):SA:0(-2.9/5.0):. Processed in 0.375004 secs Process 2623) Received: from mail.bathnetworks.com (HELO ?84.92.24.252?) (bsd@bathnetworks.com@84.92.24.252) by lmail.bathnetworks.co.uk with SMTP; 11 Sep 2005 09:56:18 +0100 From: Robert Slade To: "freebsd-questions@freebsd.org" Content-Type: text/plain Message-Id: <1126428977.1803.48.camel@lmail.bathnetworks.co.uk> Mime-Version: 1.0 X-Mailer: Ximian Evolution 1.4.6 (1.4.6-2) Date: Sun, 11 Sep 2005 09:56:17 +0100 Content-Transfer-Encoding: 7bit Subject: PL 5000 Random Reboots X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 11 Sep 2005 09:55:45 -0000 Hi, I have been having problems with random reboots when using the SMP Kernel on the above Quad processor machine. It occurs with bot 5.4 and 6.0B4. It does not happen with a single processor kernel, well not in the last 12 hours. Searching the mailing list and google has thrown up a number of possible causes: a. there was an issue with the SMP kernel under load in 5.3 but the errata says the fix was incorporated in 5.4; b. there appears to be some problems with with the ACPI with some machines. Unfortunately, the SMP kernel needs ACPI so turning it off is not an option. c. there appears to be an issue with some network cards and USB with interupt 'storms'. The machine has 2 network cards both identified in dmesg as Compaq Netelligent 10/100, athough only one is in use. I am getting tl0: tx underrun -- increasing tx threshold to 512 bytes when using vncserver to run xwindows with KDE which maybe related. It looks like the most likely cause is ACPI, SMP or both, but as ACPI relates to interups, the interup 'storm' issue could be impacting. However I do not know enough about the workings of ACPI etc to be able to judge this. It looks to me that there is a number of things I could do to isolate the problem: 1. try building a kernel without SMP but with ACPI to see what effect that has. Question is is this possible with 6.0B4? 2. if the above gives a problem try changing the network cards for another type. 3. fix the underrun issue but I am not sure how to do that. I would be grateful for any comments ideas on the way forward as the machine is quite slow without the SMP kernel. Thanks Rob