From owner-freebsd-questions@FreeBSD.ORG Tue Jun 28 17:02:31 2005 Return-Path: X-Original-To: freebsd-questions@freebsd.org Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id AEA1316A41C for ; Tue, 28 Jun 2005 17:02:31 +0000 (GMT) (envelope-from gmulder@infotechfl.com) Received: from pigeon.infotechfl.com (mailrelay.infotechfl.com [209.251.147.6]) by mx1.FreeBSD.org (Postfix) with ESMTP id 6441D43D4C for ; Tue, 28 Jun 2005 17:02:31 +0000 (GMT) (envelope-from gmulder@infotechfl.com) Received: from [172.20.0.75] (gmulder.infotechfl.com [172.20.0.75]) by pigeon.infotechfl.com (8.11.6/8.11.6) with ESMTP id j5SH2U025967; Tue, 28 Jun 2005 13:02:30 -0400 Message-ID: <42C182D9.80801@infotechfl.com> Date: Tue, 28 Jun 2005 13:03:21 -0400 From: Gary Mu1der User-Agent: Mozilla Thunderbird 1.0.2 (Windows/20050317) X-Accept-Language: en-us, en MIME-Version: 1.0 To: Matt Juszczak References: <1119950428.22027.1.camel@tarkhil> <42C171AA.6090802@atopia.net> <42C173D9.3010408@Rainbow-IT.net> <42C175B6.7060702@atopia.net> <42C1777C.4000307@infotechfl.com> <20050628124701.L70927@neptune.atopia.net> In-Reply-To: <20050628124701.L70927@neptune.atopia.net> Content-Type: text/plain; charset=ISO-8859-1; format=flowed Content-Transfer-Encoding: 7bit Cc: freebsd-questions@freebsd.org Subject: Re: On recent crashes X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 28 Jun 2005 17:02:31 -0000 Matt Juszczak wrote: >> I can't reproduce the crash with a non-SMP GENERIC kernel. >> >> Gary > > So does this mean my problem is nagios? Couldn't say, but I would think not. Nagios should be running as an unprivileged user so it really shouldn't be crashing FreeBSD. However I'm fairly certain I can crash my SMP FreeBSD 5.4 server with enough ping activity over time, which indicates a bug somewhere in the network stack (maybe a race condition, given that it requires a SMP kernel). Nagios with a lot of hosts sends a lot of pings, which would imply that Nagios could eventually crash the server as well. However you said that Nagios process itself died - that would indicate some other problem w/Nagios. Gary ps. now testing FreeBSD 4.11 SMP with 130+ simultaneous ping/arp -d's (50,000 context switches a second!) and it has been rock solid for 30 minutes. 5.4 would crash within 60 seconds.