From owner-freebsd-questions@FreeBSD.ORG Wed Oct 8 06:30:21 2008 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9A0B1106568B for ; Wed, 8 Oct 2008 06:30:21 +0000 (UTC) (envelope-from mister.olli@googlemail.com) Received: from fg-out-1718.google.com (fg-out-1718.google.com [72.14.220.156]) by mx1.freebsd.org (Postfix) with ESMTP id 1EEE08FC19 for ; Wed, 8 Oct 2008 06:30:20 +0000 (UTC) (envelope-from mister.olli@googlemail.com) Received: by fg-out-1718.google.com with SMTP id l26so2482632fgb.35 for ; Tue, 07 Oct 2008 23:30:19 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=googlemail.com; s=gamma; h=domainkey-signature:received:received:subject:from:reply-to:to:cc :in-reply-to:references:content-type:date:message-id:mime-version :x-mailer:content-transfer-encoding; bh=MPwMO6fLxE7TD/LgtL+M1mHC6ly+oy+0q604W7Jc8i4=; b=ZfW2Tl1JxaCJbD+HHoZUJIJGKHvZQTMXd2NkBUS+wQIl4smAXDnZGCsGICPQP/hOVJ xy4ZpYsFiu2BFsxJCfdt7wOXk02xSwoAllNF/SjAIeyiV1Wo9FHNnpU6v6tqk7eX6NOS +6F/qMjV73MUYBCVPevXtRu4iaxy/j4u0WPUk= DomainKey-Signature: a=rsa-sha1; c=nofws; d=googlemail.com; s=gamma; h=subject:from:reply-to:to:cc:in-reply-to:references:content-type :date:message-id:mime-version:x-mailer:content-transfer-encoding; b=qBCoRMEsGmNoUqJI8JFrCF6+MYV4KQdI7W35JXEwb5DwOB5zdY6UnpqlnceZnPbi8N GOOsLUgPC9osk+APkZONmzsJlAAwMGzmKS7AJzgtmdRoq1pFQBNm0tSdKpak/39GfDGp feLudDGxL7IaIUf/VZyh/0cRP0r1miJCokStU= Received: by 10.103.249.19 with SMTP id b19mr4173593mus.50.1223447419577; Tue, 07 Oct 2008 23:30:19 -0700 (PDT) Received: from ?10.141.4.16? ([80.187.211.61]) by mx.google.com with ESMTPS id u9sm26459808muf.9.2008.10.07.23.30.17 (version=SSLv3 cipher=RC4-MD5); Tue, 07 Oct 2008 23:30:18 -0700 (PDT) From: Mister Olli To: Jerry McAllister In-Reply-To: <20081006174502.GB71024@gizmo.acns.msu.edu> References: <1223273047.23248.25.camel@phoenix.blechhirn.net> <20081006171809.GA26368@icarus.home.lan> <20081006174502.GB71024@gizmo.acns.msu.edu> Content-Type: text/plain Date: Wed, 08 Oct 2008 08:30:12 +0200 Message-Id: <1223447412.5896.9.camel@phoenix.blechhirn.net> Mime-Version: 1.0 X-Mailer: Evolution 2.22.3.1 Content-Transfer-Encoding: 7bit Cc: Jeremy Chadwick , freebsd-questions@freebsd.org Subject: Re: analyzing freebsd core dumps X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: mister.olli@googlemail.com List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 08 Oct 2008 06:30:21 -0000 hi... thanks for the feedback on this topic. the first step to clean the machine and check all connectors has been done yesterday. I hope that this will fix the problem, and that it's not some kind of hardware failure. to run tests with memtest is quite a problem, since the machine has high availability requirements. to take it off for nearly one hour for cleaning and checking during daily work of our company was a pain. 6 hours or more of RAM tests is not possible. is there some other way to detect hardware failure with less time consuming tool/ process? greetz olli Am Montag, den 06.10.2008, 13:45 -0400 schrieb Jerry McAllister: > On Mon, Oct 06, 2008 at 10:18:09AM -0700, Jeremy Chadwick wrote: > > > On Mon, Oct 06, 2008 at 08:04:07AM +0200, Mister Olli wrote: > > > hi list... > > > > > > I have a freebsd maschine running for more 6 months without any > > > problems. > > > the machine's only service is to be an openvpn gateway for a hand of > > > users. > > > > > > 2 weeks ago the first problems started. the openvpn exited with signal > > > 11 and 4 and core dumps were written. > > > > > > the same happend yesterday with the postfix/cleanup process, and the > > > suddenly the machine rebooted without any further log messages. > > > > > > what is the best way to troubleshoot the cause of this problem? > > > > Signal 11 happening "out of no where" on machines which have been > > running fine, most of the time, is a sign of hardware failure (usually > > RAM, but sometimes motherboard or PSU). The fact you got a reboot is > > also further evidence of this. > > > > http://www.freebsd.org/doc/en/books/faq/troubleshoot.html#SIGNAL11 > > > > I would recommend taking the machine offline and running something like > > memtest86+ on it for 6-7 hours. Any errors seen are a pretty good sign > > that you should replace the memory or the motherboard. You can > > download an ISO or floppy disk images here: > > > > http://www.memtest.org/ > > > > Bottom line is that this is probably a hardware issue. > > Could also be a contacts if it is not the actual memory or board. > A marginal contact where something is plugged in can over time > build up deposits that make it fail. Of course, this is still > a hardware problem, but can often be cured by reseating everything. > If it is bad enough, it could also be exacerbated by reseating > everything. > > ////jerry > > > > > -- > > | Jeremy Chadwick jdc at parodius.com | > > | Parodius Networking http://www.parodius.com/ | > > | UNIX Systems Administrator Mountain View, CA, USA | > > | Making life hard for others since 1977. PGP: 4BD6C0CB | > > > > _______________________________________________ > > freebsd-questions@freebsd.org mailing list > > http://lists.freebsd.org/mailman/listinfo/freebsd-questions > > To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org"