From owner-freebsd-proliant@FreeBSD.ORG Wed Jun 18 16:41:06 2008
Date: Wed, 18 Jun 2008 12:29:00 -0400
From: BarryFriedman@www1.emax.ca
To: freebsd-proliant@freebsd.org
Cc: bfriedman@emax.ca
Subject: Server reboots at random :-(
Message-ID: <20080618162900.GA70649@emax.ca>
List-Id: "Technical discussion of FreeBSD on HP ProLiant server platforms."

I am running a DL380 G4 with an unmodified FreeBSD 7.0-RELEASE kernel (no
customization), with all firmware up to date. I had it on my local network
for a month and it seemed to be stable. However, when it was moved into a
hosting datacenter and connected to the internet, I noticed that it was
rebooting at random once or more a day. The machine is connected to two
different UPSs, so it should not be losing power.

The iLO event log shows a series of "Server reset" and "Server power restored"
messages, with no other significant events logged. The system logs do not
show any cause for the reboots other than "WARNING: / was not properly
dismounted" and similar messages, which would seem to indicate that a hard
reset has occurred. Power regulator mode in the iLO is disabled.

Any ideas about how to troubleshoot this would be greatly appreciated.

Thanks,
--
Barry Friedman
Emax Computer Systems Inc., Ottawa, Ont. Canada K1Z 5N9

From owner-freebsd-proliant@FreeBSD.ORG Wed Jun 18 21:54:43 2008
Date: Thu, 19 Jun 2008 07:29:41 +1000
From: Edwin Groothuis
To: BarryFriedman@www1.emax.ca
Message-ID: <20080618212941.GM89661@k7.mavetju>
References: <20080618162900.GA70649@emax.ca>
In-Reply-To: <20080618162900.GA70649@emax.ca>
Cc: bfriedman@emax.ca, freebsd-proliant@freebsd.org
Subject: Re: Server reboots at random :-(

On Wed, Jun 18, 2008 at 12:29:00PM -0400, BarryFriedman@www1.emax.ca wrote:
> Any ideas about how to troubleshoot this would be greatly appreciated.

Start by enabling crash dumps to see whether it is a kernel issue or a
hardware issue.

In /etc/rc.conf:

dumpdev="AUTO"
dumpdir="/usr/crash"

The second reboot will take muuuuch longer because it is saving the dump,
but it gives some information to start with.

See also
http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kerneldebug.html
and following.

Edwin
--
Edwin Groothuis      |  Personal website: http://www.mavetju.org
edwin@mavetju.org    |  Weblog: http://www.mavetju.org/weblog/

From owner-freebsd-proliant@FreeBSD.ORG Fri Jun 20 08:10:06 2008
Date: Fri, 20 Jun 2008 00:50:00 -0700
From: Ulf Zimmermann
To: Edwin Groothuis
Message-ID: <20080620074959.GO89289@evil.alameda.net>
References: <20080618162900.GA70649@emax.ca> <20080618212941.GM89661@k7.mavetju>
In-Reply-To: <20080618212941.GM89661@k7.mavetju>
Organization: Alameda Networks, Inc.
Reply-To: ulf@Alameda.net
Cc: bfriedman@emax.ca, BarryFriedman@www1.emax.ca, freebsd-proliant@freebsd.org
Subject: Re: Server reboots at random :-(

On Thu, Jun 19, 2008 at 07:29:41AM +1000, Edwin Groothuis wrote:
> On Wed, Jun 18, 2008 at 12:29:00PM -0400, BarryFriedman@www1.emax.ca wrote:
> > Any ideas about how to troubleshoot this would be greatly appreciated.
>
> Start with enabling crash dumps to see if it is a kernel issue or
> a hardware issue:
>
> In /etc/rc.conf:
>
> dumpdev="AUTO"
> dumpdir="/usr/crash"
>
> The second reboot will take muuuuch longer because it is saving
> this, but it gives some information to start with.
>
> See also
> http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kerneldebug.html
> and following.
>

You might also want to set up the virtual serial port and enable the console
on serial. Then you can use conserver to log the console output and check it
after a reboot.

--
Regards, Ulf.

---------------------------------------------------------------------
Ulf Zimmermann, 1525 Pacific Ave., Alameda, CA-94501, #: 510-865-0204
You can find my resume at: http://www.Alameda.net/~ulf/resume.html

From owner-freebsd-proliant@FreeBSD.ORG Fri Jun 20 21:05:51 2008
Message-ID: <3a9afd990806201339q6d2b330bh56fa23c84356b464@mail.gmail.com>
Date: Fri, 20 Jun 2008 16:39:55 -0400
From: "Barry Friedman"
To: freebsd-proliant@freebsd.org
Subject: Re: Server reboots at random

I made the changes Edwin suggested, and after several crash cycles no dump
files have been generated. I will look into using conserver and see if that
catches anything.

One other anomaly that I have noticed with this machine is that when
receiving file transfers, the throughput is throttled down to a trickle,
although transfers out run at full speed. Also, ssh login to the iLO doesn't
seem to work, although I can connect to it with a browser.

Thanks for the advice.

--
Barry Friedman
Emax Computer Systems Inc.,
480 Tweedsmuir Ave.,
Ottawa, Ont. Canada K1Z 5N9
Phone: (613) 725-3198 Fax: 725-0298
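
The first message in this thread notes that the only trace each reset leaves
in the system logs is the "WARNING: / was not properly dismounted" line. As a
rough sketch, a small shell function could tally those warnings per day so
they can be compared against the reset entries in the iLO event log. It
assumes traditional syslog-format lines (month, day, time, host, message) and
assumes the warning is logged once per unclean boot. The function name
count_unclean_boots is made up for illustration and is not an existing tool.

```shell
#!/bin/sh
# Tally unclean boots per day from a syslog-format log file.
# Assumption: lines look like
#   "Jun 18 12:29:00 host kernel: WARNING: / was not properly dismounted"
# Assumption: one such warning for the root filesystem per unclean boot.
# count_unclean_boots is a hypothetical helper name.
count_unclean_boots() {
    # $1: path to the log file to scan
    awk '/WARNING: \/ was not properly dismounted/ {
        day = $1 " " $2                        # month and day of the entry
        if (!(day in count)) order[++n] = day  # remember first-seen order
        count[day]++
    }
    END {
        # Print "Mon DD N" for each day, in the order the days appeared.
        for (i = 1; i <= n; i++) print order[i], count[order[i]]
    }' "$1"
}
```

It could be run as "count_unclean_boots /var/log/messages"; the log path is an
assumption about where kernel messages end up on this machine. Days with a
count that matches the number of "Server reset" entries in the iLO log would
suggest every reboot was a hard reset rather than a clean shutdown.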