From owner-freebsd-stable@FreeBSD.ORG Sat Feb 26 01:11:46 2005 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id F0F0916A4CE for ; Sat, 26 Feb 2005 01:11:46 +0000 (GMT) Received: from carver.gumbysoft.com (carver.gumbysoft.com [66.220.23.50]) by mx1.FreeBSD.org (Postfix) with ESMTP id BB2EF43D3F for ; Sat, 26 Feb 2005 01:11:46 +0000 (GMT) (envelope-from dwhite@gumbysoft.com) Received: by carver.gumbysoft.com (Postfix, from userid 1000) id AB16872DD4; Fri, 25 Feb 2005 17:11:46 -0800 (PST) Received: from localhost (localhost [127.0.0.1]) by carver.gumbysoft.com (Postfix) with ESMTP id A5B6E72DCB; Fri, 25 Feb 2005 17:11:46 -0800 (PST) Date: Fri, 25 Feb 2005 17:11:46 -0800 (PST) From: Doug White To: Piotr Gnyp In-Reply-To: <20050223145950.GA1147@discordia.pl> Message-ID: <20050225170533.M30975@carver.gumbysoft.com> References: <20050223145950.GA1147@discordia.pl> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII cc: freebsd-stable@freebsd.org Subject: Re: FreeBSD 5.3-R-p5: frequent kernel panics X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 26 Feb 2005 01:11:47 -0000 On Wed, 23 Feb 2005, Piotr Gnyp wrote: > Hi. > > First - some details: > > uname -pr > 5.3-RELEASE-p5 i386 > > kernel: > http://discordia.pl/~toread/OBLIVION > > dmesg: > http://discordia.pl/~toread/dmesg.txt > > mptable: > http://discordia.pl/~toread/mptable.txt > > System frequently crashes with kernel panic - some examples: > http://discordia.pl/~toread/crash4.txt > http://discordia.pl/~toread/crash5.txt Please capture the panic messages and relevant kernel output along with your traces. It helps to know what the system was doing when it tanked, and the trap info decodes the frame in an easy-to-digest format for humans. :) Your traces are quite bizarre... somehow ttwakeup() jumps off into space. Line 2370 of ttwakeup() is ttecho() which implies your system is suffering from severe memory corruption, or your kernel binary is damaged. I'd suggest doing a fresh buildworld+kernel and running memtest86. Also the mpt driver in 5.3 is known to react badly to SCSI errors but I don't think it runs off and randomly corrupts memory. Also check the system event log for any errors logged from hardware, such as ECC corrections, power problems, or temperature alarms. The ProLiants (this is a DL380 or something of that nature?) have sophisticated internal monitoring and can log events even if the OS is damaged. What were your kernel compile flags? -- Doug White | FreeBSD: The Power to Serve dwhite@gumbysoft.com | www.FreeBSD.org