From owner-freebsd-stable@FreeBSD.ORG Sat Feb 20 19:37:21 2010 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 1FCEC1065670 for ; Sat, 20 Feb 2010 19:37:21 +0000 (UTC) (envelope-from jdc@koitsu.dyndns.org) Received: from qmta12.emeryville.ca.mail.comcast.net (qmta12.emeryville.ca.mail.comcast.net [76.96.27.227]) by mx1.freebsd.org (Postfix) with ESMTP id 02DC78FC12 for ; Sat, 20 Feb 2010 19:37:19 +0000 (UTC) Received: from omta22.emeryville.ca.mail.comcast.net ([76.96.30.89]) by qmta12.emeryville.ca.mail.comcast.net with comcast id kJYT1d0031vN32cACKdL3J; Sat, 20 Feb 2010 19:37:20 +0000 Received: from koitsu.dyndns.org ([98.248.46.159]) by omta22.emeryville.ca.mail.comcast.net with comcast id kKfd1d0013S48mS8iKfdll; Sat, 20 Feb 2010 19:39:37 +0000 Received: by icarus.home.lan (Postfix, from userid 1000) id 69C841E301A; Sat, 20 Feb 2010 11:37:18 -0800 (PST) Date: Sat, 20 Feb 2010 11:37:18 -0800 From: Jeremy Chadwick To: freebsd-stable@freebsd.org Message-ID: <20100220193718.GA33214@icarus.home.lan> References: <20100131144217.ca08e965.torfinn.ingolfsen@broadpark.no> <20100131175639.86ba9aee.torfinn.ingolfsen@broadpark.no> <20100207163631.da7205fc.torfinn.ingolfsen@broadpark.no> <20100213192404.5e15b5eb.torfinn.ingolfsen@broadpark.no> <20100217091625.d0e74570.torfinn.ingolfsen@broadpark.no> <20100220202108.e1dd1b74.torfinn.ingolfsen@broadpark.no> MIME-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: <20100220202108.e1dd1b74.torfinn.ingolfsen@broadpark.no> User-Agent: Mutt/1.5.20 (2009-06-14) Subject: Re: panic - sleeping thread on FreeBSD 8.0-stable / amd64 X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 20 Feb 2010 19:37:21 -0000 On Sat, Feb 20, 2010 at 08:21:08PM +0100, Torfinn Ingolfsen wrote: > Another day, another crash. > >From /var/log/messages: > Feb 20 08:52:26 kg-f2 ntpd[58609]: time reset +1.169751 s > Feb 20 08:54:57 kg-f2 kernel: ata5: port is not ready (timeout 10000ms) tfd = 0000007f > Feb 20 08:54:57 kg-f2 kernel: ata5: hardware reset timeout > Feb 20 19:18:51 kg-f2 syslogd: kernel boot file is /boot/kernel/kernel > > The drives are as follows: > root@kg-f2# atacontrol list;camcontrol devlist > ATA channel 0: > Master: no device present > Slave: no device present > ATA channel 2: > Master: ad4 SATA revision 2.x > Slave: no device present > ATA channel 3: > Master: ad6 SATA revision 2.x > Slave: no device present > ATA channel 4: > Master: ad8 SATA revision 2.x > Slave: no device present > ATA channel 5: > Master: ad10 SATA revision 2.x > Slave: no device present > ATA channel 6: > Master: ad12 SATA revision 2.x > Slave: no device present > ATA channel 7: > Master: ad14 SATA revision 2.x > Slave: no device present > at scbus0 target 0 lun 0 (pass0,ada0) > > Smartctl is happy, too: > root@kg-f2# smartctl -H /dev/ad4 > smartctl 5.39 2009-12-09 r2995 [FreeBSD 8.0-STABLE amd64] (local build) > Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > root@kg-f2# smartctl -H /dev/ad6 > smartctl 5.39 2009-12-09 r2995 [FreeBSD 8.0-STABLE amd64] (local build) > Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > root@kg-f2# smartctl -H /dev/ad8 > smartctl 5.39 2009-12-09 r2995 [FreeBSD 8.0-STABLE amd64] (local build) > Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > root@kg-f2# smartctl -H /dev/ad10 > smartctl 5.39 2009-12-09 r2995 [FreeBSD 8.0-STABLE amd64] (local build) > Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > root@kg-f2# smartctl -H /dev/ad12 > smartctl 5.39 2009-12-09 r2995 [FreeBSD 8.0-STABLE amd64] (local build) > Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > root@kg-f2# smartctl -H /dev/ada0 > smartctl 5.39 2009-12-09 r2995 [FreeBSD 8.0-STABLE amd64] (local build) > Copyright (C) 2002-9 by Bruce Allen, http://smartmontools.sourceforge.net > > === START OF READ SMART DATA SECTION === > SMART overall-health self-assessment test result: PASSED > > Maybe the hardware is just plain broken. Can you re-run smartctl -a instead of -H? Some of the SMART attributes may help determine what's going on, or there may be related errors in the SMART error log. Otherwise I'd say what's happening is a SATA controller lock-up of some sort, since it happens on any of your channels. Could be a quirk of some kind in the SATA->CAM stuff (unless it also happens when using pure ata(4)). What controller are these disks hooked to again? -- | Jeremy Chadwick jdc@parodius.com | | Parodius Networking http://www.parodius.com/ | | UNIX Systems Administrator Mountain View, CA, USA | | Making life hard for others since 1977. PGP: 4BD6C0CB |