Date: Thu, 6 Oct 2005 11:03:15 -0500
From: Dan Rue
To: Lowell Gilbert
Cc: freebsd-questions@freebsd.org
Subject: Re: twa kernel panic under heavy load
Message-ID: <20051006160315.GQ3785@therub.org>
In-Reply-To: <444q7wt7om.fsf@be-well.ilk.org>

On Wed, Oct 05, 2005 at 09:27:53AM -0400, Lowell Gilbert wrote:
> Dan Rue writes:
>
> > Greetings,
> >
> > I am running a 3ware 9500 SATA raid card in a 12x300GB raid 50
> > configuration. Too often, I am seeing reboots during high I/O (rsync)
> > operations.
> >
> > drue@leopard:~$ uname -a
> > FreeBSD leopard.claimlynx.com 5.4-SECURITY FreeBSD 5.4-SECURITY #0: Thu
> > Jun 30 02:25:52 UTC 2005
> > root@builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC i386
> >
> > Here is dmesg identifying the controller:
> > 3ware device driver for 9000 series storage controllers, version: 2.50.02.012
> > twa0: <3ware 9000 series Storage Controller> port 0xb800-0xb8ff mem 0xfb800000-0xfbffffff,0xfc5ffc00-0xfc5ffcff irq 24 at device 2.0 on pci2
> > twa0: 12 ports, Firmware FE9X 2.06.00.009, BIOS BE9X 2.03.01.051
> >
> >
> > Is this something that has been worked on in 5-STABLE or 6? Would
> > a kernel dump be helpful? Please aim me at the appropriate list or
> > people that would know.
>
> The driver was imported from 3ware's own code on their website, and a
> new version was imported since the 5.4 release was branched. Your
> problem may or may not have been fixed, but debugging it on the old
> code base is definitely not worth anyone's time.
>
> Definitely upgrade and see what happens.

I upgraded to 5-STABLE yesterday. Last night, while running rsync, the
machine rebooted on me again.
From /var/log/messages:

Oct 5 23:08:41 leopard kernel: ected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct 5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct 5 23:08:41 leopard last message repeated 7 times
Oct 5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg =ected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct 5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct 5 23:08:41 leopard last message repeated 106 times
Oct 5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct 5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct 5 23:08:41 leopard last message repeated 296 times
Oct 5 23:09:42 leopard kernel: twa0: ERROR: (0x05: 0x210b): Request timed out!:request = 0xc2425600
Oct 5 23:09:42 leopard kernel: twa0: INFO: (0x16: 0x1108): Resetting controller...:
Oct 5 23:09:42 leopard kernel: twa0: INFO: (0x04: 0x005e): Cache synchronized after power fail: unit=0
Oct 5 23:09:42 leopard kernel: twa0: INFO: (0x04: 0x0001): Controller reset occurred: resets=1
Oct 5 23:09:42 leopard kernel: twa0: INFO: (0x16: 0x1107): Controller reset done!:
Oct 5 23:12:59 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg = 0x15025d50; Missing bits: [MC_RDY,]
Oct 5 23:13:00 leopard last message repeated 379 times
Oct 5 23:13:00 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg = 0x15025d52; Missing bits: [MC_RDY,]
Oct 5 23:46:31 leopard syslogd: kernel boot file is /boot/kernel/kernel

Please let me know who I may contact to get this debugged.

Thanks,
Dan
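
For anyone triaging a similar report, the /var/log/messages excerpt above is hard to read at a glance because syslogd collapses duplicates into "last message repeated N times" lines. A minimal Python sketch that tallies twa0 messages by severity and error code, folding those repeat counts back in, might look like the following. The function name `tally_twa` is hypothetical, and the line layout it expects is assumed from the excerpt above rather than taken from any driver documentation.

```python
import re
from collections import Counter

# Assumed layout, taken from the excerpt above:
#   ... kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): ...
TWA_RE = re.compile(
    r"twa0: (ERROR|INFO): \((0x[0-9a-f]+): (0x[0-9a-f]+)\):", re.IGNORECASE
)
# syslogd's duplicate-suppression line.
REPEAT_RE = re.compile(r"last message repeated (\d+) times")


def tally_twa(lines):
    """Count twa0 messages keyed by (severity, error code).

    A "last message repeated N times" line adds N to the twa0 message
    immediately before it. Truncated or unrelated lines are skipped and
    clear the "previous message" so repeats are not misattributed.
    """
    counts = Counter()
    last_key = None
    for line in lines:
        match = TWA_RE.search(line)
        if match:
            last_key = (match.group(1).upper(), match.group(3).lower())
            counts[last_key] += 1
            continue
        repeat = REPEAT_RE.search(line)
        if repeat:
            if last_key is not None:
                counts[last_key] += int(repeat.group(1))
        else:
            last_key = None
    return counts
```

It could be run over a log with something like `tally_twa(open("/var/log/messages"))`. The result only summarizes what was logged; it says nothing about why the controller stopped reporting MC_RDY.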