Skip site navigation (1)Skip section navigation (2)
Date:      Thu, 6 Oct 2005 11:03:15 -0500
From:      Dan Rue <drue@therub.org>
To:        Lowell Gilbert <freebsd-questions-local@be-well.ilk.org>
Cc:        freebsd-questions@freebsd.org
Subject:   Re: twa kernel panic under heavy load
Message-ID:  <20051006160315.GQ3785@therub.org>
In-Reply-To: <444q7wt7om.fsf@be-well.ilk.org>
References:  <20051004215545.GJ3785@therub.org> <444q7wt7om.fsf@be-well.ilk.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On Wed, Oct 05, 2005 at 09:27:53AM -0400, Lowell Gilbert wrote:
> Dan Rue <drue@therub.org> writes:
> 
> > Greetings,
> > 
> > I am running a 3ware 9500 SATA raid card in a 12x300GB raid 50
> > configuration.  Too often, I am seeing reboots during high I/O (rsync)
> > operations.  
> > 
> > drue@leopard:~$ uname -a
> > FreeBSD leopard.claimlynx.com 5.4-SECURITY FreeBSD 5.4-SECURITY #0: Thu
> > Jun 30 02:25:52 UTC 2005
> > root@builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC  i386
> > 
> > Here is dmesg identifying the controller: 
> > 3ware device driver for 9000 series storage controllers, version: 2.50.02.012
> > twa0: <3ware 9000 series Storage Controller> port 0xb800-0xb8ff mem 0xfb800000-0xfbffffff,0xfc5ffc00-0xfc5ffcff irq 24 at device 2.0 on pci2
> > twa0: 12 ports, Firmware FE9X 2.06.00.009, BIOS BE9X 2.03.01.051
> > 
> > 
> > Is this somethign that has been worked on in 5-STABLE or 6?  Would
> > a kernel dump be helpful?  Please aim me at the appropriate list or
> > people that would know.
> 
> The driver was imported from 3ware's own code on their website, and a
> new version was imported since the 5.4 release was branched.  Your
> problem may or may not have been fixed, but debugging it on the old
> code base is definitely not worth anyone's time.
> 
> Definitely upgrade and see what happens.

I upgraded to 5-STABLE yesterday.  Last night, while running rsync, the
machine rebooted on me again..

>From /var/log/messages: 
Oct  5 23:08:41 leopard kernel: ected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct  5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct  5 23:08:41 leopard last message repeated 7 times
Oct  5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg =ected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct  5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct  5 23:08:41 leopard last message repeated 106 times
Oct  5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct  5 23:08:41 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg = 0x15025f32; Missing bits: [MC_RDY,]
Oct  5 23:08:41 leopard last message repeated 296 times
Oct  5 23:09:42 leopard kernel: twa0: ERROR: (0x05: 0x210b): Request timed out!:request = 0xc2425600
Oct  5 23:09:42 leopard kernel: twa0: INFO: (0x16: 0x1108): Resetting controller...:  
Oct  5 23:09:42 leopard kernel: twa0: INFO: (0x04: 0x005e): Cache synchronized after power fail: unit=0
Oct  5 23:09:42 leopard kernel: twa0: INFO: (0x04: 0x0001): Controller reset occurred: resets=1
Oct  5 23:09:42 leopard kernel: twa0: INFO: (0x16: 0x1107): Controller reset done!:  
Oct  5 23:12:59 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected st
atus bit(s): status reg = 0x15025d50; Missing bits: [MC_RDY,]
Oct  5 23:13:00 leopard last message repeated 379 times
Oct  5 23:13:00 leopard kernel: twa0: ERROR: (0x16: 0x1301): Missing expected status bit(s): status reg = 0x15025d52; Missing bits: [MC_RDY,]
Oct  5 23:46:31 leopard syslogd: kernel boot file is /boot/kernel/kernel

Please let me know who I may contact to get this debugged.
Thanks, 
Dan



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20051006160315.GQ3785>