From owner-freebsd-hardware@FreeBSD.ORG Thu Jul 3 01:06:34 2008 Return-Path: Delivered-To: freebsd-hardware@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B5A31106566C for ; Thu, 3 Jul 2008 01:06:34 +0000 (UTC) (envelope-from bseklecki@collaborativefusion.com) Received: from collaborativefusion.com (mx01.pub.collaborativefusion.com [206.210.89.201]) by mx1.freebsd.org (Postfix) with ESMTP id 6A1948FC16 for ; Thu, 3 Jul 2008 01:06:34 +0000 (UTC) (envelope-from bseklecki@collaborativefusion.com) Received: from Internal Mail-Server by mx01 (envelope-from bseklecki@collaborativefusion.com) with SMTP; 2 Jul 2008 20:07:51 -0500 From: "Brian A. Seklecki (Mobile)" To: Scott Long In-Reply-To: <486909B1.3020309@samsco.org> References: <20071114122210.42E8613C4BB@mx1.freebsd.org> <1195160114.4042.154.camel@new-host> <1214840198.18670.43.camel@soundwave.ws.pitbpa0.priv.collaborativefusion.com> <486909B1.3020309@samsco.org> Content-Type: text/plain Organization: Collaborative Fusion, Inc. Date: Wed, 02 Jul 2008 21:07:41 -0400 Message-Id: <1215047261.9810.16.camel@localhost.localdomain> Mime-Version: 1.0 X-Mailer: Evolution 2.22.2 (2.22.2-2.fc9) Content-Transfer-Encoding: 7bit Cc: Sean McAfee , Jason Thomson , scottl@freebsd.org, Benjie Chen , "freebsd-hardware@freebsd.org" Subject: Re: PERC5 (LSI MegaSAS) Patrol Read crashes X-BeenThere: freebsd-hardware@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list Reply-To: bseklecki@collaborativefusion.com List-Id: General discussion of FreeBSD hardware List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 03 Jul 2008 01:06:34 -0000 > >> Its a software bug (driver). It can probably be easily fixed. I > >> think there's a PR on it somewhere (will check). > > The problem is a firmware bug in the Megaraid SAS controller. It seems > that while the controller can handle 512 or more concurrent commands, That's great news. We will try that patch in our local source tree. One thing to note, though, is that both or R1 and R2/3 systems have the same controller with the same firmware version, but we never saw the problem in the R2/3. Indeed the Dell product number revision is different (from ipmitool fru) for the parts. Although the firmware updates are the same for R1 and R2/3, maybe the updater probes the underlying hardware revision and applies different code? Or perhaps it is something to do with the performance or kernel behavior of the older Hyperthreading Xeon's (and motherboard) in the the R1 that just causes it to occur more-often. ~BAS > it can only handle 128 concurrent commands to each array. Patrols > reads aren't the primary cause, they just help the problem appear; when > a patrol read cycle runs, it tends to slow down i/o enough that commands > to the array get backed up, and you tend to reach the 128 limit. > > I don't know if there is a firmware fix from Dell/LSI, or if there will > ever be a fix. FreeBSD drivers tend to stress hardware a lot more > than Linux and Windows do, and since the latter two are used as the > QA yardstick, anything that doesn't affect them doesn't usually get > fixed. An easy work-around for the driver is to change the following > line in /sys/dev/mfi/mfi.c::mfi_alloc_commands() > > ncmds = sc->mfi_max_fw_cmds; > > to > > ncmds = 128; > > A more complete solution requires me writing an i/o scheduler in the > driver, something that would take quite a bit of effort. > > With all this said, I still stand behind LSI controllers. This bug, > while unfortunate, is relatively minor and easy to work around, and > it's the only significant bug that has turned up in over two and half > years with this hardware. > > Scott > > _______________________________________________ > freebsd-hardware@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-hardware > To unsubscribe, send any mail to "freebsd-hardware-unsubscribe@freebsd.org" IMPORTANT: This message contains confidential information and is intended only for the individual named. If the reader of this message is not an intended recipient (or the individual responsible for the delivery of this message to an intended recipient), please be advised that any re-use, dissemination, distribution or copying of this message is prohibited. Please notify the sender immediately by e-mail if you have received this e-mail by mistake and delete this e-mail from your system.