From owner-freebsd-hardware@FreeBSD.ORG Wed Nov 14 12:22:10 2007 Return-Path: Delivered-To: freebsd-hardware@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9756416A4DC for ; Wed, 14 Nov 2007 12:22:10 +0000 (UTC) (envelope-from simon@optinet.com) Received: from cobra.acceleratedweb.net (cobra-gw.acceleratedweb.net [207.99.79.37]) by mx1.freebsd.org (Postfix) with SMTP id 42E8613C4BB for ; Wed, 14 Nov 2007 12:22:10 +0000 (UTC) (envelope-from simon@optinet.com) Received: (qmail 4648 invoked by uid 110); 14 Nov 2007 12:22:07 -0000 Received: from unknown (HELO desktop1) (simon%optinet.com@69.112.29.182) by cobra.acceleratedweb.net with SMTP; 14 Nov 2007 12:22:07 -0000 From: "Simon" To: "Jason Thomson" Date: Wed, 14 Nov 2007 07:23:52 -0500 Priority: Normal X-Mailer: PMMail 2000 Professional (2.20.2717) For Windows 2000 (5.1.2600;2) In-Reply-To: <473AC973.70509@mintel.com> MIME-Version: 1.0 Message-Id: <20071114122210.42E8613C4BB@mx1.freebsd.org> Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: Sean McAfee , Benjie Chen , "freebsd-hardware@freebsd.org" Subject: Re: PERC5 (LSI MegaSAS) Patrol Read crashes X-BeenThere: freebsd-hardware@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: General discussion of FreeBSD hardware List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 14 Nov 2007 12:22:10 -0000 Do you guys perform consistency checks on your RAID5 or other redundant arrays? -Simon On Wed, 14 Nov 2007 10:09:55 +0000, Jason Thomson wrote: >We have ~20 Dell servers with 6.x and Perc5. >Automatic PR is enabled on all of them, and happens once a week >without any problems (so far - since July). >Obviously, from the thread, manual Patrol Reads cause a definite >problem, but it's not clear that the initial problem had the same >symptoms - just that there was a correlation between the automatic >Patrol Reads and the server lockup? >I guess it may be that under heavy load (exacerbated by PR) something >else gets screwed up and causes the machine lockup? >Simon wrote: >> So everyone that uses Dell servers with Perc5 and 6.x disables automatic PR? >> >> -Simon >> >> --Original Message Text--- >> From: Benjie Chen >> Date: Tue, 13 Nov 2007 17:50:41 -0500 >> >> If you disable PR, the problem will go away. Below are some useful commands. Sean mentioned that you could disable it, then set it to do an automatic PR 1 hour after you restart the automatic PR. automatic PR does not crash the system >> definitively, but manual PR does. So during your downtime, you could try to do an automatic PR... >> >> # Disables patrol reads on all adapters >> megacli -AdpPR -Dsbl -aALL >> >> # Enables automatic patrol reads >> megacli -AdpPR -EnblAuto -aALL >> >> # Sets the interval for automatic reads to 1 hour - it only >> >> # accepts whole numbers so that's the lowest you can go >> megacli -AdpPR -SetDelay 1 -aALL >> >> >> Some other useful ones: >> >> # Patrol read settings and information >> megacli -AdpPR -Info -aALL >> >> # Extended information >> >> megacli -AdpAllInfo -aALL >> >> # Export controller's event log to file >> megacli -AdpEventLog -IncludeDeleted -f -aALL >> >> # More logging >> megacli -FwTermLog -Dsply -aALL >> >> >> On 11/13/07, Simon wrote: Hello, >> >> I'm just wondering, was this ever resolved? I was about to start using >> new 2950 with Perc5 in it, but now I'm afraid to as I cannot afford >> downtime. Why this is still a linux hack is beyond me. The way Dell >> is doing, they ought to have a port specifically for FreeBSD >> >> If I disable PR altogether (not sure if this is possible, yet), although I >> don't see why it wouldn't be, would the mentioned problem go away? >> >> Thank you, >> Simon >> >> >> On Mon, 01 Oct 2007 17:26:29 -0400, Sean McAfee wrote: >> >> >>>John Baldwin wrote: >>> >>>>On Saturday 29 September 2007 09:18:17 pm Benjie Chen wrote: >>>> >>>>Hmm, I haven't tried with megacli, but an internal tool at work is able to >>>>start manual patrol reads w/o causing a crash, and I've also seen production >>>>boxes running automatic patrol reads w/o causing crashes. Do you have to >>>>have a certain load before it will crash? >> >> >>>The crashes that we've seen in production have occurred while patrol >>>reads kick off under moderate-high load, but in testing, an automatic >>>read will complete fine. Even with maxed-out I/O*, we haven't been able >>>to come up with reliable testing scenario to trigger crashes on >>>automatic patrol reads. >> >> >> >>>(*My base testing scenario involved running a pretty heavy stress [as in >>>the program available in ports], while repeatedly copying ports & src >> >>>from an NFS mount to another local mountpoint and SCPing a large file in >> >>>a loop from another machine.) >> >> >> >>>Sean McAfee >>>Collaborative Fusion, Inc. >>> smcafee@collaborativefusion.com >>> 412-422-3463 x 4025 >> >> >>>1710 Murray Avenue, Suite 320 >>>Pittsburgh, PA 15217 >> >> >>>**************************************************************** >>>IMPORTANT: This message contains confidential information >>>and is intended only for the individual named. If the reader of >>>this message is not an intended recipient (or the individual >>>responsible for the delivery of this message to an intended >>>recipient), please be advised that any re-use, dissemination, >>>distribution or copying of this message is prohibited. Please >>>notify the sender immediately by e-mail if you have received >>>this e-mail by mistake and delete this e-mail from your system. >>>E-mail transmission cannot be guaranteed to be secure or >>>error-free as information could be intercepted, corrupted, lost, >>>destroyed, arrive late or incomplete, or contain viruses. The >>>sender therefore does not accept liability for any errors or >>>omissions in the contents of this message, which arise as a >>>result of e-mail transmission. >>>**************************************************************** >> >> >> >>>_______________________________________________ >>>freebsd-hardware@freebsd.org mailing list >>>http://lists.freebsd.org/mailman/listinfo/freebsd-hardware >>>To unsubscribe, send any mail to " freebsd-hardware-unsubscribe@freebsd.org" >> >> >> >> >> _______________________________________________ >> freebsd-hardware@freebsd.org mailing list >> http://lists.freebsd.org/mailman/listinfo/freebsd-hardware >> To unsubscribe, send any mail to " freebsd-hardware-unsubscribe@freebsd.org" >> >> >> >> >>