From owner-freebsd-current Thu Mar 23 20:16: 5 2000
Delivered-To: freebsd-current@freebsd.org
Received: from lamb.sas.com (lamb.sas.com [192.35.83.8])
	by hub.freebsd.org (Postfix) with ESMTP
	id 47F6737B660; Thu, 23 Mar 2000 20:15:53 -0800 (PST)
	(envelope-from brdean@unx.sas.com)
Received: from mozart (mozart.unx.sas.com [149.173.6.8])
	by lamb.sas.com (8.9.3/8.9.1) with SMTP id XAA16446;
	Thu, 23 Mar 2000 23:15:48 -0500 (EST)
Received: from dean.pc.sas.com by mozart (5.65c/SAS/Domains/5-6-90)
	id AA10589; Thu, 23 Mar 2000 23:15:17 -0500
Received: (from brdean@localhost)
	by dean.pc.sas.com (8.9.3/8.9.1) id XAA32499;
	Thu, 23 Mar 2000 23:15:16 -0500 (EST)
	(envelope-from brdean)
From: Brian Dean
Message-Id: <200003240415.XAA32499@dean.pc.sas.com>
Subject: Re: AMI MegaRAID lockup? not accepting commands.
In-Reply-To: <200003232150.NAA02123@mass.cdrom.com> from Mike Smith at "Mar 23, 2000 01:50:28 pm"
To: Mike Smith
Date: Thu, 23 Mar 2000 23:15:16 -0500 (EST)
Cc: mw@kpnqwest.ch, freebsd-current@FreeBSD.ORG
X-Mailer: ELM [version 2.4ME+ PL61 (25)]
Mime-Version: 1.0
Content-Type: text/plain; charset=US-ASCII
Content-Transfer-Encoding: 7bit
Sender: owner-freebsd-current@FreeBSD.ORG
Precedence: bulk
X-Loop: FreeBSD.ORG

Mike Smith wrote:
> > I've played around changing the spinloop to using DELAY (like the Linux model),
> > but this didn't prevent the controller from either "just" locking up or
> > crashing the whole machine with it. Changing various other places in a similar
> > manner (like replacing the bcopy() in amr_quartz_get_work() with similar
> > code as in the linux driver to wait for 0xFF to clear) didn't do the trick
> > either.
>
> Can you try instead the changes that I just committed to -current? I
> think that the problem shows up when the controller is heavily loaded;
> your patch will keep the load on the controller down, which may mask the
> 'real' bug.

Just recently (this evening), I was able to get our controller to lock up with the latest patch.
Previously, with that patch installed, I must not have been able to tickle the bug just right, and I believe that Mike decided to make that mod because I no longer saw the lockup, which until then had always happened quickly. That's what made me think that we'd solved it, but I guess I just got "lucky" on the previous lockups that happened very quickly, making me think it was more easily reproducible than it actually is.

It sounds like Markus may be onto something.

-Brian
--
Brian Dean				brdean@unx.sas.com
SAS Institute Inc.			bsd@FreeBSD.ORG

To Unsubscribe: send mail to majordomo@FreeBSD.org
with "unsubscribe freebsd-current" in the body of the message