From owner-freebsd-scsi@FreeBSD.ORG Mon Jan 24 16:01:11 2011 Return-Path: Delivered-To: freebsd-scsi@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 9BDFF1065672; Mon, 24 Jan 2011 16:01:11 +0000 (UTC) (envelope-from ken@kdm.org) Received: from nargothrond.kdm.org (nargothrond.kdm.org [70.56.43.81]) by mx1.freebsd.org (Postfix) with ESMTP id 627548FC15; Mon, 24 Jan 2011 16:01:11 +0000 (UTC) Received: from nargothrond.kdm.org (localhost [127.0.0.1]) by nargothrond.kdm.org (8.14.2/8.14.2) with ESMTP id p0OG1A49080781; Mon, 24 Jan 2011 09:01:10 -0700 (MST) (envelope-from ken@nargothrond.kdm.org) Received: (from ken@localhost) by nargothrond.kdm.org (8.14.2/8.14.2/Submit) id p0OG1A9J080780; Mon, 24 Jan 2011 09:01:10 -0700 (MST) (envelope-from ken) Date: Mon, 24 Jan 2011 09:01:10 -0700 From: "Kenneth D. Merry" To: Joachim Tingvold Message-ID: <20110124160110.GA77711@nargothrond.kdm.org> References: <4D2DAA45.30602@FreeBSD.org> <41C64262-4300-4187-B5FD-04A5EFB7F87C@tingvold.com> <20110113203750.GA39494@nargothrond.kdm.org> <20110114001758.GA12793@nargothrond.kdm.org> <07392102-4584-4690-9188-5202728CC7CA@tingvold.com> <20110120155746.GA22515@nargothrond.kdm.org> Mime-Version: 1.0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline In-Reply-To: User-Agent: Mutt/1.4.2i Cc: freebsd-scsi@freebsd.org, Alexander Motin Subject: Re: mps0-troubles X-BeenThere: freebsd-scsi@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: SCSI subsystem List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 24 Jan 2011 16:01:11 -0000 On Fri, Jan 21, 2011 at 20:15:15 +0100, Joachim Tingvold wrote: > On Thu, Jan 20, 2011, at 16:57:46PM GMT+01:00, Kenneth D. Merry wrote: > >So did the system freeze or crash this time? > > It happened while I was not actively using the machine. When I found > out about it, everything seemed to be fine. Okay, so that means it didn't panic and didn't lock up permanently, so things are probably okay. > >Do you have a serial console on the machine? > >If it hangs or crashes, perhaps we can get a stack trace. > > You mean normal RS-232? Yes. > >It does look like the out of chain problem was fixed by increasing the > >number, so that's good at least. > > Yes. For now, at least. (-: Yes. It looks like you're just getting some timeouts, followed by aborts and retries. It's not optimal, but at least the system is recovering and continuing on. FWIW, there is some much more sophisticated error recovery code coming for the driver, written by the guys at Isilon. That'll come in when LSI gets done with their version of the driver. The error recovery that is in there now is basic, but it works in most of the common cases. There is a case related to device removal that Justin fixed that I'm planning to commit in the near future. Ken -- Kenneth Merry ken@FreeBSD.ORG