From owner-freebsd-arm@FreeBSD.ORG Tue Feb 12 15:02:05 2013 Return-Path: Delivered-To: freebsd-arm@FreeBSD.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by hub.freebsd.org (Postfix) with ESMTP id 8C511ED2 for ; Tue, 12 Feb 2013 15:02:05 +0000 (UTC) (envelope-from ian@FreeBSD.org) Received: from duck.symmetricom.us (duck.symmetricom.us [206.168.13.214]) by mx1.freebsd.org (Postfix) with ESMTP id 258879D6 for ; Tue, 12 Feb 2013 15:02:04 +0000 (UTC) Received: from damnhippie.dyndns.org (daffy.symmetricom.us [206.168.13.218]) by duck.symmetricom.us (8.14.6/8.14.6) with ESMTP id r1CF1vLV082563 for ; Tue, 12 Feb 2013 08:01:57 -0700 (MST) (envelope-from ian@FreeBSD.org) Received: from [172.22.42.240] (revolution.hippie.lan [172.22.42.240]) by damnhippie.dyndns.org (8.14.3/8.14.3) with ESMTP id r1CF1Z5r041033; Tue, 12 Feb 2013 08:01:35 -0700 (MST) (envelope-from ian@FreeBSD.org) Subject: Re: BeagleBone locked up From: Ian Lepore To: Tim Kientzle In-Reply-To: <4AF15BB9-4174-4564-A770-BF9EB9D447F5@kientzle.com> References: <20130210231709.26f122dc@ivory.local> <20130211190606.1c985baf@ivory.local> <1360629932.4545.150.camel@revolution.hippie.lan> <4AF15BB9-4174-4564-A770-BF9EB9D447F5@kientzle.com> Content-Type: text/plain; charset="us-ascii" Date: Tue, 12 Feb 2013 08:01:35 -0700 Message-ID: <1360681295.4545.165.camel@revolution.hippie.lan> Mime-Version: 1.0 X-Mailer: Evolution 2.32.1 FreeBSD GNOME Team Port Content-Transfer-Encoding: 7bit Cc: freebsd-arm@FreeBSD.org, Brett Wynkoop X-BeenThere: freebsd-arm@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Porting FreeBSD to the StrongARM Processor List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 Feb 2013 15:02:05 -0000 On Mon, 2013-02-11 at 22:38 -0800, Tim Kientzle wrote: > On Feb 11, 2013, at 4:45 PM, Ian Lepore wrote: > > > On Mon, 2013-02-11 at 19:06 -0500, Brett Wynkoop wrote: > >> Greeting- > >> > >> While building a kernel the Bone stopped responding on the net and this > >> is what I found on the console: > >> > >> ti_mmchs0: Error: current cmd NULL, already done? > >> swap_pager: indefinite wait buffer: bufobj: 0, blkno: 5534, size: 8192 > > [...] > >> ti_mmchs0: Error: current cmd NULL, already done? > >> ifaddr cache = 0xc1fbd700 is deleted > >> ti_mmchs0: Error: current cmd NULL, already done? > >> ti_mmchs0: Error: current cmd NULL, already done? > >> > >> The interesting thing is I have seen this same swap_pager error message > >> on my 32 bit x86 FreeBSD 9 box when it drops it's IDE disks and goes > >> off the net as well. > >> > >> The last I saw of the kernel recompile it was at linking just before it > >> locked up. > >> > >> Ideas? > > > > That's the second report recently of indefinite wait buffer. What it > > really means is that it has been waiting more than 20 seconds to pull a > > page (or block of pages) in from swap. That plus the cmd NULL errors > > tend to point in the direction of the mmchs driver. > > This is something that's degraded fairly recently. > I only started seeing these in the last week or so. > > And the BeagleBone MMCHS driver has not been > touched in a very long time. Hrm, good point. What has been touched recently that might be related to this? This first thing that comes to mind is the recent change for allocating kmem map space proportional to the available ram. I wonder if that has led to some unexpected under- or over-allocation of buffer resources, which, when combined with slow IO, leads to unusually long waits for swapping? (Purely guessing here, but I have found in the past that playing with tuning NBUF in kernel config can lead to some odd lockups and panics.) -- Ian