From owner-freebsd-arm@FreeBSD.ORG Tue May 12 13:49:37 2015 Return-Path: Delivered-To: freebsd-arm@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1.2 with cipher AECDH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id 8B29725F for ; Tue, 12 May 2015 13:49:37 +0000 (UTC) Received: from pmta2.delivery6.ore.mailhop.org (pmta2.delivery6.ore.mailhop.org [54.200.129.228]) by mx1.freebsd.org (Postfix) with SMTP id 6E48E1FB5 for ; Tue, 12 May 2015 13:49:37 +0000 (UTC) Received: from ilsoft.org (unknown [73.34.117.227]) by outbound2.ore.mailhop.org (Halon Mail Gateway) with ESMTPSA; Tue, 12 May 2015 13:48:35 +0000 (UTC) Received: from revolution.hippie.lan (revolution.hippie.lan [172.22.42.240]) by ilsoft.org (8.14.9/8.14.9) with ESMTP id t4CDmSFw001133; Tue, 12 May 2015 07:48:29 -0600 (MDT) (envelope-from ian@freebsd.org) Message-ID: <1431438508.6170.258.camel@freebsd.org> Subject: Re: state of FreeBSD ARM (less stable than 6 months ago) From: Ian Lepore To: Ralf Wenk Cc: freebsd-arm@freebsd.org Date: Tue, 12 May 2015 07:48:28 -0600 In-Reply-To: References: <5550C252.6030001@foxvalley.net> <1431357226.2428197.265704673.6A544F74@webmail.messagingengine.com> <555177D9.8080001@foxvalley.net> Content-Type: text/plain; charset="us-ascii" X-Mailer: Evolution 3.12.10 FreeBSD GNOME Team Port Mime-Version: 1.0 Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-arm@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: "Porting FreeBSD to ARM processors." List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 12 May 2015 13:49:37 -0000 On Tue, 2015-05-12 at 08:45 +0200, Ralf Wenk wrote: > On Mon, 11 May 2015, at 21:47:37, Dan Raymond wrote: > > On 5/11/2015 9:13 AM, Mark Felder wrote: > > > On Mon, May 11, 2015, at 09:53, Dan Raymond wrote: > > >> I've been running an email and web server using FreeBSD 11 on a > > >> Raspberry Pi B+ since November. It has crashed 3 times since then > > >> (roughly every two months). I'm currently running r277334. I thought > > >> I'd try the latest build to see if stability has improved. I purchased a > > >> Raspberry Pi 2 and used the latest crochet to built r282738. No > > >> problems building it and it booted up fine. However, it crashes about > > >> an hour into building some ports I use for my server (nginx, php, > > >> etc.). I tried twice last night and it crashed both times. Is anybody > > >> looking into these stability issues? > > >> > > > RPi2 support is something like less than a week old for SMP and DMA > > > transport. I'm not sure more than a handful of people have actually > > > tried it yet. The bugs here will be worked out in time, but if you have > > > any core dumps or info that can assist in tracking down issues you're > > > experiencing that would certainly be appreciated. > > > > > > > These panics always seem to be mmcsd related. I doubt it has anything > > to do with RPi2 or SMP. > > > > sdhci_bcm0-slot0: Controller timeout > > sdhci_bcm0-slot0: ============== REGISTER DUMP ============== > > sdhci_bcm0-slot0: Sys addr: 0x4d295a00 | Version: 0x00009902 > > sdhci_bcm0-slot0: Blk size: 0x00000200 | Blk cnt: 0x00000020 > > sdhci_bcm0-slot0: Argument: 0x002d19c0 | Trn mode: 0x0000193a > > sdhci_bcm0-slot0: Present: 0x01ff0506 | Host ctl: 0x00000003 > > sdhci_bcm0-slot0: Power: 0x0000000f | Blk gap: 0x00000000 > > sdhci_bcm0-slot0: Wake-up: 0x00000000 | Clock: 0x00000507 > > sdhci_bcm0-slot0: Timeout: 0x0000000e | Int stat: 0x00000010 > > sdhci_b > > > > > > > > mmcsd0: Error indicated: 1 Timeout > > g_vfs_done():mmcsd0s2a[WRITE(offset=1460830208, length=24576)]error = 5 > > panic: No b_bufobj 0xd767ca00 > > cpuid = 1 > > KDB: enter: panic > > [ thread pid 12 tid 100013 ] > > Stopped at $d.7: ldrb r15, [r15, r15, ror r15]! > > db> > > I see such panics every two to three months. They happen on a RPi B > and RPi B+ as well. I have tried different the SD-Cards on the B and > the B+ of course. So I think it is not related to SD-card, manufacturer > or RPi board. > > Usually they happen in the middle of the night when syslogd(8) tries to > write something. I have never seen them happen when the RPi has some work > to do, e.g. is compiling a port. > > Continuing out of the debugger prints the usual messages, but on reboot > the RPi freeze. Only a power cycle will get it back to operating. > > Very often after such a panic happened my RPi gets "unstable" and panics > within the next 48 hours again with the same cause. I found out that, if > that happened and I force an fsck ignoring the journal there will be some > minor issue fixed and the RPi is stable again. For the next 2 or 3 months. > > > Ralf IMO, the moral of that story is: Never use softupdates with journaling enabled. For years there have been reports on the mailing lists of fsck failures when journaling is enabled (not arm-specific). Sometimes a few months goes by without a report and you wonder if it got fixed with some checkin you didn't notice, then the reports crop up again. My conclusion is that journalling has never really worked right. The only advantage of journaling is to speed up fsck on huge filesystems. An sdcard with a handful of GB isn't huge. -- Ian