From owner-freebsd-stable@FreeBSD.ORG Mon Mar 24 15:50:36 2014 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [8.8.178.115]) (using TLSv1 with cipher ADH-AES256-SHA (256/256 bits)) (No client certificate requested) by hub.freebsd.org (Postfix) with ESMTPS id A319A969; Mon, 24 Mar 2014 15:50:36 +0000 (UTC) Received: from hydra.pix.net (hydra.pix.net [IPv6:2001:470:e254::4]) (using TLSv1 with cipher DHE-RSA-AES256-SHA (256/256 bits)) (No client certificate requested) by mx1.freebsd.org (Postfix) with ESMTPS id 766F15F8; Mon, 24 Mar 2014 15:50:36 +0000 (UTC) Received: from mail.distal.com (mail.distal.com [IPv6:2001:470:e24c:200::ae25]) (authenticated bits=0) by hydra.pix.net (8.14.5/8.14.5) with ESMTP id s2OFoRZc068476; Mon, 24 Mar 2014 11:50:34 -0400 (EDT) (envelope-from cross+freebsd@distal.com) X-Virus-Status: Clean X-Virus-Scanned: clamav-milter 0.98 at mail.pix.net Received: from zalamar.mm-corp.net ([65.207.51.214]) (authenticated bits=0) by mail.distal.com (8.14.8/8.14.8) with ESMTP id s2OFn5Rx001454 (version=TLSv1/SSLv3 cipher=AES128-SHA bits=128 verify=NO); Mon, 24 Mar 2014 11:49:06 -0400 (EDT) (envelope-from cross+freebsd@distal.com) From: Chris Ross Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: quoted-printable Subject: 10-stable sparc64 boot problems Message-Id: <9800ED26-6E2E-42F0-9641-3B9EDF653CE6@distal.com> Date: Mon, 24 Mar 2014 11:48:42 -0400 To: freebsd-sparc64@freebsd.org, freebsd-stable@freebsd.org Mime-Version: 1.0 (Mac OS X Mail 7.1 \(1827\)) X-Mailer: Apple Mail (2.1827) X-Greylist: Sender succeeded SMTP AUTH, not delayed by milter-greylist-4.4.3 (mail.distal.com [206.138.151.250]); Mon, 24 Mar 2014 11:49:06 -0400 (EDT) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.17 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 24 Mar 2014 15:50:36 -0000 I just updated my 10-stable sparc64 (Sun Fire v240) to a 10-stable = kernel from revision 263676, which was the first one I found after the = numerous failures over the weekend in route6d. Reboot into single-user = for mergemaster and install world, I attempted to reboot into multi-user = the first two attempts yielded: Trying to mount root from zfs:zroot []... Setting hostuuid: 94588820-cd20-11e1-b15b-0003bae34047. Setting hostid: 0x4f9a5776. Entropy harvesting: interrupts ethernet point_to_point swi. Starting file system checks: Mounting local file systems:. Writing entropy file:. Setting hostname: hostname.distal.com. bge0: link state changed to DOWN spin lock 0xc0c61cb0 (smp rendezvous) held by 0xfffff800054dcdb0 (tid = 100328) too long timeout stopping cpus panic: spin lock held too long cpuid =3D 1 KDB: stack backtrace: #0 0xc051fcf0 at _mtx_lock_spin_failed+0x50 #1 0xc051fdb8 at _mtx_lock_spin_cookie+0xb8 #2 0xc088771c at tick_get_timecount_mp+0xdc #3 0xc0541efc at binuptime+0x3c #4 0xc08513cc at timercb+0x6c #5 0xc0887a80 at tick_intr+0x220 Uptime: 23s Automatic reboot in 15 seconds - press a key on the console to abort Both were the same, except 27s uptime in one case and 23s in the = other. The next reboot went all the way to multiuser, and appears to be = operating normally. At least, for the first 5 minutes. I=92ll keep an eye on it. But, is this possibly related to the bge0 = device driver, or is this more likely to be a problem in the = sparc/sparc64 code not related to a specific device? The prior kernel that had been running without this problem was: FreeBSD 10.0-STABLE #6 r261083: Thu Jan 23 17:54:24 EST 2014 Just wanted to see if anyone had any thoughts, and I=92ll hope the = machine stays operational now that it=92s up and running=85 - Chris