From owner-freebsd-stable@freebsd.org Thu Jul 2 15:00:36 2015 Return-Path: Delivered-To: freebsd-stable@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 611AF9925A5 for ; Thu, 2 Jul 2015 15:00:36 +0000 (UTC) (envelope-from gjb@FreeBSD.org) Received: from freefall.freebsd.org (freefall.freebsd.org [IPv6:2001:1900:2254:206c::16:87]) by mx1.freebsd.org (Postfix) with ESMTP id 3CC98225A; Thu, 2 Jul 2015 15:00:36 +0000 (UTC) (envelope-from gjb@FreeBSD.org) Received: from FreeBSD.org (freefall.freebsd.org [IPv6:2001:1900:2254:206c::16:87]) by freefall.freebsd.org (Postfix) with ESMTP id 9B5661D0B; Thu, 2 Jul 2015 15:00:35 +0000 (UTC) (envelope-from gjb@FreeBSD.org) Date: Thu, 2 Jul 2015 15:00:33 +0000 From: Glen Barber To: Kurt Lidl Cc: Chris Ross , freebsd-stable@freebsd.org Subject: Re: New FreeBSD snapshots available: stable/10 (20150625 r284813) Message-ID: <20150702150033.GE53770@FreeBSD.org> References: <56A9EB91-2F97-4096-99C8-26D3EFC13D2D@distal.com> <20150701023640.GM5423@FreeBSD.org> <29FAA191-D0E5-4127-B016-65B4AE42ABE8@distal.com> <20150701025433.GN5423@FreeBSD.org> <5593D9ED.4080402@pix.net> <55940879.8060604@pix.net> <55946220.5050109@pix.net> <20150701234131.GF31841@FreeBSD.org> <55955010.5000605@pix.net> MIME-Version: 1.0 Content-Type: multipart/signed; micalg=pgp-sha256; protocol="application/pgp-signature"; boundary="IU5/I01NYhRvwH70" Content-Disposition: inline In-Reply-To: <55955010.5000605@pix.net> X-Operating-System: FreeBSD 11.0-CURRENT amd64 X-SCUD-Definition: Sudden Completely Unexpected Dataloss X-SULE-Definition: Sudden Unexpected Learning Event X-PEKBAC-Definition: Problem Exists, Keyboard Between Admin/Computer User-Agent: Mutt/1.5.23 (2014-03-12) X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Thu, 02 Jul 2015 15:00:36 -0000 --IU5/I01NYhRvwH70 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Thu, Jul 02, 2015 at 10:52:00AM -0400, Kurt Lidl wrote: > >Kurt, can you re-enable the ipv6 line in rc.conf(5), and add '-tso6' to > >your rc.conf(5) lines? > > > > ifconfig_bge0=3D"DHCP" > > ifconfig_bge0_ipv6=3D"inet6 accept_rtadv -tso6" > > >=20 > I tried this, and it panic'd in the same manner. (Note - I've upgraded > this machine to the second 10.2-PRELEASE build.) >=20 Okay, thank you for testing. The last commits that I see specifically referencing this bge(4) model were a long time ago, but TSO was mentioned. It was worth a shot. > [...] >=20 > I've also seen (now that it's been running a bit longer), a couple of > other occurrences of the "spin lock held too long" panic. So while > having the IPv6 configuration in /etc/rc.conf causes this crash to > occur most of the time on boot, the same crash occurs at other times > too, which don't appear to IPv6 related. >=20 Can you update the PR with this information, please? > 1) when making the requested change, I editted my /etc/rc.conf file, > and then issued "reboot". The machine panic'd during the reboot > processing: >=20 > root@spork:~ # reboot > Jul 2 09:48:53 spork reboot: rebooted by root > Jul 2 09:48:53 spork syslogd: exiting on signal 15 > Waiting (max 60 seconds) for system process `vnlru' to stop...done > Waiting (max 60 seconds) for system process `bufdaemon' to stop...done > Waiting (max 60 seconds) for system process `syncer' to stop... > Syncing disks, vnodes remaining...0 0 0 0 done > All buffers synced. > Uptime: 14h34m16s > GEOM_MIRROR: Device gswap: provider mirror/gswap destroyed. > GEOM_MIRROR: Device gswap destroyed. > pid 1 (init), uid 0: exited on signal 4 > spin lock 0xc0cba338 (smp rendezvous) held by 0xfffff8000bbbe920 (tid > 100367) too long > timeout stopping cpus > panic: spin lock held too long > cpuid =3D 1 > KDB: stack backtrace: > #0 0xc05757c0 at panic+0x20 > #1 0xc0559250 at _mtx_lock_spin_failed+0x50 > #2 0xc0559318 at _mtx_lock_spin_cookie+0xb8 > #3 0xc08d801c at tick_get_timecount_mp+0xdc > #4 0xc05840c8 at binuptime+0x48 > #5 0xc08a400c at timercb+0x6c > #6 0xc08d8380 at tick_intr+0x220 > Uptime: 14h34m16s > Automatic reboot in 15 seconds - press a key on the console to abort > Rebooting... > timeout stopping cpus > timeout shutting down CPUs. >=20 > SC Alert: Host System has Reset >=20 > Note: the "SC Alert:" message comes the Sparc's ALOM management system, > so that's from the hardware directly, not from FreeBSD's kernel. >=20 Hmm. Any chance this could be hardware (failure) related? > It didn't crashdump, so I don't have any other backtrace other than > what I just copied here. >=20 > 2) After I rebooted with the "-tso" flag in place and it crashed, > I booted again, single user, so I could edit the /etc/rc.conf again > and manually did a savecore: >=20 Okay, thank you for checking, in any case. Glen --IU5/I01NYhRvwH70 Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v2 iQIcBAEBCAAGBQJVlVIRAAoJEAMUWKVHj+KT6hwP/Rnf7CpDzw16xZ0aVH03VRyx WsfSl3Wjb39FjKekf0+F19On5P3UzQZmHWVFV7+CvqG+wxm00L4gU650edSgZ8Gn w85P+iiNl+3aLtTq/cQzJ/sxOphDBqaMQx7Uo4bYM4SKb0TCkND4YxU7PK7tdyZY x7zJeiZNhDygGosY4xlL4M7VAHayqFlL2uxSqpU6lqYdrLjWgBsmFFU9GTEK4ttQ AeZ2OOhFueQeJUYPF7a9EdkH9CvW4cOfr8wyoJYAGGsO/3KgV9RDTLGtIfZHkMzf ASuPEzOERTm6qsdxZciR3qCTLiMA6rTaRLnJPw0g7O0gI6yKDX9xPDI+2nFMrkvc PAc2DW8gsJ318bVvRlfo6lgIbouBOnsdOZkDnGJZTCBZjt/jafXrHafx0FBCAXSz Uf+1c5VDMIQmoMDSHLg8qv1sQjt38NDjtuXguo4Fmq/PC5FDyCLTGJkuAJkZqD85 J1RI0V9Kd+d6aBhB/QUTYh1/fNp0BkHpYwcbVZgiPEBRH9o4da9X9MHCd6g9PLUk eHD+BkMqcm+mvZzfOh5x3lpLQONlNvosOO8t/ci4WCL8ujl2xnx9uf7BzK+P/3aV HIgTwGbGAVzx8MvEj1r/+GRjNOI7d62xEfLfZmr6dkGKPssFcDQunBMZRO1IPlgg MWIrt32czB81fAG+Xr1n =sliU -----END PGP SIGNATURE----- --IU5/I01NYhRvwH70--