From owner-freebsd-current@FreeBSD.ORG Mon Jan 23 21:00:03 2006 Return-Path: X-Original-To: freebsd-current@freebsd.org Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 1B22C16A420 for ; Mon, 23 Jan 2006 21:00:03 +0000 (GMT) (envelope-from jhb@freebsd.org) Received: from speedfactory.net (mail6.speedfactory.net [66.23.216.219]) by mx1.FreeBSD.org (Postfix) with ESMTP id 4643043D4C for ; Mon, 23 Jan 2006 21:00:02 +0000 (GMT) (envelope-from jhb@freebsd.org) Received: from server.baldwin.cx (unverified [66.23.211.162]) by speedfactory.net (SurgeMail 3.5b3) with ESMTP id 6790125 for multiple; Mon, 23 Jan 2006 16:01:07 -0500 Received: from localhost (john@localhost [127.0.0.1]) by server.baldwin.cx (8.13.4/8.13.4) with ESMTP id k0NKxsoB003319; Mon, 23 Jan 2006 15:59:56 -0500 (EST) (envelope-from jhb@freebsd.org) From: John Baldwin To: thierry@herbelot.com Date: Mon, 23 Jan 2006 15:51:07 -0500 User-Agent: KMail/1.9.1 References: <200601040806.37953.thierry@herbelot.com> <200601040838.49663.jhb@freebsd.org> <200601210705.11539.thierry@herbelot.com> In-Reply-To: <200601210705.11539.thierry@herbelot.com> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: quoted-printable Content-Disposition: inline Message-Id: <200601231551.08474.jhb@freebsd.org> X-Virus-Scanned: ClamAV 0.87.1/1247/Sat Jan 21 05:24:51 2006 on server.baldwin.cx X-Virus-Status: Clean X-Spam-Status: No, score=-1.4 required=4.2 tests=ALL_TRUSTED autolearn=failed version=3.1.0 X-Spam-Checker-Version: SpamAssassin 3.1.0 (2005-09-13) on server.baldwin.cx X-Server: High Performance Mail Server - http://surgemail.com r=1653887525 Cc: freebsd-current@freebsd.org Subject: Re: panic: spin lock held too long (while rebooting) X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 23 Jan 2006 21:00:03 -0000 On Saturday 21 January 2006 01:05, Thierry Herbelot wrote: > Le Wednesday 4 January 2006 14:38, John Baldwin a =E9crit : > > On Wednesday 04 January 2006 02:06 am, Thierry Herbelot wrote: > > [SNIP previous similar panic] > > > Next time you get this, can you use 'show threads' to figure out the tid > > for the thread whose pointer is in the printf (0xc16de480 in this case) > > and then do a trace of that thread? > > Hello, > > Here is a more detailed crash session : > > is this (zomb) problematic ? (in ps) : > 8 c182e228 0 1 0 0002204 zomb[INACTIVE] g_mirror gm0s1 > > I keep the machine in DDB, if there are more detailed commands to > investigate the panic (the machine is an SMP BP6, runs a GENERIC current > kernel, and stores its local files in two g_mirror partitions). > > The problematic spinlock is held by 0xc16de340 which is cpustop_handler. > > TfH > > PS : printout of the crash : > > # reboot > Waiting (max 60 seconds) for system process `vnlru' to stop...done > Waiting (max 60 seconds) for system process `bufdaemon' to stop...done > Waiting (max 60 seconds) for system process `syncer' to stop... > Syncing disks, vnodes remaining...3 2 2 2 0 0 done > All buffers synced. > Uptime: 39m52s > GEOM_MIRROR: Device files1: provider mirror/files1 destroyed. > GEOM_MIRROR: Device files1 destroyed. > GEOM_MIRROR: Device gm0s1: provider mirror/gm0s1 destroyed. > GEOM_MIRROR: Device gm0s1 destroyed. > Rebooting... > cpu_reset: Stopping other CPUs > spin lock sched lock held by 0xc16de340 for > 5 seconds > panic: spin lock held too long Ok, it's not a fatal panic in that your disks should already be clean at th= is=20 point, etc. You can try this hack to see if it fixes it: Index: vm_machdep.c =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D= =3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D=3D RCS file: /usr/cvs/src/sys/i386/i386/vm_machdep.c,v retrieving revision 1.267 diff -u -r1.267 vm_machdep.c =2D-- vm_machdep.c 14 Nov 2005 00:43:44 -0000 1.267 +++ vm_machdep.c 23 Jan 2006 20:49:21 -0000 @@ -533,6 +533,7 @@ ; /* Wait for other cpu to see that we've started */ stop_cpus((1< <>< http://www.FreeBSD.org/~jhb/ "Power Users Use the Power to Serve" =3D http://www.FreeBSD.org