Date: Wed, 19 Feb 2003 20:04:47 -0800 From: Terry Lambert <tlambert2@mindspring.com> To: Lars Eggert <larse@ISI.EDU> Cc: current@freebsd.org, Craig Boston <craig@xfoil.gank.org>, Poul-Henning Kamp <phk@critter.freebsd.dk> Subject: Re: panic starting gnome Message-ID: <3E5453DF.E0E47477@mindspring.com> References: <3E52BB14.2040309@isi.edu> <3E532F61.653A09B0@mindspring.com> <3E5408B0.9030300@isi.edu>
next in thread | previous in thread | raw e-mail | index | archive | help
Lars Eggert wrote: > Terry Lambert wrote: > > Debug: > > > [excellent kernel-debugging recipe snipped] > > Here's a backtrace of a crashdump that should be more helpful: [ ... ] > (kgdb) up 12 > #12 0xc02098a4 in namei (ndp=0x9e) at /usr/src/sys/kern/vfs_lookup.c:158 > 158 FILEDESC_LOCK(fdp); > (kgdb) list > 153 #endif > 154 > 155 /* > 156 * Get starting point for the translation. > 157 */ > 158 FILEDESC_LOCK(fdp); > 159 ndp->ni_rootdir = fdp->fd_rdir; > 160 ndp->ni_topdir = fdp->fd_jdir; > 161 > 162 dp = fdp->fd_cdir; > > (kgdb) print ndp > $2 = (struct nameidata *) 0x9e > > (kgdb) print fdp > $1 = (struct filedesc *) 0x34 > (kgdb) > > (kgdb) print p > $3 = (struct proc *) 0x0 > > (kgdb) print td > $5 = (struct thread *) 0xc662d1e0 > > (kgdb) print *td > $7 = {td_proc = 0xc66307f0, > [...] > > Very strange. namei() does essentially the following: > > p = td->td_proc; > fdp = p->p_fd; > > td->td_proc seems reasonable, but p is 0. No idea how this could happen, > any guesses? Cool. This is not where I was guessing it was at, at all. 8-) 8-). There's a commit that Alfred made last Friday night that might have something to do with it. It was an attempt to fix a lock order reversal between "PROC/filedesc", according to the commit, and it introduced "fdesc_mtx". If you grep for that everywhere, and then annotate the involved files, it should be pretty obvious which changes to revert to see if this is the case (1.50->1.49 of /sys/sys/filedesc.h, etc.). It may also be an issue with some of the recent KSE commits over the last weekend missing an assignment on a context switch. Probably the easiest thing to do, if you can repeat the problem reliably, is to bsearch, starting 8 days days ago, for the commit that broke the camel's back. It's really tempting to make a script that's capable of carrying out a /usr/src/sys bsearch semi-automatically, because people are really hesistant to use this approach for solving problems, even though it only requires O(log2(N)) reboots to find it... -- Terry To Unsubscribe: send mail to majordomo@FreeBSD.org with "unsubscribe freebsd-current" in the body of the message
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?3E5453DF.E0E47477>