From owner-freebsd-current@FreeBSD.ORG Mon Mar 1 15:56:42 2004 Return-Path: Delivered-To: freebsd-current@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 2826516A4CE for ; Mon, 1 Mar 2004 15:56:42 -0800 (PST) Received: from root.org (root.org [67.118.192.226]) by mx1.FreeBSD.org (Postfix) with SMTP id D28B043D1D for ; Mon, 1 Mar 2004 15:56:41 -0800 (PST) (envelope-from nate@root.org) Received: (qmail 8755 invoked by uid 1000); 1 Mar 2004 23:56:43 -0000 Date: Mon, 1 Mar 2004 15:56:43 -0800 (PST) From: Nate Lawson To: Joe Marcus Clarke In-Reply-To: <1078183073.779.40.camel@gyros> Message-ID: <20040301155505.F8753@root.org> References: <20040228152618.C98870@root.org> <1078019158.18071.12.camel@shumai.marcuscom.com> <1078020761.18071.15.camel@shumai.marcuscom.com> <1078032512.20048.7.camel@shumai.marcuscom.com> <1078102552.62463.64.camel@shumai.marcuscom.com> <20040229170401.Q3406@root.org> <1078183073.779.40.camel@gyros> MIME-Version: 1.0 Content-Type: TEXT/PLAIN; charset=US-ASCII cc: current@freebsd.org Subject: Re: mozilla hanging on gconfd2 startup? X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 01 Mar 2004 23:56:42 -0000 On Mon, 1 Mar 2004, Joe Marcus Clarke wrote: > On Sun, 2004-02-29 at 20:07, Nate Lawson wrote: > > On Sun, 29 Feb 2004, Joe Marcus Clarke wrote: > > > On Sun, 2004-02-29 at 19:53, Nate Lawson wrote: > > > > On Sun, 29 Feb 2004, Joe Marcus Clarke wrote: > > > > > > Nope. Still get this hanging in "select": > > > > > > 1000 33625 1 0 76 0 6932 5444 select S ?? 0:00.10 /usr/X11R6/libexec/gconfd-2 12 > > > > > > > > > > Could you break into this with gdb, and get a back trace just to see > > > > > what this guy is trying to do? Thanks. > > > > > > > > (gdb) bt > > > > #0 0x28313397 in poll () from /lib/libc.so.5 > > > > #1 0x281331a1 in _thread_kern_sched_state_unlock () from /usr/lib/libc_r.so.5 > > > > #2 0x28132be1 in _thread_kern_scheduler () from /usr/lib/libc_r.so.5 > > > > > > > > > I know I've seen gconfd hang when starting up on -CURRENT with an > > > > > NFS-mounted home if rpc.lockd wasn't running on the server. I've also > > > > > seen problems where the local hostname wasn't resolvable or if there was > > > > > a permissions problem on /tmp or /var/tmp. > > > > > > > > No NFS mounts. The local hostname is not resolvable. A tcpdump shows > > > > this: > > > > > > > > tcpdump: listening on fxp0 > > > > 16:50:35.000326 laptop.49457 > mydns.53: 60862+ A? laptop.example.org. (36) > > > > 16:50:35.067216 mydns.53 > laptop.49457: 60862 NXDomain* 0/1/0 (94) > > > > 16:50:35.067602 laptop.49458 > mydns.53: 60863+ A? laptop. (24) > > > > 16:50:35.206926 mydns.53 > laptop.49458: 60863 NXDomain 0/1/0 (99) > > > > 16:50:35.209422 laptop.49459 > mydns.53: 60864+ A? laptop.example.org. (36) > > > > 16:50:35.242605 mydns.53 > laptop.49459: 60864 NXDomain 0/1/0 (105) > > > > 16:50:35.242745 laptop.49460 > mydns.53: 60865+ A? laptop. (24) > > > > 16:50:35.408390 mydns.53 > laptop.49460: 60865 NXDomain 0/1/0 (99) > > > > 16:50:35.410527 laptop.49461 > mydns.53: 60866+ A? laptop.example.org. (36) > > > > 16:50:35.477876 mydns.53 > laptop.49461: 60866 NXDomain 0/1/0 (105) > > > > 16:50:35.478001 laptop.49462 > mydns.53: 60867+ A? laptop. (24) > > > > 16:50:35.634809 mydns.53 > laptop.49462: 60867 NXDomain 0/1/0 (99) > > > > > > > > So it does appear that the hostname is the issue. However, on a mozilla > > > > 1.5 and a -current of a few weeks ago, this was not a problem. So what > > > > changed? > > > > > > The gconf "dependency" was most likely added in 1.6. The solution to > > > this is to add your local hostname to /etc/hosts (that's the common > > > answer to GNOME users encountering this lock in gconfd). If you don't > > > want to do that, the pref trick is another valid workaround, but I think > > > the hostname thing makes more sense. > > > > Attempts to add the hostname as 127.0.0.1 and the actual IP resulted in a > > hang in the same place. A tcpdump of both lo0 and fxp0 show no more DNS > > queries so the hosts entry was definitely being used. A telnet to the > > hostname also shows that the lookup succeeds. Since there were no > > segments of TCP or UDP, I'm at a loss why it is blocking still. > > I'm not able to reproduce this after upgrading to today's -CURRENT, and > rebuilding Mozilla, ORBit2, and gconf2. I'm using ULE and libpthread. > Perhaps tomorrow I'll try it with libc_r (I assume you're using libc_r > since you have the nVidia drivers?). I am using libc_r although I thought the default had been switched a while ago. But I'm only doing the buildword/installworld/mergemaster process (no libmap.conf) so perhaps I need to do something to move to libpthread. I have no nvidia drivers. -Nate