From owner-freebsd-questions@FreeBSD.ORG Sun Oct 5 13:46:31 2008 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id B66DF1065686 for ; Sun, 5 Oct 2008 13:46:31 +0000 (UTC) (envelope-from fbsd.questions@rachie.is-a-geek.net) Received: from mail.rachie.is-a-geek.net (rachie.is-a-geek.net [66.230.99.27]) by mx1.freebsd.org (Postfix) with ESMTP id 856C18FC12 for ; Sun, 5 Oct 2008 13:46:31 +0000 (UTC) (envelope-from fbsd.questions@rachie.is-a-geek.net) Received: from localhost (mail.rachie.is-a-geek.net [192.168.2.101]) by mail.rachie.is-a-geek.net (Postfix) with ESMTP id 044EAAFBC01; Sun, 5 Oct 2008 05:46:30 -0800 (AKDT) From: Mel To: freebsd-questions@freebsd.org Date: Sun, 5 Oct 2008 15:46:27 +0200 User-Agent: KMail/1.9.7 References: <86r66v6gsj.fsf@ponoka.ab.hsia.telus.net> In-Reply-To: <86r66v6gsj.fsf@ponoka.ab.hsia.telus.net> MIME-Version: 1.0 Content-Type: text/plain; charset="iso-8859-1" Content-Transfer-Encoding: 7bit Content-Disposition: inline Message-Id: <200810051546.28440.fbsd.questions@rachie.is-a-geek.net> Cc: Dale Hagglund Subject: Re: processes hanging in _umtx_op X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sun, 05 Oct 2008 13:46:31 -0000 On Sunday 05 October 2008 02:12:44 Dale Hagglund wrote: > I can reproduce this reliably with > > $ python -c "import wx" > > I'm running 7.0-RELEASE-p2 with "uname -a" giving the following output, > suitably wrapped. > > FreeBSD ponoka.ab.hsia.telus.net > 7.0-RELEASE-p2 FreeBSD 7.0-RELEASE-p2 > #0: Wed Jun 18 07:33:20 UTC 2008 > root@i386-builder.daemonology.net:/usr/obj/usr/src/sys/GENERIC > i386 [...] > Dale Hagglund > > ------------------------- start: kdump output ------------------------- > 34453 python 1223164746.661828 CALL munmap(0x2aa00000,0xc1000) > 34453 python 1223164746.661840 RET munmap 0 > 34453 python 1223164746.662541 CALL > _umtx_op(0x283071e0,0x8,0x1,0x283071c0,0 ) Can you change scheduler to ULE and rebuild kernel? Or better yet, try 7.1-PRERELEASE, since it's good to know if this bug persists with 7.1 being close to release. It's not a 'standard answer', btw, but an educated guess, since utmx is (simplified) the kernel equivalent of pthread_(rwlock|mutex)_* and looks like it's hanging in one of those functions. If you're comfy with kernel debugging, print uap->op when it's hanging, so we know which op it's waiting on. Now, it can simply be programmer error (lock twice, unlock once), but most of the time the kernel catches this for me with EDEADLK. -- Mel Problem with today's modular software: they start with the modules and never get to the software part.