From owner-freebsd-stable@FreeBSD.ORG Sat Jul 15 04:08:39 2006
Date: Sat, 15 Jul 2006 01:08:36 -0300 (ADT)
From: User Freebsd
To: Kostik Belousov
In-Reply-To: <20060715035308.GJ32624@deviant.kiev.zoral.com.ua>
Message-ID: <20060715010607.L1799@ganymede.hub.org>
References: <20060705100403.Y80381@fledge.watson.org>
 <20060705234514.I70011@fledge.watson.org>
 <20060715000351.U1799@ganymede.hub.org>
 <20060715035308.GJ32624@deviant.kiev.zoral.com.ua>
Cc: freebsd-stable@freebsd.org, Robert Watson, Michel Talon, Francisco Reyes
Subject: Re: vm_map.c lock up (Was: Re: NFS Locking Issue)
List-Id: Production branch of FreeBSD source code

On Sat, 15 Jul 2006, Kostik Belousov wrote:

> On Sat, Jul 15, 2006 at 12:10:29AM -0300, User Freebsd wrote:
>>
>>
>> On Wed, 5 Jul 2006, Robert Watson wrote:
>>
>>> If you can get into DDB when the hang has occurred, output via serial
>>> console for the following commands would be very helpful:
>>>
>>> show pcpu
>>> show allpcpu
>>> ps
>>> trace
>>> traceall
>>> show locks
>>> show alllocks
>>> show uma
>>> show malloc
>>> show lockedvnods
>>
>> 'k, after 16 days uptime, the server that I got all the debugging turned
>> on for finally hung up solid ... I was able to break into DDB over the
>> serial link, and have run all of the above on it ... and the output is
>> attached ...
>>
>> One thing to note is that the ps listing is not complete ... there are >6k
>> processes running at the time, and I don't know how to get rid of the
>> '--more--' prompt :( After 1k processes, I just hit 'q' and went onto the
>> other commands ...
> set lines=0
>>
>> Also, traceall gave me a 'No such command' error ... now that I think
>> about it, my luck, it was supposed to be 'trace all'?
> It is alltrace.
>>
>> If this doesn't provide enough information, please let me know what else I
>> should do the next time through, besides the above commands ...
> Missing alltrace output seems to be critical. If this is not feasible,
> please, provide at least the output of the bt for each pid
> shown in the "show lockedvnods" and "show alllocks". In your case,
> bt 64880 was the most interesting. It is a pity that you had reset the
> machine.

Was down for too long as it was ... it, of course, happened while I was out
with the family :(  Will keep all of this in mind next time I get a chance
to run through things ...

Any idea why 'panic' doesn't produce core like it used to?

> Just in case, do you use mlocked mappings ?
> Also, why is there such a huge number of
> crons in the system ? They are all forking now. It may be (can not
> say definitely without further investigation) just a fork bomb.

mlocked mappings?  What are they? :)

re: crons ... this, I'm not sure of, but my suspicion was that the crons
weren't able to complete, since the file system was locked up, but the
next one was being attempted to run ... *shrug*

----
Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
Email . scrappy@hub.org                              MSN . scrappy@hub.org
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664