From: "Chad Leigh -- Shire.Net LLC"
Date: Mon, 22 May 2006 15:25:19 -0600
To: FreeBSD Questions Mailing List
Subject: Re: system freezes.

> At 10:56 AM 5/22/2006, Brent Rieck wrote:
>> Hello,
>> I've been having some freeze problems with my "managed" freebsd server
>> that my host has been less than helpful with; I hope that this is the
>> right place to ask the questions I have.
>>
>> os: freebsd 4.8-stable
>> major applications: apache 1.3.29 + php 4.3.10, mysql 4.1.18-log,
>> dirvish, riff-backup
>>
>> Machine freezes with nothing written to the logs or console - if you
>> happen to be logged in and running top when it "starts" to freeze your
>> top session will run completely normally and without lag (spacebar
>> refreshes display, you can resort on size or cpu, etc), but no other
>> processes can start - typing a command into another open shell will not
>> start that program. Until it fully freezes it will echo characters back
>> in the shell - and top will continue to run as normal. Top always shows
>> a load of <0.1, there's always 5MB to 50MB of ram free.
>>
>> All of the hardware has been replaced (motherboard, cpu, ram, power
>> supply, hard drive)
>>
>> I can't make it freeze on demand by replaying the web hits or database
>> queries that occurred before the crash.
>>
>> I am able to make it freeze on demand by slurping down a particular
>> dirvish vault with rsync. The freeze symptoms are the same as the
>> random freeze symptoms (top responds normally, new processes can't
>> start)
>>
>> The random freezes occur whether or not I'm running dirvish on a
>> schedule.
>>
>> The rsync freezing I can work around if needed, the random freezes I
>> cannot. Does anybody have any suggestions on how I might track down the
>> problem?
>>
>> thanks,
>> Brent

This sounds like some I/O request is not finishing, and other processes
are getting stuck in a queue behind the process with the "stuck" I/O.
I have a similar issue on 5.3 through 6.0 that has been bedeviling me,
very infrequently, with some md file-backed images mounted as /dev/md*
devices.

Chad

---
Chad Leigh -- Shire.Net LLC
Your Web App and Email hosting provider
chad at shire.net
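
One way to test the stuck-I/O theory above is to look for processes sitting in disk wait (state "D" in ps(1)) and note which wait channel they are blocked on. What follows is only a rough sketch, assuming Python is available on the machine; the function names are made up for illustration. Since new processes cannot start once the freeze sets in, it would have to be sampled before that point (for example while the rsync of the problem vault is starting), and the last sample taken is the interesting one.

```python
import subprocess


def sample_ps():
    """Run ps and return its text output (pid, state, wait channel, command)."""
    result = subprocess.run(
        ["ps", "-axo", "pid,state,wchan,command"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def find_disk_wait(ps_output):
    """Return (pid, state, wchan, command) for each process in state D.

    ps marks a process in disk (or other short-term, uninterruptible)
    wait with a state string that starts with "D".
    """
    stuck = []
    for line in ps_output.splitlines()[1:]:  # skip the header line
        fields = line.split(None, 3)         # command may contain spaces
        if len(fields) < 4:
            continue                         # ignore short or blank lines
        pid, state, wchan, command = fields
        if state.startswith("D"):
            stuck.append((int(pid), state, wchan, command))
    return stuck


# Example use (not run here):
#     for entry in find_disk_wait(sample_ps()):
#         print(entry)
```

If the same process and wait channel keep showing up in the samples leading into a freeze, that would point at the I/O path it is blocked on. The STATE column of an already-running top gives a similar, if coarser, view.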