From owner-freebsd-questions@FreeBSD.ORG Sat Feb 26 05:54:00 2005 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.FreeBSD.org (mx1.freebsd.org [216.136.204.125]) by hub.freebsd.org (Postfix) with ESMTP id 0E09A16A4D0 for ; Sat, 26 Feb 2005 05:54:00 +0000 (GMT) Received: from shell.reiteration.net (82-34-179-228.cable.ubr01.sout.blueyonder.co.uk [82.34.179.228]) by mx1.FreeBSD.org (Postfix) with ESMTP id 1657D43D93 for ; Sat, 26 Feb 2005 05:28:56 +0000 (GMT) (envelope-from lists@reiteration.net) Received: from [127.0.0.1] (helo=reiteration.net) by shell.reiteration.net with esmtp (Exim 4.44 (FreeBSD)) id 1D4ubK-000GDC-Lx for freebsd-questions@FreeBSD.org; Sat, 26 Feb 2005 05:34:50 +0000 From: "John" To: freebsd-questions@FreeBSD.org Date: Sat, 26 Feb 2005 05:34:48 +0000 Message-Id: <20050226052037.M6206@reiteration.net> In-Reply-To: <003801c51b2b$1deecc60$04cf589d@simula.eis.uva.es> References: <003801c51b2b$1deecc60$04cf589d@simula.eis.uva.es> X-Mailer: Open WebMail 2.50 20050106 X-OriginatingIP: 192.168.1.7 (jfm) MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-1 X-SA-Exim-Connect-IP: 127.0.0.1 X-SA-Exim-Mail-From: lists@reiteration.net X-SA-Exim-Scanned: No (on shell.reiteration.net); SAEximRunCond expanded to false Subject: Re: I killed my system with grep X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.1 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Sat, 26 Feb 2005 05:54:01 -0000 On Fri, 25 Feb 2005 12:14:04 +0100, Ramiro Aceves wrote > Hello FreeBSD friends: > > I am running a FreeBSD 5.3 system with 64MB RAM and 150 MB swap. > > Yesterday I entered the command: > > # grep -R something / > > and after a while, my system did not respond. I do not remember the exact > messages as I am on a winbugs at the University. The error was about > swapping. I could switch among terminals but the system was dead. I needed > to reboot. > > I rebooted and tried again watching "top" output and I could see as swap > usage was incresing very quickly until it ran out of swap space and > the swap pager failed. > > Was my sytem dead? or, is it possible to recover from that state without > rebooting? How is it possible that a simple command like this could > auto-kill the machine? > > What is the recomended fix for this?: > > a- Asigning more swap. > b- Not executing that command anymore. I had a very similar problem. I fixed it by reading up on auxillary swapfile. I set one of those up as indicated, at about 2.5x RAM (750MB, roughly) and the problem went away after I edited /etc/fstab and commented out the old swap entry, then rebooted. The problem was caused by the hard drive having failing blocks where the swap partition resides. I got a few errors in /var/log/messages where the swap daemon couldnt zero pages or inodes or whatever. If the swapper cant liberate old pages then swap will fill up. My machine uses a lot of perl, and openwebmail makes it swap under load. It seems that the bad blocks are only on the old swap so hopefully the hard drive still has some life in it yet. -- lists@reiteration.net