From: "Bigbrother" (bigbrother@bonbon.net)
To: freebsd-questions@freebsd.org
Date: Tue, 24 May 2005 20:12:17 +0300
Organization: Bigbrother Clustered Network
Message-Id: <200505241712.j4OHC4sO025518@bigb3server.ath.cx>
Subject: RE: VM pager read error

> I have a distributed network of systems running FreeBSD 4.10-Release,
> and periodically, I see the following errors on the console:
>
> vm_fault: pager read error, pid 1 (init)

I also have a network of many diskless boxes running FreeBSD 4.11, and every now and then I see messages like this. The process that dies varies. It is not always (init); sometimes it is (mrtg), sometimes (sshd) or (syslogd), and sometimes one of the other processes the boxes are running.

I have worked around this by running a series of crontab scripts that rlogin to every diskless box and check/restart every service that is critical for that box (e.g. syslogd, cron, sshd). Of course, if (init) dies you cannot do anything about it remotely, and you have to ask or phone someone on site to reboot that machine.

I think the problem lies in poor hardware (my network has some low-end network cards). It would be nice to find a solution to this without buying new cards. Perhaps if we could specify that some processes are never swapped out (like init), this problem would disappear, but I do not have time for such experiments.

BB

p.s. Even after people leave the office, crontab and periodic may still be executing scripts. So perhaps the machines are under high CPU load and some NFS packets are dropped, causing the process to die.

---
Give a man fire, and he'll be warm for a day; set a man on fire, and he'll be warm for the rest of his life
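
For anyone who wants to try the crontab workaround described in the message, here is a minimal sketch of what such a check/restart script could look like. It is not the script actually used on that network. The host names, the restart commands, and the use of ssh instead of rlogin are all assumptions made for illustration, as is the choice of `ps -ax -o ucomm` to list process names on the remote box.

```python
import subprocess

# Hypothetical host list; the original post does not name the diskless boxes.
HOSTS = ["diskless1", "diskless2"]

# Critical services mapped to the command assumed to restart each one.
# These commands are placeholders; adjust them to the flags each box needs.
SERVICES = {
    "syslogd": "/usr/sbin/syslogd",
    "cron": "/usr/sbin/cron",
    "sshd": "/usr/sbin/sshd",
}


def run_remote(host, command):
    """Run command on host over ssh (assumed transport) and return its stdout."""
    try:
        result = subprocess.run(
            ["ssh", host, command],
            capture_output=True,
            text=True,
            timeout=30,
        )
    except (subprocess.TimeoutExpired, OSError):
        # Treat a hung or unavailable connection as "no output".
        return ""
    return result.stdout


def missing_services(ps_output, services):
    """Return the service names that do not appear in a list of process names."""
    running = {line.strip() for line in ps_output.splitlines()}
    return [name for name in services if name not in running]


def check_host(host, services, run=run_remote):
    """Restart every service missing on host; return the names restarted."""
    ps_output = run(host, "ps -ax -o ucomm")
    if not ps_output.strip():
        # ps always prints at least a header line, so empty output means the
        # box could not be reached. Nothing can be restarted in that case.
        return []
    restarted = []
    for name in missing_services(ps_output, services):
        run(host, services[name])
        restarted.append(name)
    return restarted
```

A cron job would then loop over `HOSTS` and call `check_host(host, SERVICES)` for each one, logging whatever it returns. As the message notes, nothing of this kind helps when init itself dies, because the box can no longer be reached to restart anything.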