From: Mark Morley
To: freebsd-stable@freebsd.org
Date: Fri, 21 May 2010 07:45:47 -0700
Subject: NFS trouble on 7.3-STABLE i386

Having an issue with a file server here (7.3-STABLE i386).

The nfsd processes are hanging. Client access to the NFS shares stops working, and the nfsd processes on the server cannot be killed by any means. There are no errors showing up anywhere on the server. The network connection to the server seems fine (i.e., anything other than NFS traffic seems OK).

Rebooting the server fixes the problem for a while, but it doesn't reboot easily: it times out while terminating the nfsd processes. When it finally does reboot, the file system isn't marked clean, resulting in a long wait for fsck (it doesn't find any problems, but it's a multi-terabyte share, so it takes a while).

This morning it did it again. This time I tried manually killing nfsd, but nothing I did would make the processes die. No errors.

The server has a dual-core Intel CPU and 2 gigs of RAM.

Adaptec 5805 RAID controller, 8 x 750G drives, RAID 6
2 x em interfaces

It's been fine until some time last week. I did recently upgrade from 7.1 to 7.3, which may be related, although this issue didn't start happening right away. It happens at no particular time of day, and it doesn't seem to coincide with any particular cron tasks or have anything to do with the level of activity.

Any thoughts?

Mark