Date:      Sun, 8 Aug 2010 19:58:09 -0400 (EDT)
From:      Rick Macklem <rmacklem@uoguelph.ca>
To:        rhfb@akira.stdio.com
Cc:        freebsd-hackers@freebsd.org, dfr@freebsd.org
Subject:   Re: NFS server hangs (was no subject)
Message-ID:  <282423324.419135.1281311889612.JavaMail.root@erie.cs.uoguelph.ca>
In-Reply-To: <20100729094046.AD3F3C2@akira.stdio.com>

------=_Part_419134_272684746.1281311889610
Content-Type: text/plain; charset=utf-8
Content-Transfer-Encoding: 7bit

> I have a similar problem.
> 
> I have an NFS server (8.0, upgraded a couple of times since Feb 2010) that
> locks up and requires a reboot.
> 
> The clients are busy VMs on VMware ESXi that use the NFS server for vmdk
> virtual disk storage.
> 
> ESXi reports the NFS server as inactive, and all the VMs post disk write
> errors when trying to write to their disks.
> 
> "/etc/rc.d/nfsd restart" fails to work (it cannot kill the nfsd process).
> 
> The nfsd process runs at 100% CPU in the rc_lo state in top.
> 
> A reboot is the only fix.
> 
> It has only happened under two circumstances.
> 1) Installation of a VM using Windows 2008.
> 2) Migrating 16 million mail messages from a physical server to a VM
> running FreeBSD with ZFS file system as a VM on the ESXi box that uses
> NFS to store the VM's ZFS disk.
> 
> The NFS server uses ZFS also.

I don't think what you are seeing is the same as what others have reported.
(I have a hunch that your problem might be a replay cache problem.)

Please try the attached patch and make sure that your sys/rpc/svc.c
is at r205562 (upgrade if it isn't).
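
For anyone curious what the patch changes: among other things, it moves the
"freed_one = FALSE" reset from before the pruning loop to inside it. The
sketch below is only a simplified user-space model of that kind of loop, not
the kernel code in sys/rpc/replay.c. The names struct entry, prune(),
simulate(), CACHE_MAX and SPIN_LIMIT are made up for the illustration. It
assumes a cache that is over its limit and holds one completed entry plus
several in-progress entries that must not be freed.

```c
#include <assert.h>
#include <stdbool.h>

#define CACHE_MAX   2     /* hypothetical stand-in for the cache limit */
#define SPIN_LIMIT  1000  /* passes after which we call it a hang */

struct entry {
	bool present;      /* slot holds a cached request */
	bool in_progress;  /* reply not recorded yet; must not be freed */
};

static int
count_entries(const struct entry *e, int n)
{
	int i, count = 0;

	for (i = 0; i < n; i++)
		if (e[i].present)
			count++;
	return (count);
}

/*
 * Free completed entries until the cache is below CACHE_MAX or nothing
 * more can be freed.  reset_each_pass == true models the patched loop,
 * false models the unpatched one.  Returns the number of passes made,
 * or -1 if SPIN_LIMIT was reached (i.e. the loop would never exit).
 */
static int
prune(struct entry *e, int n, bool reset_each_pass)
{
	bool freed_one = false;
	int i, passes = 0;

	assert(n > 0);
	if (count_entries(e, n) < CACHE_MAX)
		return (0);
	do {
		if (reset_each_pass)
			freed_one = false;
		if (++passes >= SPIN_LIMIT)
			return (-1);
		/* Try to free one entry, skipping in-progress ones. */
		for (i = 0; i < n; i++) {
			if (e[i].present && !e[i].in_progress) {
				e[i].present = false;
				freed_one = true;
				break;
			}
		}
	} while (freed_one && count_entries(e, n) >= CACHE_MAX);
	return (passes);
}

/* One completed entry and three in-progress ones, limit of two. */
static int
simulate(bool patched)
{
	struct entry cache[4] = {
		{ true, true }, { true, false }, { true, true }, { true, true }
	};

	return (prune(cache, 4, patched));
}
```

In this model, simulate(false) returns -1: the first pass frees the one
completed entry, the flag is never cleared afterwards, and the loop keeps
going although nothing else can be freed. simulate(true) returns 2, because
the second pass frees nothing and the loop exits. Whether this is what your
server is hitting is only a guess until you have tried the patch.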

If this patch doesn't help, you could try the experimental NFS server
(which doesn't use the generic replay cache) by adding "-e" to the mountd
and nfsd flags.

Please let me know whether the patch or switching to the experimental NFS
server helps.

rick


------=_Part_419134_272684746.1281311889610
Content-Type: text/x-patch; name=replay.patch
Content-Transfer-Encoding: 7bit
Content-Disposition: attachment; filename=replay.patch

--- rpc/replay.c.sav	2010-08-08 18:05:50.000000000 -0400
+++ rpc/replay.c	2010-08-08 18:16:43.000000000 -0400
@@ -90,8 +90,10 @@
 replay_setsize(struct replay_cache *rc, size_t newmaxsize)
 {
 
+	mtx_lock(&rc->rc_lock);
 	rc->rc_maxsize = newmaxsize;
 	replay_prune(rc);
+	mtx_unlock(&rc->rc_lock);
 }
 
 void
@@ -144,8 +146,8 @@
 	bool_t freed_one;
 
 	if (rc->rc_count >= REPLAY_MAX || rc->rc_size > rc->rc_maxsize) {
-		freed_one = FALSE;
 		do {
+			freed_one = FALSE;
 			/*
 			 * Try to free an entry. Don't free in-progress entries
 			 */
------=_Part_419134_272684746.1281311889610--


