FreeBSD Mail Archives

Date:      Fri, 21 Oct 2016 21:47:28 +0000
From:      Rick Macklem <rmacklem@uoguelph.ca>
To:        Marek Salwerowicz <marek.salwerowicz@misal.pl>, "freebsd-fs@freebsd.org" <freebsd-fs@freebsd.org>
Subject:   Re: ZFS - NFS server for VMware ESXi issues
Message-ID:  <YTXPR01MB018980B341213CA43277464ADDD40@YTXPR01MB0189.CANPRD01.PROD.OUTLOOK.COM>
In-Reply-To: <930df17b-8db8-121a-a24b-b4909b8162dc@misal.pl>

index | next in thread | previous in thread | raw e-mail

Marek Salwerowicz wrote:

Stuff snipped for brevity...

> Today, after two weeks of working, we experienced the same situation.
> The nfsd service was in following state:
>
>  PID USERNAME    THR PRI NICE   SIZE    RES STATE   C   TIME WCPU COMMAND
>   984 root        128  20    0 12344K  4020K vq->vq  8 346:27 0.00% nfsd
>
> nfsd service didn't respond to service nfsd restart, but this time
> machine was able to reboot using "# reboot" command.
I am not sure how "top" got a STATE of "vq->vq", but I suspect that refers to the
vdev section of the ZFS code. (The only other place in the kernel where "vq->vq"
shows up is in virtio and I doubt you are using that?)

I'm not a ZFS guy so I can't help, but I'd guess that it's looping around in the vdev
code, possibly competing for the vq->vq_lock?

Hopefully someone with ZFS expertise can help out?

Btw, about the only area of the NFS server that might need tuning is the DRC and
this doesn't suggest that. If you "nfsstat -e -s" on the server and see large #s for
the last line under "Server Cache Stats:" there are tunables that can be used.
I'd also suggest you capture the output of "ps axHl" on the server when it happens
again, which tells you what all the nfsd threads are up to.

Good luck with it, rick

home | help

Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?YTXPR01MB018980B341213CA43277464ADDD40>

Header And Logo

Peripheral Links

Site Navigation

Header And Logo

Peripheral Links

Search

Site Navigation