Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 19 Dec 2022 17:36:05 +0000 (UTC)
From:      "Bjoern A. Zeeb" <bz@freebsd.org>
To:        Rick Macklem <rick.macklem@gmail.com>
Cc:        Konstantin Belousov <kib@freebsd.org>, James Gritton <jamie@freebsd.org>,  freebsd-current@freebsd.org
Subject:   Re: RFC: nfsd in a vnet jail
Message-ID:  <p4p1r59o-1519-o723-24s6-2rqo1p4qr0r2@serrofq.bet>
In-Reply-To: <CAM5tNy73kfEToEEbT9Mz0VAZhNLxoFwEo1FMvoyR1Sg5Xaiv8g@mail.gmail.com>
References:  <CAM5tNy7CQaBTRWG0m0aN6T0xG2L2zSQJGa%2BatGaH%2BmW%2BwEpdyQ@mail.gmail.com> <1955021.aDjkhKmpDe@ravel> <CAM5tNy5a9GYjJcjXLQvsjF77Gsu6yej5XR=mMTAuVKWxoNfR1A@mail.gmail.com> <8351812.Gc231LQI4k@ravel> <CAM5tNy63yE%2BL0rjfdYSs_WVwh3_gi8fmRVNiTu9BEKzNj_iYgA@mail.gmail.com> <CAM5tNy73kfEToEEbT9Mz0VAZhNLxoFwEo1FMvoyR1Sg5Xaiv8g@mail.gmail.com>

next in thread | previous in thread | raw e-mail | index | archive | help
On Mon, 19 Dec 2022, Rick Macklem wrote:

> Hi,
>
> Kostik expressed some concern w.r.t. using a non-default VNET_NFSD kernel
> build option and I understand his concern, given that many prefer to use
> a GENERIC kernel and binary updates.

yes, I may have hinted towards that (at least in my mind) during looking
at the review.

There is a reason that (at least for now) I do like having it.
Personally (due to lack of time mostly) I haven't figured out if I would
want this to be a vnfs one day or part of vnet.  The earlier (like the
kernel option) could more easily address possible security concerns to
people (especially while this is a moving target with more possibly
coming later).

Hence me having asked for a dedicated macro not mangling with the VNET
macros directly in place as this gives us a lot of flexibility to easily
move this around in the future if we wanted to.

Removing the option will make the code simpler.


> Right now there are 29 NFS variables VNET_DEFINED() and several of them
> are arrays currently sized at 500. One of the reasons for the non-default
> VNET_NFSD kernel option was the bloat this caused to the vnet.
> (Chris <bsd-lists@bsdforge.com> expressed concern that adding mountd/nfsd
> to the vnet would result in bloat/overhead in a previous post to this
> thread.)
>
> Another issue with putting all these variables in the vnet is that the
> nfsd.ko
> cannot be loaded (it complains the vnet is out of space) and, as such,
> options NFSD must be used with VNET_NFSD so that the nfsd is linked into the
> kernel.
>
> As such, I am wondering what others think of this alternate plan?
> - Pull all the VNET_DEFINE()'d variable (except the 3 manipulated by
> sysctls)
>  into a structure.
> - Define a single VNET_DEFINE()'d variable that is a pointer to that
> structure
>  and then malloc() the structure in the function called by VNET_SYSINIT().
> This would result in a malloc'd structure for each vnet jail (for kernels
> built with VMIMAGE), but would only add 4 variables to the vnet.
>
> If a small C file that only consists of the VNET_DEFINE()s for the 4
> variables
> is linked into the kernel whenever the VIMAGE option is specified, then I
> think nfsd.ko would be loadable.

It is not the number of variables added that is a problem; it is (as you
point out above) their size which is a problem.

So 500 uint8_t variables are as expensive as 1 uint8_t[500];

There's only a "small" allocated per-vnet to replicate state for
modules.

Multicast also had that problem needing huge junks and we eventually
switched to malloc to fix this and make the module loadable as well
again.  (See 1a117215c7f90e6ef8c50ef3bfe099490aaa98f9 for one of the
changes -- I think I scrwed somehting up there about sizing so probably
follow-ups but it'll show the concept).

So wether you make this one huge malloc or multiple small ones for the
few big variables is up to you in the end.  I also wonder what is easier
to deal with "vnet0/prison0/'nfs0-bits'" as NFS_ROOT and other parts
will need some of these things eventually to be in place early one for
the base.

Also sysctl and malloc and virtualisation can be a bit tricky.

The "common" theme would be to only malloc the complex data types but
leave the simple type variables being normal virtualised ones.


> Unfortunately, this does not deal with vnet'ng the kgssapi, rpcsec_gss for
> Kerberized mounts or vnet'ng NFS-over-TLS, but those could be handled in a
> similar manner, I think?

Could be, yes.


> So, what do others think of this alternate plan?
>
> rick
> ps: Every use of the vnet'd variables is currently wrapped in a macro called
>    NFSD_VNET(), so the change is pretty easy to do by just re-writing this
> macro.
>

-- 
Bjoern A. Zeeb                                                     r15:7



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?p4p1r59o-1519-o723-24s6-2rqo1p4qr0r2>