Date:      Thu, 26 Nov 2020 00:48:17 +0000
From:      bugzilla-noreply@freebsd.org
To:        bugs@FreeBSD.org
Subject:   [Bug 251347] NFS hangs on client side when mounted from outside in Jail Tree (BROKEN NFS SERVER OR MIDDLEWARE)
Message-ID:  <bug-251347-227-5HjuCvQZFk@https.bugs.freebsd.org/bugzilla/>
In-Reply-To: <bug-251347-227@https.bugs.freebsd.org/bugzilla/>
References:  <bug-251347-227@https.bugs.freebsd.org/bugzilla/>

https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=251347

--- Comment #7 from Arne Steinkamm <arne@Steinkamm.COM> ---
(In reply to Rick Macklem from comment #6)

Thanks Rick,

as said before:
Using NFSv4 changes nothing :-(

I'm unable to change the wiring because the complete setup is in a computer
center 600 km from here, but I have access to all lights-out service
processors, the switches, etc.

I have the same problem now with a Linux NFS server, so I think the problem
is not on the server side.

To make the picture clearer:

two main administrative servers: adm001 and adm002
Both have a couple of VNET jails (if_bridge, not netgraph) running nameserver
etc.
And both have so-called login server jails: just a jail that people can
ssh to and then jump (ssh) into the development networks.

Two file servers, one Linux and one FreeBSD, share the $HOMEs and a pool share.

These are mounted on adm001 and adm002 at /l/home and /l/pool.

NULLFS mounts forward these in the two login jails:

/l/home --> /l/prison/login1/l/home and
/l/pool --> /l/prison/login1/l/pool.

Same procedure on adm002 with login002.
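
For illustration, the nullfs forwarding described above could be sketched
roughly as follows. This is only a dry run that prints the mount commands
instead of running them; the function name nullfs_cmds is hypothetical, and
whether the real setup uses mount(8) directly or fstab entries is an
assumption, since it is not stated here.

```shell
#!/bin/sh
# Dry-run sketch (assumption: not the actual configuration).
# Prints the nullfs mounts that forward the host's NFS mount points
# /l/home and /l/pool into a jail root such as /l/prison/login1.

# nullfs_cmds is a hypothetical helper; $1 is the jail root directory.
nullfs_cmds() {
    jail_root="$1"
    for src in /l/home /l/pool; do
        # On FreeBSD, "mount -t nullfs <target> <mountpoint>" creates
        # a nullfs view of <target> at <mountpoint>.
        echo "mount -t nullfs ${src} ${jail_root}${src}"
    done
}

nullfs_cmds /l/prison/login1
```

Printing the commands rather than executing them keeps the sketch safe to run
anywhere; on the real hosts the printed lines would need root and an already
mounted /l/home and /l/pool.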

After 30 seconds to 60 minutes one of the two NFS mounts freezes.
df, ls, umount etc. hang.

And now the strange part:

  "sh /etc/rc.d/jail stop login1"

immediately unfreezes the NFS mount and everything works normally again but,
of course, without the jail.

The second the jail is dead, the "BROKEN NFS..." text is displayed, but only
on the console, not in the dmesg buffer or log files and, this is strange,
only on adm001.
adm001 and adm002 are different hardware (different CPUs) but they are clones,
real clones made with zfs send/receive. Of course the hostids are different.

The workaround is to allow the users to use adm001 and adm002 directly. But
this is not the way we want to have it.

When I don't use NULLFS and instead make two independent NFS mounts, to
/l/home and /l/prison/login1/l/home, then only the second freezes.

I guess this is not normal and a real bug...

Any help?

Thanks in advance
.//. Arne

-- 
You are receiving this mail because:
You are the assignee for the bug.


