Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 20 Apr 2026 10:11:21 -0400
From:      Olgun Adak <olgun.adak@trexquant.com>
To:        freebsd-stable@freebsd.org
Cc:        freebsd-net@freebsd.org
Subject:   [REGRESSION] nfsd TCP socket lockup on 14.3-RELEASE-p9/p10 - Confirmed on Multiple Systems
Message-ID:  <CAFww=iytpc%2BJJuiPJtTrKKkqZaP%2BZ9OzHnTCzLk=MSEkzGwNzA@mail.gmail.com>

index | next in thread | raw e-mail

[-- Attachment #1 --]
Hello FreeBSD Community,

We’ve run into a consistent nfsd lockup after moving from 14.3-RELEASE-p8
to p10. We have verified this across two identical bare-metal systems.
Reverting to p8 via bectl immediately restores stability on both systems,
so this appears to be a regression introduced in the p9/p10 cycle.

*The symptoms:*

Under NFSv3 load, the nfsd service hangs and becomes unresponsive to all
clients. Looking at procstat -kk, we see a deadlock pattern where threads
are stuck waiting on soiolock:

_sx_xlock_hard -> soiolock -> sosend_generic -> sosend -> svc_vc_reply

Several threads are blocked in _sx_xlock_hard while others sit in sbwait.

*The environment:*

The systems are bare-metal with 2 x dual-port Mellanox ConnectX-6 100GbE
(mlx5en) cards. We see the issue regardless of MTU (1500 and 9000).

Offloads:

   -

   TSO: Enabled
   -

   LRO: The issue persists regardless of LRO state (tested with LRO
   disabled and with software-only LRO). Hardware LRO is disabled in all cases.

Relevant tunables:

kern.ipc.soacceptqueue=1000
kern.ipc.somaxconn=2000
kern.ipc.maxsockbuf=67108864
net.inet.tcp.sendbuf_max=67108864
net.inet.tcp.sendspace=16777216
net.inet.tcp.sendbuf_inc=262144
net.inet.tcp.recvbuf_max=67108864
net.inet.tcp.recvspace=16777216
vfs.nfsd.srvmaxio=1048576

We have kept the p10 Boot Environments intact and can boot back into them
to run any additional debug commands or test patches if someone can help
point us in the right direction.

Best regards,
-Olgun Adak

-- 



This message is intended only for the use of the individual or entity to 
which it is addressed, and may contain private and confidential 
information. If you are not the intended recipient of this message you are 
hereby notified that any review, dissemination, distribution or copying of 
this message is strictly prohibited. If you have received this e-mail in 
error, please immediately notify the sender by replying to this e-mail and 
delete the message and any attachment(s) from your system. This 
communication is for information purposes only and should not be regarded 
as an offer to sell or as a solicitation of an offer to buy any financial 
product, an official confirmation of any transaction, or as an official 
statement of Trexquant Investment LP. All information is subject to change 
without notice.

[-- Attachment #2 --]
<div dir="ltr"><div><div><p>Hello FreeBSD Community,</p><p>We’ve run into a consistent nfsd lockup after moving from 14.3-RELEASE-p8 to p10. We have verified this across two identical bare-metal systems. Reverting to p8 via bectl immediately restores stability on both systems, so this appears to be a regression introduced in the p9/p10 cycle.</p><p><b>The symptoms:</b> </p><p>Under NFSv3 load, the nfsd service hangs and becomes unresponsive to all clients. Looking at <code>procstat -kk</code>, we see a deadlock pattern where threads are stuck waiting on <code>soiolock</code>:</p><p><code>_sx_xlock_hard -&gt; soiolock -&gt; sosend_generic -&gt; sosend -&gt; svc_vc_reply</code></p><p>Several threads are blocked in <code>_sx_xlock_hard</code> while others sit in <code>sbwait</code>.</p><p><b>The environment:</b> </p><p>The systems are bare-metal with 2 x dual-port Mellanox ConnectX-6 100GbE (mlx5en) cards. We see the issue regardless of MTU (1500 and 9000).</p><p>Offloads:</p><ul><li style="margin-left:15px"><p>TSO: Enabled</p></li><li style="margin-left:15px"><p>LRO: The issue persists regardless of LRO state (tested with LRO disabled and with software-only LRO). Hardware LRO is disabled in all cases.</p></li></ul><p>Relevant tunables:</p><p></p></div><blockquote style="margin:0px 0px 0px 40px;border:none;padding:0px"><div><div><div><div><pre style="text-wrap-mode: wrap;"><code role="text">kern.ipc.soacceptqueue=1000<br></code>kern.ipc.somaxconn=2000<br>kern.ipc.maxsockbuf=67108864<br>net.inet.tcp.sendbuf_max=67108864<br>net.inet.tcp.sendspace=16777216<br>net.inet.tcp.sendbuf_inc=262144<br>net.inet.tcp.recvbuf_max=67108864<br>net.inet.tcp.recvspace=16777216<br>vfs.nfsd.srvmaxio=1048576</pre></div></div></div></div></blockquote><div><p></p><p></p><p></p><p>We have kept the p10 Boot Environments intact and can boot back into them to run any additional debug commands or test patches if someone can help point us in the right direction.</p><p></p><p>Best regards,</p></div><div><div dir="ltr" class="gmail_signature"><div dir="ltr"><div dir="ltr">-Olgun Adak</div></div></div></div></div></div>

<br>
<div><hr></div><font face="Arial, Helvetica, sans-serif"><span style="font-size:13px"><div><font face="Arial, Helvetica, sans-serif"><span style="font-size:13px"><br></span></font></div>This message is intended only for the use of the individual or entity to which it is addressed, and may contain private and confidential information. If you are not the intended recipient of this message you are hereby notified that any review, dissemination, distribution or copying of this message is strictly prohibited. If you have received this e-mail in error, please immediately notify the sender by replying to this e-mail and delete the message and any attachment(s) from your system. This communication is for information purposes only and should not be regarded as an offer to sell or as a solicitation of an offer to buy any financial product, an official confirmation of any transaction, or as an official statement of Trexquant Investment LP. All information is subject to change without notice.</span></font>
home | help

Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?CAFww=iytpc%2BJJuiPJtTrKKkqZaP%2BZ9OzHnTCzLk=MSEkzGwNzA>