Date: Tue, 19 Jul 2016 17:40:32 +0200
From: Hans Petter Selasky <hps@selasky.org>
To: Justin Clift <justin@postgresql.org>, freebsd-infiniband@freebsd.org
Subject: Re: Weird NFS client lock up with Mellanox cards :/
Message-ID: <6a5b530e-521c-47f7-5012-7512b2fa050c@selasky.org>
In-Reply-To: <288AE8D3-9F16-453D-BD73-00672C4E2D94@postgresql.org>
References: <288AE8D3-9F16-453D-BD73-00672C4E2D94@postgresql.org>
On 07/19/16 15:28, Justin Clift wrote:
> Hi all,
>
> Brian Krusic (CC'd), has been kind enough to put time into some
> performance testing of Mellanox ConnectX-3 Pro cards in 40GbE mode,
> with FreeNAS 9.10-STABLE. (That uses FreeBSD 10-STABLE as its
> base OS)
>
> Weirdly, his NFS clients are locking up when using Mellanox cards,
> but not with SolarFlare ones.
>
> https://bugs.freenas.org/issues/7659#note-40
>
> Comparing the OFED code in FreeNAS 9.10-STABLE to FreeBSD 10-STABLE,
> there's one patch difference. It's a recent one from 3 days ago:
>
> MFC r301877
> Add a missing error check for a malloc() call in idr_get().
> https://github.com/freebsd/freebsd/commit/03f3328da077d2def40be7dea8d13c74c2ccd447
>
> Does anyone know if this missing patch could result in a slow down
> of NFS clients (but not Samba/SMB)? Maybe memory leak style, leading
> to a lack of resources or something?
>
> Hoping it's really this simple. But if not... does anyone have
> suggestions on what to try for figuring this out?
>
> Regards and best wishes,
>

Hi,

Might be some timing issue not related to the Mellanox cards.
What link speeds are being used?
What happens when the lockup happens?
Is RSS being used?

--HPS