Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 19 Jul 2016 17:40:32 +0200
From:      Hans Petter Selasky <hps@selasky.org>
To:        Justin Clift <justin@postgresql.org>, freebsd-infiniband@freebsd.org
Subject:   Re: Weird NFS client lock up with Mellanox cards :/
Message-ID:  <6a5b530e-521c-47f7-5012-7512b2fa050c@selasky.org>
In-Reply-To: <288AE8D3-9F16-453D-BD73-00672C4E2D94@postgresql.org>
References:  <288AE8D3-9F16-453D-BD73-00672C4E2D94@postgresql.org>

next in thread | previous in thread | raw e-mail | index | archive | help
On 07/19/16 15:28, Justin Clift wrote:
> Hi all,
>
> Brian Krusic (CC'd), has been kind enough to put time into some
> performance testing of Mellanox ConnectX-3 Pro cards in 40GbE mode,
> with FreeNAS 9.10-STABLE.  (That uses FreeBSD 10-STABLE as it's
> base OS)
>
> Weirdly, his NFS clients are locking up when using Mellanox cards,
> but not with SolarFlare ones.
>
>   https://bugs.freenas.org/issues/7659#note-40
>
> Comparing the OFED code in FreeNAS 9.10-STABLE to FreeNSD 10-STABLE,
> there's one patch difference.  It's a recent one from 3 days ago:
>
>   MFC r301877
>   Add a missing error check for a malloc() call in idr_get().
>   https://github.com/freebsd/freebsd/commit/03f3328da077d2def40be7dea8d13c74c2ccd447
>
> Does anyone know if this missing patch could result in a slow down
> of NFS clients (but not Samba/SMB)?  Maybe memory leak style, leading
> to a lack of resources or something?
>
> Hoping it's really this simple.  But if not... does anyone have
> suggestions on what to try for figuring this out?
>
> Regards and best wishes,
>

Hi,

Might be some timing issue not related to the Mellanox cards. What link 
speeds is being used?

What happens when the lockup happens?

Is RSS being used?

--HPS




Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?6a5b530e-521c-47f7-5012-7512b2fa050c>