Skip site navigation (1)Skip section navigation (2)
Date:      Mon, 22 Apr 2013 11:56:32 +0200
From:      Baptiste Daroussin <bapt@FreeBSD.org>
To:        "Eggert, Lars" <lars@netapp.com>
Cc:        Rick Macklem <rmacklem@uoguelph.ca>, current <current@freebsd.org>
Subject:   Re: newnfs pkgng database corruption?
Message-ID:  <20130422095632.GG38955@ithaqua.etoilebsd.net>
In-Reply-To: <20130412131037.GI95891@ithaqua.etoilebsd.net>
References:  <746529932.761884.1365721847968.JavaMail.root@erie.cs.uoguelph.ca> <8A9F25C5-C5E6-4373-85F9-C47642CF1BAE@netapp.com> <20130412131037.GI95891@ithaqua.etoilebsd.net>

next in thread | previous in thread | raw e-mail | index | archive | help

--BzCohdixPhurzSK4
Content-Type: text/plain; charset=us-ascii
Content-Disposition: inline
Content-Transfer-Encoding: quoted-printable

On Fri, Apr 12, 2013 at 03:10:37PM +0200, Baptiste Daroussin wrote:
> On Fri, Apr 12, 2013 at 12:56:10PM +0000, Eggert, Lars wrote:
> > Hi,
> >=20
> > On Apr 12, 2013, at 1:10, Rick Macklem <rmacklem@uoguelph.ca> wrote:
> > > Well, I have no idea why an NFS server would reply errno 70 if the fi=
le
> > > still exists, unless the client has somehow sent a bogus file handle
> > > to the server. (I am not aware of any client bug that might do that. I
> > > am almost suspicious that there might be a memory problem or something
> > > that corrupts bits in the network layer. Do you have TSO enabled for =
your
> > > network interface by any chance? If so, I'd try disabling that on the
> > > network interface. Same goes for checksum offload.)
> > >=20
> > > rick
> > > ps: If you can capture packets between the client and server at the
> > >    time this error occurs, looking at them in wireshark might be
> > >    useful?
> >=20
> > I will try all of those things.
> >=20
> > But first, a question that someone who understands pkgng will be able t=
o answerr: Is this "fake-pkg" process even running on the NFS mount? The WR=
KDIR is /tmp, which is an mfs mount.
>=20
> fake-pkg is run in WRKDIR, but it calls pkgng which will open
> /var/db/pkg/local.sqlite aka nfs mount.
>=20
> The Error 70 is EX_SOFTWARE returned by pkgng.
>=20
> Can you try the following patch:
> http://people.freebsd.org/~bapt/patch-libpkg__pkgdb.c
>=20
> Just add that file to /usr/ports/ports-mgmt/pkg/files/
>=20
> If that works for you, that means the posix advisory locks is somehow fai=
ling on
> nfsv4 files.
>=20
> Given it is already known to be failing on nfsv3 (because people often
> misconfigure it) I'll probablmy make unix-dotfile the default locking sys=
tem
> when local.sqlite is stored on network filesystem.
>=20
> regards,
> Bapt

As anyone been able to test this patch?

regards,
Bapt

--BzCohdixPhurzSK4
Content-Type: application/pgp-signature

-----BEGIN PGP SIGNATURE-----
Version: GnuPG v2.0.19 (FreeBSD)

iEYEARECAAYFAlF1CVAACgkQ8kTtMUmk6EygBACfelcjZhPWPvoASmTwb/jLpCYJ
Y4oAnR2nMGsg0kfRIcDPkMMHF0nugxce
=BUgb
-----END PGP SIGNATURE-----

--BzCohdixPhurzSK4--



Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20130422095632.GG38955>