Date: Mon, 22 Apr 2013 11:56:32 +0200 From: Baptiste Daroussin <bapt@FreeBSD.org> To: "Eggert, Lars" <lars@netapp.com> Cc: Rick Macklem <rmacklem@uoguelph.ca>, current <current@freebsd.org> Subject: Re: newnfs pkgng database corruption? Message-ID: <20130422095632.GG38955@ithaqua.etoilebsd.net> In-Reply-To: <20130412131037.GI95891@ithaqua.etoilebsd.net> References: <746529932.761884.1365721847968.JavaMail.root@erie.cs.uoguelph.ca> <8A9F25C5-C5E6-4373-85F9-C47642CF1BAE@netapp.com> <20130412131037.GI95891@ithaqua.etoilebsd.net>
next in thread | previous in thread | raw e-mail | index | archive | help
--BzCohdixPhurzSK4 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On Fri, Apr 12, 2013 at 03:10:37PM +0200, Baptiste Daroussin wrote: > On Fri, Apr 12, 2013 at 12:56:10PM +0000, Eggert, Lars wrote: > > Hi, > >=20 > > On Apr 12, 2013, at 1:10, Rick Macklem <rmacklem@uoguelph.ca> wrote: > > > Well, I have no idea why an NFS server would reply errno 70 if the fi= le > > > still exists, unless the client has somehow sent a bogus file handle > > > to the server. (I am not aware of any client bug that might do that. I > > > am almost suspicious that there might be a memory problem or something > > > that corrupts bits in the network layer. Do you have TSO enabled for = your > > > network interface by any chance? If so, I'd try disabling that on the > > > network interface. Same goes for checksum offload.) > > >=20 > > > rick > > > ps: If you can capture packets between the client and server at the > > > time this error occurs, looking at them in wireshark might be > > > useful? > >=20 > > I will try all of those things. > >=20 > > But first, a question that someone who understands pkgng will be able t= o answerr: Is this "fake-pkg" process even running on the NFS mount? The WR= KDIR is /tmp, which is an mfs mount. >=20 > fake-pkg is run in WRKDIR, but it calls pkgng which will open > /var/db/pkg/local.sqlite aka nfs mount. >=20 > The Error 70 is EX_SOFTWARE returned by pkgng. >=20 > Can you try the following patch: > http://people.freebsd.org/~bapt/patch-libpkg__pkgdb.c >=20 > Just add that file to /usr/ports/ports-mgmt/pkg/files/ >=20 > If that works for you, that means the posix advisory locks is somehow fai= ling on > nfsv4 files. >=20 > Given it is already known to be failing on nfsv3 (because people often > misconfigure it) I'll probablmy make unix-dotfile the default locking sys= tem > when local.sqlite is stored on network filesystem. >=20 > regards, > Bapt As anyone been able to test this patch? regards, Bapt --BzCohdixPhurzSK4 Content-Type: application/pgp-signature -----BEGIN PGP SIGNATURE----- Version: GnuPG v2.0.19 (FreeBSD) iEYEARECAAYFAlF1CVAACgkQ8kTtMUmk6EygBACfelcjZhPWPvoASmTwb/jLpCYJ Y4oAnR2nMGsg0kfRIcDPkMMHF0nugxce =BUgb -----END PGP SIGNATURE----- --BzCohdixPhurzSK4--
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?20130422095632.GG38955>