Skip site navigation (1)Skip section navigation (2)
Date:      Tue, 02 Mar 2010 12:25:17 +0200
From:      Daniel Braniss <danny@cs.huji.ac.il>
To:        Rick Macklem <rmacklem@uoguelph.ca>
Cc:        stable@freebsd.org, freebsd-fs@freebsd.org, Willem Jan Withagen <wjw@digiware.nl>, =?utf-8?B?RWlyaWsgw5h2ZXJieQ==?= <ltning@anduin.net>, rwatson@freebsd.org, Jeremy Chadwick <freebsd@jdc.parodius.com>
Subject:   Re: mbuf leakage with nfs/zfs? 
Message-ID:  <E1NmPHy-0009jy-Dj@kabab.cs.huji.ac.il>
In-Reply-To: <Pine.GSO.4.63.1003011703100.26054@muncher.cs.uoguelph.ca> 
References:  <20100226174021.8feadad9.gerrit@pmp.uni-hannover.de>  <E1Nl6VA-000557-D9@kabab.cs.huji.ac.il> <20100226224320.8c4259bf.gerrit@pmp.uni-hannover.de> <4B884757.9040001@digiware.nl> <20100227080220.ac6a2e4d.gerrit@pmp.uni-hannover.de> <4B892918.4080701@digiware.nl> <20100227202105.f31cbef7.gerrit@pmp.uni-hannover.de> <20100227193819.GA60576@icarus.home.lan> <BD8AC9F6-DF96-41F9-8E92-48A4E5606DC7@anduin.net> <4B89943C.70704@digiware.nl> <20100227220310.GA65110@icarus.home.lan> <Pine.GSO.4.63.1003011703100.26054@muncher.cs.uoguelph.ca>

next in thread | previous in thread | raw e-mail | index | archive | help
> 
> 
> On Sat, 27 Feb 2010, Jeremy Chadwick wrote:
> 
> >> I concur.
> >> Everything in my network is now on TCP, and there is no mbuf leakage.
> >> I just don't get over the 5500 mark, no matter what I throw at it.
> >>
> >> I do feel that TCP is not as well performing on a local net with Linux,
> >> hence the choice for UDP. But TCP is workable as next best.
> >
> > NFS; Rick Macklem would be a better choice, but as reported, he's MIA.
> >
> 
> Not exactly MIA, but only able to read email from time to time at this
> point. I don't know when I'll be able to do more than that.
> 
> So, it does sound like it is UDP specific. Robert mentioned one scenario,
> which was an infrequently executed code path that is being tickled and it
> has a missing m_freem().
> 
> One thing someone could try is switching to the experimental nfs server
> ("-e" on both mountd and nfsd) and see if the leak goes away. If it does
> go away, it is almost certainly the above in the regular nfs server code.
> 
runing with the experimental nfs server all is ok!
(at least I can't see any mbuf leakage :-)

so now that we can  assume that the problem is in NFS/UDP writes via
classic nfsserver, where to look?

> If it doesn't go away, the problem is more likely in the krpc or the
> generic udp code. (When I looked at svc_dg.c, I could only spot one
> possible leak and you've already determined that patch doesn't help.
> The other big difference when using udp on the FreeBSD8 krpc is the
> reply cache code. I seem to recall it's an lru cache with a fixed upper
> bound, but it might be broken and leaking.
> 
> If you change the server to set sp_rcache = NULL in the initialization
> function in sys/nfsserver/nfs_srvkrpc.c, I think that disables the replay
> cache. You wouldn't want to run this way in production, but it would 
> determine if the leak is in it.
> 
> Change the 3 lines in nfsrv_init() to:
> nfsrv_pool->sp_rcache = NULL;
> nfsrv_pool->sp_assign = NULL;
> nfsrv_pool->sp_done = NULL;
> 
> and I think the krpc replay cache will be disabled.
> 
> Good luck with it and please report back if you get to try the above.
> 
> I'll get back to committing etc one of these days, rick

just keep sending insights/pointers and enjoy life

	danny





Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?E1NmPHy-0009jy-Dj>