From owner-freebsd-questions@FreeBSD.ORG Tue Nov 30 23:17:02 2010 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id AE883106564A for ; Tue, 30 Nov 2010 23:17:02 +0000 (UTC) (envelope-from elon@emmi.physik-pool.tu-berlin.de) Received: from emmi.physik-pool.tu-berlin.de (emmi.physik-pool.tu-berlin.de [130.149.58.146]) by mx1.freebsd.org (Postfix) with ESMTP id 566558FC19 for ; Tue, 30 Nov 2010 23:17:01 +0000 (UTC) Received: from emmi.physik-pool.tu-berlin.de (localhost.physik-pool.tu-berlin.de [127.0.0.1]) by emmi.physik-pool.tu-berlin.de (8.14.4/8.14.4) with ESMTP id oAUNH0Ju056368; Wed, 1 Dec 2010 00:17:00 +0100 (CET) (envelope-from elon@emmi.physik-pool.tu-berlin.de) Received: (from elon@localhost) by emmi.physik-pool.tu-berlin.de (8.14.4/8.14.4/Submit) id oAUNH0m8056367; Wed, 1 Dec 2010 00:17:00 +0100 (CET) (envelope-from elon) Date: Wed, 1 Dec 2010 00:17:00 +0100 From: Leon =?iso-8859-15?Q?Me=DFner?= To: krad Message-ID: <20101130231700.GD98014@emmi.physik-pool.tu-berlin.de> Mail-Followup-To: krad , freebsd-questions@freebsd.org References: <4CF44E2E.4070700@egr.msu.edu> <20101130014807.GC98014@emmi.physik-pool.tu-berlin.de> MIME-Version: 1.0 Content-Type: text/plain; charset=iso-8859-15 Content-Disposition: inline Content-Transfer-Encoding: 8bit In-Reply-To: User-Agent: Mutt/1.5.20 (2009-06-14) Cc: freebsd-questions@freebsd.org Subject: Re: Stale NFS file handles on 8.x amd64 X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 30 Nov 2010 23:17:02 -0000 I set a wrong cc . Please look over to -stable. Sorry for that, Leon On Tue, Nov 30, 2010 at 03:10:18PM +0000, krad wrote: > On 30 November 2010 01:48, Leon Meßner wrote: > > > Hi, > > > > On Mon, Nov 29, 2010 at 08:06:54PM -0500, Adam McDougall wrote: > > > I've been running dovecot 1.1 on FreeBSD 7.x for a while with a bare > > > minimum of NFS problems, but it got worse with 8.x. I have 2-4 servers > > > (usually just 2) accessing mail on a Netapp over NFSv3 via imapd. > > > delivery is via procmail which doesn't touch the dovecot metadata and > > > webmail uses imapd. Client connections to imapd go to random servers > > > and I don't yet have solid means to keep certain users on certain > > > servers. I upgraded some of the servers to 8.x and dovecot 1.2 and ran > > > into Stale NFS file handles causing index/uidlist corruption causing > > > inboxes to appear as empty when they were not. In some situations their > > > corrupt index had to be deleted manually. I first suspected dovecot 1.2 > > > since it was upgraded at the same time but I downgraded to 1.1 and its > > > doing the same thing. I don't really have a wealth of details to go on > > > yet and I usually stay quiet until I do, and half the time it is > > > difficult to reproduce myself so I've had to put it in production to get > > > a feel for progress. This only happens a dozen or so times per weekday > > > but I feel the need to start taking bigger steps. I'll probably do what > > > > Does it depend on the size of the message? > > > > > I can to get IMAP back on a stable base (7.x?) and also try to debug 8.x > > > on the remaining servers. A binary search is within possibility if I > > > can reproduce the symptoms often enough even if I have to put a test > > > server in production for a few hours. > > > > > > Any tips on where we could start looking, or alterations I could try > > > making such as sysctls to return to older behavior? It might be worth > > > > there were some problems on nullfs mounted nfs shares (like in jails) > > and dovecot, as dovecot changed its location for temporary file creation > > to the user home. But IIRC the error message looked more like: > > http://www.mail-archive.com/dovecot@dovecot.org/msg26856.html > > And are fixed in stable. > > > > Just a hint, > > Leon > > _______________________________________________ > > freebsd-questions@freebsd.org mailing list > > http://lists.freebsd.org/mailman/listinfo/freebsd-questions > > To unsubscribe, send any mail to " > > freebsd-questions-unsubscribe@freebsd.org" > > > > > im seeing similar issues on a large mail platform with netapp and dovecot on > freebsd 8.1 as well. The problems existed in 7.x as well though. Basically > the NFS mount just locks up. I've not managed to pin point it yet but one > thing im certain of its a client os issue rather than the filer. This is > because only one node out fo the 16 will lock at any time on that particular > nfs mount. Strangely as well if I remount the dead nfs share on say /mnt on > the affected node, it works fine. I'm convinced its some kind of locking > issue. > > I have dtrace (WITH_CTF=1) in the kernel, so will have a poke around with > that and see if I can see anything interesting. Can anyone recommend > anything here? > _______________________________________________ > freebsd-questions@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-questions > To unsubscribe, send any mail to "freebsd-questions-unsubscribe@freebsd.org"