From owner-freebsd-questions@FreeBSD.ORG Tue Nov 30 15:10:20 2010 Return-Path: Delivered-To: freebsd-questions@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:4f8:fff6::34]) by hub.freebsd.org (Postfix) with ESMTP id 0BE5B1065672 for ; Tue, 30 Nov 2010 15:10:20 +0000 (UTC) (envelope-from kraduk@gmail.com) Received: from mail-ey0-f182.google.com (mail-ey0-f182.google.com [209.85.215.182]) by mx1.freebsd.org (Postfix) with ESMTP id 82BD18FC18 for ; Tue, 30 Nov 2010 15:10:19 +0000 (UTC) Received: by eyb7 with SMTP id 7so2858588eyb.13 for ; Tue, 30 Nov 2010 07:10:18 -0800 (PST) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:received:in-reply-to :references:date:message-id:subject:from:to:content-type; bh=NbjZ4OHDtJG/SX4XtO5EjfTolK49qbPFgupSD72tesY=; b=vTsd63Jb+41Z+YRzpHeyjvyTc8aBFBfiZmlwbtnuz5vUE5/81Hn5QhkQu0U5FaFQPF mNgtjGsYQQegwN5j/NdzZYUE5W0R99z4u5x5UV4vuJCuRVYi0OgPtbSI4k7pcQqfcCRN zv30y9hB93DRqA4TV4KMekmsO8ogQK7Fc6VOM= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; b=sYMts2UGbPFEXyWA1GqeQD/ln2QsBHjR3Z0Bjo+iaR6UujbzgdkrwHdJt3f/rkTa2I ujItpdzB73fnqBKKXsovm6HBRLKopAl7YlhfwEhoZMMHEyIxEFUa/FmAFPXIB6LHV8pz zYRYsDEa5HylmdpqwszNXj/x6nbazupedbCEY= MIME-Version: 1.0 Received: by 10.216.71.66 with SMTP id q44mr6625755wed.44.1291129818318; Tue, 30 Nov 2010 07:10:18 -0800 (PST) Received: by 10.216.37.132 with HTTP; Tue, 30 Nov 2010 07:10:18 -0800 (PST) In-Reply-To: <20101130014807.GC98014@emmi.physik-pool.tu-berlin.de> References: <4CF44E2E.4070700@egr.msu.edu> <20101130014807.GC98014@emmi.physik-pool.tu-berlin.de> Date: Tue, 30 Nov 2010 15:10:18 +0000 Message-ID: From: krad To: Adam McDougall , freebsd-questions@freebsd.org Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: quoted-printable X-Content-Filtered-By: Mailman/MimeDel 2.1.5 Cc: Subject: Re: Stale NFS file handles on 8.x amd64 X-BeenThere: freebsd-questions@freebsd.org X-Mailman-Version: 2.1.5 Precedence: list List-Id: User questions List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Tue, 30 Nov 2010 15:10:20 -0000 On 30 November 2010 01:48, Leon Me=DFner wro= te: > Hi, > > On Mon, Nov 29, 2010 at 08:06:54PM -0500, Adam McDougall wrote: > > I've been running dovecot 1.1 on FreeBSD 7.x for a while with a bare > > minimum of NFS problems, but it got worse with 8.x. I have 2-4 servers > > (usually just 2) accessing mail on a Netapp over NFSv3 via imapd. > > delivery is via procmail which doesn't touch the dovecot metadata and > > webmail uses imapd. Client connections to imapd go to random servers > > and I don't yet have solid means to keep certain users on certain > > servers. I upgraded some of the servers to 8.x and dovecot 1.2 and ran > > into Stale NFS file handles causing index/uidlist corruption causing > > inboxes to appear as empty when they were not. In some situations thei= r > > corrupt index had to be deleted manually. I first suspected dovecot 1.= 2 > > since it was upgraded at the same time but I downgraded to 1.1 and its > > doing the same thing. I don't really have a wealth of details to go on > > yet and I usually stay quiet until I do, and half the time it is > > difficult to reproduce myself so I've had to put it in production to ge= t > > a feel for progress. This only happens a dozen or so times per weekday > > but I feel the need to start taking bigger steps. I'll probably do wha= t > > Does it depend on the size of the message? > > > I can to get IMAP back on a stable base (7.x?) and also try to debug 8.= x > > on the remaining servers. A binary search is within possibility if I > > can reproduce the symptoms often enough even if I have to put a test > > server in production for a few hours. > > > > Any tips on where we could start looking, or alterations I could try > > making such as sysctls to return to older behavior? It might be worth > > there were some problems on nullfs mounted nfs shares (like in jails) > and dovecot, as dovecot changed its location for temporary file creation > to the user home. But IIRC the error message looked more like: > http://www.mail-archive.com/dovecot@dovecot.org/msg26856.html > And are fixed in stable. > > Just a hint, > Leon > _______________________________________________ > freebsd-questions@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-questions > To unsubscribe, send any mail to " > freebsd-questions-unsubscribe@freebsd.org" > im seeing similar issues on a large mail platform with netapp and dovecot o= n freebsd 8.1 as well. The problems existed in 7.x as well though. Basically the NFS mount just locks up. I've not managed to pin point it yet but one thing im certain of its a client os issue rather than the filer. This is because only one node out fo the 16 will lock at any time on that particula= r nfs mount. Strangely as well if I remount the dead nfs share on say /mnt on the affected node, it works fine. I'm convinced its some kind of locking issue. I have dtrace (WITH_CTF=3D1) in the kernel, so will have a poke around with that and see if I can see anything interesting. Can anyone recommend anything here?