From: Rick Macklem
To: Marc Fournier
Cc: freebsd-stable@freebsd.org, John Baldwin
Date: Sun, 10 Feb 2013 19:31:06 -0500 (EST)
Subject: Re: 9-STABLE -> NFS -> NetAPP:
Message-ID: <1946688889.2870936.1360542666536.JavaMail.root@erie.cs.uoguelph.ca>
In-Reply-To: <461A7DB1-4693-4AA5-B43C-5F15FE0C4918@hub.org>

Marc Fournier wrote:
> Hi John …
>
> Does this help?
>
> root@io:~ # ps auxl | grep du
> root 1054 0.0 0.1 16176 6600 ?? D 3:15AM 0:05.38 du -skx /vm/2799 0
> 81426 0 20 0 newnfs
> root 12353 0.0 0.1 16176 5104 ?? D Sat03AM 0:05.41 du -skx /vm/2799 0
> 91597 0 20 0 newnfs
> root 64529 0.0 0.1 16176 5164 ?? D Fri03AM 0:05.40 du -skx /vm/2799 0
> 43227 0 20 0 newnfs
> root 12855 0.0 0.0 16308 1988 0 S+ 5:26AM 0:00.00 grep du 0 12847 0 20
> 0 piperd

It is probably too late, but the full output (without the "| grep du")
would be more useful. I also include the "H" flag, so it lists threads
as well as processes.

The above just says the "du" command is waiting for a vnode lock. The
interesting process/thread is the one that is holding a vnode lock
while waiting for something else.

Are you still getting the:
nfs_getpages: error 13
vm_fault: pager read error, pid 11355 (https)
messages logged?

With John's recent patch, the error# would no longer be 13 if it was
caused by the "intr" flag resulting in a Read RPC terminating with
EINTR. If you are still getting the above with "error 13", it suggests
that the server is replying EACCES for the Read RPC.

I suggested before that you check to make sure that the executable had
read access for everyone on the file server. Since I didn't hear back,
I'll assume this is the case.

rick
ps: If it is still up and hasn't been rebooted, you could:
    sysctl debug.kdb.break_to_debugger=1
    - then break into the debugger at the console and do the following
      from the debugger:
    http://www.freebsd.org/doc/en_US.ISO8859-1/books/developers-handbook/kerneldebug-deadlocks.html
    How well this works depends on what options your kernel was built
    with.

> root@io:~ # grep vm /etc/fstab
> 192.168.1.254:/vol/basic /vm nfs rw,nolockd,intr 0 0
>
> Haven't rebooted yet … if there is anything I can do / try before … ?
>
> The kernel is from Jan 21st …
>
>
> On 2013-01-19, at 4:57 AM, John Baldwin wrote:
>
> > On Tuesday, December 18, 2012 11:58:36 PM Hub- Marketing wrote:
> >> I'm running a few servers sitting on top of a NetAPP file server …
> >> everything runs great, but periodically I'm getting:
> >>
> >> nfs_getpages: error 13
> >> vm_fault: pager read error, pid 11355 (https)
> >
> > Are you using interruptible mounts ("intr" mount option)?
> >
> > Also, can you get ps output that includes the 'l' flag to show what
> > the processes are stuck on?
> >
> > --
> > John Baldwin
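
For anyone following the thread, here is a rough sketch of one way to
narrow the full ps listing suggested above down to the blocked entries.
It is not something posted in the thread. The function name
blocked_threads is made up for illustration, and it assumes STAT is the
8th whitespace-separated column, as in the "ps auxl" output quoted in
the message:

```shell
# Hypothetical helper (not from the thread): keep the header line plus
# every entry whose STAT column starts with "D" (uninterruptible wait).
# Assumption: STAT is the 8th whitespace-separated field, as in the
# "ps auxl" output quoted in the message.
blocked_threads() {
    awk 'NR == 1 || $8 ~ /^D/'
}

# Possible use on the affected client:
#   ps auxlH | blocked_threads
```

This only shortens the listing. The thread of interest is still the one
holding a vnode lock while it waits on something else, so the wait
channel column of each remaining line has to be read by hand.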