From owner-freebsd-stable@FreeBSD.ORG Wed Feb 13 23:54:30 2013 Return-Path: Delivered-To: freebsd-stable@freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by hub.freebsd.org (Postfix) with ESMTP id 7F2A2610; Wed, 13 Feb 2013 23:54:30 +0000 (UTC) (envelope-from rmacklem@uoguelph.ca) Received: from esa-annu.net.uoguelph.ca (esa-annu.mail.uoguelph.ca [131.104.91.36]) by mx1.freebsd.org (Postfix) with ESMTP id 159108A5; Wed, 13 Feb 2013 23:54:29 +0000 (UTC) X-IronPort-Anti-Spam-Filtered: true X-IronPort-Anti-Spam-Result: AqAEAAcnHFGDaFvO/2dsb2JhbABFhk+6NnOCHwEBAQMBAQEBIAQnIAsFFhgCAg0ZAikBCSYGCAcEAQgUBIdrBgytOJIygSOMJw2DGoETA4hmiwuCM4EdjzaDJE9+Bxce X-IronPort-AV: E=Sophos;i="4.84,660,1355115600"; d="scan'208";a="13936263" Received: from erie.cs.uoguelph.ca (HELO zcs3.mail.uoguelph.ca) ([131.104.91.206]) by esa-annu.net.uoguelph.ca with ESMTP; 13 Feb 2013 18:54:28 -0500 Received: from zcs3.mail.uoguelph.ca (localhost.localdomain [127.0.0.1]) by zcs3.mail.uoguelph.ca (Postfix) with ESMTP id 1FE67B4048; Wed, 13 Feb 2013 18:54:28 -0500 (EST) Date: Wed, 13 Feb 2013 18:54:28 -0500 (EST) From: Rick Macklem To: "Marc G. Fournier" Message-ID: <426187631.3000937.1360799668107.JavaMail.root@erie.cs.uoguelph.ca> In-Reply-To: Subject: Re: 9-STABLE -> NFS -> NetAPP: MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable X-Originating-IP: [172.17.91.202] X-Mailer: Zimbra 6.0.10_GA_2692 (ZimbraWebClient - FF3.0 (Win)/6.0.10_GA_2692) Cc: Konstantin Belousov , freebsd-stable@freebsd.org, Kostik Belousov , John Baldwin X-BeenThere: freebsd-stable@freebsd.org X-Mailman-Version: 2.1.14 Precedence: list List-Id: Production branch of FreeBSD source code List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 13 Feb 2013 23:54:30 -0000 Marc Fournier wrote: > On 2013-02-13, at 14:50 , Rick Macklem wrote: >=20 > > He does get the odd error reported by nfs_getpages() and I don't > > think we've isolated why yet. The error is 13 (EACCES), but jhb@ > > thought it might be because of the bug he fixed where the krpc > > reported EACCES for the EINTR case. I don't think we've heard > > back from Marc w.r.t. whether he has gotten any more of these > > erros logged since applying jhb@'s patch and whether or not > > the errno has changed to EINTR? >=20 > As mentioned previously, it doesn't happen all that often =E2=80=A6 this > latest one was after 21 days of uptime (or so) =E2=80=A6 I just upgraded = the > kernel on that machine to take into consideration changes to hfs > *since* the last upgrade, so it might be another 20-30 days before it > happens again *if* that last patch didn't' fix it =E2=80=A6 >=20 > I have several servers that do have fully operational remote consoles > though =E2=80=A6 to save time if/when it happens next, what do I all need= to > run? >=20 > ps auxlH > procstat -kk (for which process? =E2=80=A6 all part of that "group"= , or > just one of the apparently hung processes?) The pid that is in "T" state for the "ps auxlH". > sysctl debug.kdb.break_to_debugger=3D1 (shell) > (from console) >=20 Then the commands described in: http://www.freebsd.org/doc/en_US.ISO8859-1/book/developers-handbook/kerneld= ebug-deadlocks.html "show alllocks" and "show lockedvnods" may be the most useful, I think you can also "show sleepchain " "show lockchain " using the that is in "T" state. If you haven't built your kernel with "options WITNESS", this won't work we= ll. > now, is there a way of forcing it to do a dump core so that I can run > the various commands from a shell *after* its rebooted? No idea. Someone familiar with what you can do to core dump and how to get your system to make will have to answer this. > Not > particularly easy to redirect console output to a file (or is it?), so > anything that scrolls off the screen is pretty much lost =E2=80=A6 I'm us= ing a > DRAC card in most cases, no serial consoles or anything like that that > I can run within a script session =E2=80=A6 a 'ps' listing is >500 lines = long, > just to give an idea ... >=20 >=20 > _______________________________________________ > freebsd-stable@freebsd.org mailing list > http://lists.freebsd.org/mailman/listinfo/freebsd-stable > To unsubscribe, send any mail to > "freebsd-stable-unsubscribe@freebsd.org"