Date: Thu, 3 Jul 2014 21:26:34 -0700 From: Marc Fournier <scrappy@hub.org> To: freebsd-stable@freebsd.org Subject: Re: FreeBSD 10.x + LiquidSoap + NFS == Server Hang Message-ID: <A82A30E3-381C-4D88-B803-F1F777562D1B@hub.org> In-Reply-To: <D12AB91B-78FC-47A7-84C5-8F69DCD5A5CD@hub.org>
index | next in thread | previous in thread | raw e-mail
Oh, on the remote console, last two lines I see are: == nfs_getpages: error 4 vm_fault: pager read error, pid 2957 (liquid soap) == if that helps any ... On Jul 3, 2014, at 9:23 PM, Marc Fournier <scrappy@hub.org> wrote: > > Hi all … > > I have a jail running on FreeBSD 10-STABLE (svn update as of July 2nd @ ~05:30 UTC: > > == > Working Copy Root Path: /usr/src > URL: https://svn0.us-east.freebsd.org/base/stable/10 > Relative URL: ^/stable/10 > Repository Root: https://svn0.us-east.freebsd.org/base > Repository UUID: ccf9f872-aa2e-dd11-9fc8-001c23d0bc1f > Revision: 268135 > Node Kind: directory > Schedule: normal > Last Changed Author: pfg > Last Changed Rev: 268132 > Last Changed Date: 2014-07-02 01:28:38 +0000 (Wed, 02 Jul 2014) > == > > Currently it has 3 jail’d environments running off it, with the files for them NFS mounted from a NetApp filer … and right now, the NFS mount that these jails are running from is “locked” … a ‘df’ hangs … trying to do a ‘jexec # /bin/tcsh’ into one of the jail’s hangs … etc. > > The same NFS file system is mounted and running on a half dozen other servers, and they are all operating just fine, so the NetApp is operating properly. > > If I move the jail with liquidsoap running around to a different server, the hang will follow to the new server, and the old server will once more become rock solid … > > I’m not 100% certain it is liquidsoap, but the hang appears to always coincide with reloading a new playlist … and although it happens frequently (more with recent upgrades), it doesn’t happen *every* night … > > This is on a remote server … so doing things at the console isn’t possible, and although I’ve got a remote console on this, I’ve never figured out how to break to the debugger through it, although I’m going to work on it to see if I can’t get it to work … > > Baring breaking to the debugger (is there a way, from the command line, to force it to break to the debugger?), is there anything else I can use to provide some sort of useful information? > > ps aux for the proces shows: > > # ps aux | grep liq > 1002 2957 0.0 0.7 226888 112792 - TLJ 4:45AM 370:27.23 /usr/local/bin/liquidsoap -q -d /usr/local/etc/liquidsoap/liquidsoap.liq > > and: > > # ps auxxwl | grep 2957 > 1002 2957 0.0 0.7 226888 112792 - TLJ 4:45AM 370:27.23 /usr/local/bin/l 1002 1 0 20 0 - > 1002 96280 0.0 0.0 12316 0 - IWJ - 0:00.00 pwait 2957 1002 96274 0 52 0 kqread > root 96508 0.0 0.0 18788 1828 4 S+ 4:19AM 0:00.00 grep 2957 0 96505 0 20 0 piperd > > Other commands I can / should run next time it happens … ? Which won’t take long ... > > Thanks … > >help
Want to link to this message? Use this
URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?A82A30E3-381C-4D88-B803-F1F777562D1B>
