Date: Thu, 3 Jul 2014 21:26:34 -0700 From: Marc Fournier <scrappy@hub.org> To: freebsd-stable@freebsd.org Subject: Re: FreeBSD 10.x + LiquidSoap + NFS == Server Hang Message-ID: <A82A30E3-381C-4D88-B803-F1F777562D1B@hub.org> In-Reply-To: <D12AB91B-78FC-47A7-84C5-8F69DCD5A5CD@hub.org> References: <D12AB91B-78FC-47A7-84C5-8F69DCD5A5CD@hub.org>
next in thread | previous in thread | raw e-mail | index | archive | help
Oh, on the remote console, last two lines I see are: =3D=3D nfs_getpages: error 4 vm_fault: pager read error, pid 2957 (liquid soap) =3D=3D if that helps any ...=20 On Jul 3, 2014, at 9:23 PM, Marc Fournier <scrappy@hub.org> wrote: >=20 > Hi all =85 >=20 > I have a jail running on FreeBSD 10-STABLE (svn update as of = July 2nd @ ~05:30 UTC: >=20 > =3D=3D > Working Copy Root Path: /usr/src > URL: https://svn0.us-east.freebsd.org/base/stable/10 > Relative URL: ^/stable/10 > Repository Root: https://svn0.us-east.freebsd.org/base > Repository UUID: ccf9f872-aa2e-dd11-9fc8-001c23d0bc1f > Revision: 268135 > Node Kind: directory > Schedule: normal > Last Changed Author: pfg > Last Changed Rev: 268132 > Last Changed Date: 2014-07-02 01:28:38 +0000 (Wed, 02 Jul 2014) > =3D=3D >=20 > Currently it has 3 jail=92d environments running off it, with = the files for them NFS mounted from a NetApp filer =85 and right now, = the NFS mount that these jails are running from is =93locked=94 =85 a = =91df=92 hangs =85 trying to do a =91jexec # /bin/tcsh=92 into one of = the jail=92s hangs =85 etc. >=20 > The same NFS file system is mounted and running on a half dozen = other servers, and they are all operating just fine, so the NetApp is = operating properly. >=20 > If I move the jail with liquidsoap running around to a different = server, the hang will follow to the new server, and the old server will = once more become rock solid =85=20 >=20 > I=92m not 100% certain it is liquidsoap, but the hang appears to = always coincide with reloading a new playlist =85 and although it = happens frequently (more with recent upgrades), it doesn=92t happen = *every* night =85 >=20 > This is on a remote server =85 so doing things at the console = isn=92t possible, and although I=92ve got a remote console on this, I=92ve= never figured out how to break to the debugger through it, although I=92m= going to work on it to see if I can=92t get it to work =85 >=20 > Baring breaking to the debugger (is there a way, from the = command line, to force it to break to the debugger?), is there anything = else I can use to provide some sort of useful information? >=20 > ps aux for the proces shows: >=20 > # ps aux | grep liq > 1002 2957 0.0 0.7 226888 112792 - TLJ 4:45AM 370:27.23 = /usr/local/bin/liquidsoap -q -d /usr/local/etc/liquidsoap/liquidsoap.liq >=20 > and: >=20 > # ps auxxwl | grep 2957 > 1002 2957 0.0 0.7 226888 112792 - TLJ 4:45AM 370:27.23 = /usr/local/bin/l 1002 1 0 20 0 - > 1002 96280 0.0 0.0 12316 0 - IWJ - 0:00.00 = pwait 2957 1002 96274 0 52 0 kqread > root 96508 0.0 0.0 18788 1828 4 S+ 4:19AM 0:00.00 = grep 2957 0 96505 0 20 0 piperd >=20 > Other commands I can / should run next time it happens =85 ? = Which won=92t take long ... >=20 > Thanks =85 >=20 >=20
Want to link to this message? Use this URL: <https://mail-archive.FreeBSD.org/cgi/mid.cgi?A82A30E3-381C-4D88-B803-F1F777562D1B>