From owner-freebsd-jail@freebsd.org Mon May 16 14:58:21 2016 Return-Path: Delivered-To: freebsd-jail@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 46C7AB3D2D1 for ; Mon, 16 May 2016 14:58:21 +0000 (UTC) (envelope-from allanjude@freebsd.org) Received: from mx1.scaleengine.net (mx1.scaleengine.net [209.51.186.6]) by mx1.freebsd.org (Postfix) with ESMTP id 2D1BC1B66 for ; Mon, 16 May 2016 14:58:20 +0000 (UTC) (envelope-from allanjude@freebsd.org) Received: from [10.1.1.2] (unknown [10.1.1.2]) (Authenticated sender: allanjude.freebsd@scaleengine.com) by mx1.scaleengine.net (Postfix) with ESMTPSA id 67E8DD759 for ; Mon, 16 May 2016 14:58:19 +0000 (UTC) Subject: Re: Unresponsive jails issues To: freebsd-jail@freebsd.org References: <6beab349-73bb-7159-cd81-443e115b687a@gjunka.com> From: Allan Jude Message-ID: <5739E00B.7090100@freebsd.org> Date: Mon, 16 May 2016 10:58:19 -0400 User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:38.0) Gecko/20100101 Thunderbird/38.7.2 MIME-Version: 1.0 In-Reply-To: <6beab349-73bb-7159-cd81-443e115b687a@gjunka.com> Content-Type: text/plain; charset=windows-1252 Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-jail@freebsd.org X-Mailman-Version: 2.1.22 Precedence: list List-Id: "Discussion about FreeBSD jail\(8\)" List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 16 May 2016 14:58:21 -0000 On 2016-05-16 08:55, Grzegorz Junka wrote: > I have a server running 13 jails for various system services. Recently I > added two jails to run simple go applications for testing. They open a > network socket and nginx, which is in another jail, and which round > robin balances requests to them. I mention that because it may be > related, however not necessarily because it was happening earlier. > > The problem is that every 2-3 days jails in my servers stop responding. > "jexec jailname tcsh" hangs forever, "service jail stop jailname" hangs > forever as well. "top" doesn't show anything suspicious. I can login > through SSH to the main server fine. I don't login to jails through SSH > so I can't check but it seems that when that happens they stop > responding because the services that are running in them stop too (e.g. > web server, imap, ...). I tried to "kill -9" the "jexec" process that > hangs but that doesn't work. > > My first question is what evidence should I gather when that happens so > that I can investigate the issue later on after the server is restarted? > > And the second question, any idea why that might be happening in the > first place? > > I am running FreeBSD 10.3 AMD64 updated from 10.2 a couple of weeks ago. > > Grzegorz > > _______________________________________________ > freebsd-jail@freebsd.org mailing list > https://lists.freebsd.org/mailman/listinfo/freebsd-jail > To unsubscribe, send any mail to "freebsd-jail-unsubscribe@freebsd.org" When you issue the jexec and it hangs, try pressing 'control+t' to see what the waitchan is. Along with what Bjoern said, using procstat -k to examine other processes etc. -- Allan Jude