Date: Fri, 16 Sep 2016 21:42:29 +0300
From: Slawa Olhovchenkov
To: hiren panchasara
Cc: Konstantin Belousov, freebsd-stable@FreeBSD.org
Subject: Re: 11.0 stuck on high network load
Message-ID: <20160916184229.GF2840@zxy.spb.ru>
References: <20160904215739.GC22212@zxy.spb.ru>
 <20160905014612.GA42393@strugglingcoder.info>
 <20160914213503.GJ2840@zxy.spb.ru>
 <20160915085938.GN38409@kib.kiev.ua>
 <20160915090633.GS2840@zxy.spb.ru>
 <20160916181839.GC2960@zxy.spb.ru>
 <20160916183053.GL9397@strugglingcoder.info>
In-Reply-To: <20160916183053.GL9397@strugglingcoder.info>
List-Id: Production branch of FreeBSD source code

On Fri, Sep 16, 2016 at 11:30:53AM -0700, hiren panchasara wrote:

> On 09/16/16 at 09:18P, Slawa Olhovchenkov wrote:
> > On Thu, Sep 15, 2016 at 12:06:33PM +0300, Slawa Olhovchenkov wrote:
> >
> > > On Thu, Sep 15, 2016 at 11:59:38AM +0300, Konstantin Belousov wrote:
> > >
> > > > On Thu, Sep 15, 2016 at 12:35:04AM +0300, Slawa Olhovchenkov wrote:
> > > > > On Sun, Sep 04, 2016 at 06:46:12PM -0700, hiren panchasara wrote:
> > > > >
> > > > > > On 09/05/16 at 12:57P, Slawa Olhovchenkov wrote:
> > > > > > > I am trying to use 11.0 on a dual E5-2620 (no X2APIC).
> > > > > > > Under high network load, and maybe some additional condition, the
> > > > > > > system goes into an unresponsive state -- no reaction to network or
> > > > > > > console (USB IPMI emulation). INVARIANTS gives too high overhead.
> > > > > > > Is there some way to debug this?
> > > > > >
> > > > > > Can you panic it from console to get to db> to get backtrace and other
> > > > > > info when it goes unresponsive?
> > > > >
> > > > > ipmi console doesn't respond (chassis power diag doesn't react)
> > > > > login on sol console is stuck on *tcp.
> > > >
> > > > Is the 'login' you reference the ipmi client state, or do you mean
> > > > login(1) on the wedged host?
> > >
> > > on the wedged host
> > >
> > > > If BMC stops responding simultaneously with the host, I would suspect
> > > > the hardware platform issues instead of a software problem. Do you have
> > > > dedicated LAN port for BMC?
> > >
> > > Yes.
> > > But the BMC emulates a USB keyboard, and this may be a lock inside the
> > > USB system.
> > > "ipmi console doesn't respond" must be read as "ipmi console is running
> > > and attached, but the system doesn't react to keypresses on this console".
> > > At the same moment the system responds to `enter` on the ipmi sol console,
> > > but after entering `root` it is stuck in login in the '*tcp' state (I
> > > think this is NIS related).
> >
> > ~^B doesn't break to the debugger.
> > But I can log in to the sol console.
>
> You can probably:
> debug.kdb.enter: set to enter the debugger
>
> or force a panic and get vmcore:
> debug.kdb.panic: set to panic the kernel
>

I am still waiting for pmcstat to exit.

Oh, to send an NMI I need `ipmitool power diag`, not
`ipmitool chassis power diag`!
But the debugger is not entered:

^C^C^C^C^C^C^C^CNMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
NMI ... going to debugger
load: 9.91  cmd: pmcstat 16878 [runnable] 5930.57r 0.00u 0.00s 0% 2940k
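
For reference, the three ways of reaching the debugger mentioned in this thread can be collected into a small sh sketch. Only the two sysctl names and the `ipmitool power diag` subcommand come from the thread. The helper names (run, enter_debugger, force_panic, send_nmi) and the DRY_RUN variable are made up for illustration. The sketch defaults to a dry run, because the real commands stop or panic the machine.

```shell
#!/bin/sh
# Hypothetical helpers; DRY_RUN=1 (the default) only prints the command.
run() {
    if [ "${DRY_RUN:-1}" = "1" ]; then
        echo "would run: $*"
    else
        "$@"
    fi
}

# On the wedged host, if a root shell is still usable:
enter_debugger() { run sysctl debug.kdb.enter=1; }   # drop into the kernel debugger
force_panic()    { run sysctl debug.kdb.panic=1; }   # panic the kernel to get a vmcore

# From another machine, via the BMC: pulse a diagnostic interrupt (NMI).
# Any connection options for ipmitool are passed through unchanged.
send_nmi() { run ipmitool "$@" power diag; }
```

Calling `enter_debugger` with DRY_RUN unset prints the sysctl command instead of running it. Setting DRY_RUN=0 runs it for real, which should only be done on a machine that is allowed to stop. The sysctl variants need a working shell on the host, so with login stuck in '*tcp' the NMI path may be the only one left.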