From owner-freebsd-net@freebsd.org Fri Aug 4 17:05:00 2017 Return-Path: Delivered-To: freebsd-net@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 229E3DBEC98 for ; Fri, 4 Aug 2017 17:05:00 +0000 (UTC) (envelope-from hps@selasky.org) Received: from mail.turbocat.net (turbocat.net [88.99.82.50]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id DFFE66FE30 for ; Fri, 4 Aug 2017 17:04:59 +0000 (UTC) (envelope-from hps@selasky.org) Received: from hps2016.home.selasky.org (unknown [62.141.129.119]) (using TLSv1.2 with cipher ECDHE-RSA-AES128-GCM-SHA256 (128/128 bits)) (No client certificate requested) by mail.turbocat.net (Postfix) with ESMTPSA id 1A39026080B; Fri, 4 Aug 2017 19:04:51 +0200 (CEST) Subject: Re: mlx4en, timer irq @100%... To: Ben RUBSON , FreeBSD Net References: From: Hans Petter Selasky Message-ID: <72b9de84-5572-3737-b274-34d2c5cdf634@selasky.org> Date: Fri, 4 Aug 2017 19:02:38 +0200 User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:52.0) Gecko/20100101 Thunderbird/52.2.0 MIME-Version: 1.0 In-Reply-To: Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-net@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Networking and TCP/IP with FreeBSD List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 04 Aug 2017 17:05:00 -0000 On 08/04/17 18:59, Ben RUBSON wrote: > Hello, > > Not sure this is the right list, but as it seems related to a mlx4en device... > > # vmstat -i 1 > (...) > interrupt total rate > cpu23:timer 1198 1127 > > # top -P ALL > (...) > CPU 23: 0.0% user, 0.0% nice, 0.0% system, 100% interrupt, 0.0% idle > > # netstat -I mlxen0 -d -w 1 > input mlxen0 output > packets errs idrops bytes packets errs bytes colls drops > (and not output at all, same for mlxen1 !) > > # uname -sr > FreeBSD 11.0-RELEASE-p9 > > So, as you can see, one of my CPUs is used at 100% by timer interrupts, > since about 2 hours, and suddenly. > Initiating network connections to this server is now slow. > And what I found is that I can't use netstat on my 2 mlx4en devices anymore > (my monitoring tool is then no more fed). > > sysctl hw.mlxen0 is OK, no errors, and trafic counters grow slowly. > > What should I do ? > How to investigate on this ? > > Thank you very much for your help & support, > Hi, Try "procstat -ak". It should give an idea what is going on. What version of FreeBSD is this? Is this a regression issue? --HPS