From owner-freebsd-current@freebsd.org Fri Oct 12 15:01:37 2018 Return-Path: Delivered-To: freebsd-current@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2610:1c1:1:606c::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 548E410C0BB0 for ; Fri, 12 Oct 2018 15:01:37 +0000 (UTC) (envelope-from eric@vangyzen.net) Received: from smtp.vangyzen.net (hotblack.vangyzen.net [IPv6:2607:fc50:1000:7400:216:3eff:fe72:314f]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id 016878640E; Fri, 12 Oct 2018 15:01:36 +0000 (UTC) (envelope-from eric@vangyzen.net) Received: from disco.vangyzen.net (unknown [70.97.188.230]) by smtp.vangyzen.net (Postfix) with ESMTPSA id 383985648D; Fri, 12 Oct 2018 10:01:34 -0500 (CDT) To: freebsd-current , Stephen Hurd , Sean Bruno From: Eric van Gyzen Subject: iflib_timer hits hung label; never recovers Message-ID: Date: Fri, 12 Oct 2018 10:01:33 -0500 User-Agent: Mozilla/5.0 (X11; FreeBSD amd64; rv:60.0) Gecko/20100101 Thunderbird/60.0.1 MIME-Version: 1.0 Content-Type: text/plain; charset=utf-8; format=flowed Content-Language: en-US Content-Transfer-Encoding: 7bit X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.27 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Fri, 12 Oct 2018 15:01:37 -0000 My firewall is running head at r338402 (30 Aug). It has three I211 NICs (PCI dev 0x1539). About 24 hours ago, it said: Oct 11 22:29:03 asbestos kernel: igb1: TX(1) desc avail = 42, pidx = 524 Oct 11 22:29:03 asbestos kernel: Link state changed to down Oct 11 22:29:03 asbestos kernel: core: link state changed to DOWN It keeps saying this periodically: Oct 12 09:46:05 asbestos kernel: igb1: TX(1) desc avail = 1024, pidx = 0 $ dmesg | uniq -c 2455 igb1: TX(1) desc avail = 1024, pidx = 0 I can panic the box and get a vmcore, but what other information should I get before then? I tried to attach kgdb to the running kernel, but it failed. :( I grabbed sysctl dev.igb.1 and dropped it here: http://vangyzen.net/FreeBSD/igb.hang/ I haven't tried manually recovering with ifconfig because I want to diagnose why the driver couldn't do it automatically. I imagine it's hard to test this code path. :) Eric