From owner-freebsd-current@freebsd.org Wed Jan 11 08:28:21 2017 Return-Path: Delivered-To: freebsd-current@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id CC1B5CA90A5 for ; Wed, 11 Jan 2017 08:28:21 +0000 (UTC) (envelope-from ohartmann@walstatt.org) Received: from mout.gmx.net (mout.gmx.net [212.227.17.21]) (using TLSv1.2 with cipher DHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client CN "mout.gmx.net", Issuer "TeleSec ServerPass DE-2" (verified OK)) by mx1.freebsd.org (Postfix) with ESMTPS id 3F6741862 for ; Wed, 11 Jan 2017 08:28:20 +0000 (UTC) (envelope-from ohartmann@walstatt.org) Received: from freyja.zeit4.iv.bundesimmobilien.de ([87.138.105.249]) by mail.gmx.com (mrgmx101 [212.227.17.168]) with ESMTPSA (Nemesis) id 0LvQkh-1ca49R0a1L-010gZQ for ; Wed, 11 Jan 2017 09:28:10 +0100 Date: Wed, 11 Jan 2017 09:27:30 +0100 From: "O. Hartmann" To: freebsd-current Subject: CURRENT: em0 NIC freezes under heavy I/O on net Message-ID: <20170111091643.1d45ab39@freyja.zeit4.iv.bundesimmobilien.de> Organization: Walstatt X-Mailer: Claws Mail 3.14.1 (GTK+ 2.24.29; amd64-portbld-freebsd12.0) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Provags-ID: V03:K0:0IHxMej9QAWdk2IkvpOqc/qAemosX5g/ZkvZLKo7sjrSb1oIY/H pC6jowyOId1Wxp2izLlJ2K4kQNNBuk55K5sZRhDmarnUpvv1sGDxTQ7IrPNKoc8h+RQYBfB YviPvLIBsNLCSn0Xt8v/Xgd0Zs/7j6lsqxZgorIATW2ichsVyzNB/rVCwHoJdD1U6mvxJX7 uaOTIJMHSYg8v9WVrRDeA== X-UI-Out-Filterresults: notjunk:1;V01:K0:r+yfqRag/vs=:VlIvHNLHzGuOHNwQx2bEm1 LG1j0KtVCNkGFFpVG5MFVbBOPRJb8EdXbvUjONDiynW5sDV9JcIpeybRjubBkgNy2O+PfG+NV 9WK+kZFQBZZU2OqhwFUScsyX11AFbYRX3YJbHDyHaaeCZZ01dlAvbhei9w5+o6Pv0+CsEC/6x +5t9hqdsy0KId8FM+T1RZxfwUpQvn+4dVK/1ffgSGikUoyRgdMWZREa84xc0v3nkmVswEucfD nTnJh9BtS7SLBMIG2f+Hk8ZFt4AVwelTTY2Ur8EvhGHibNYy7UaXTv7cIebHF1LbYfxjZ4aFv 71pSzsY9V4ALY9CW5+oqri/8EAIC5Z17NA2HVbXdeA1XKB0xa588GQWsiYTCxvaodA38q4YoF SxqEZkpN7uHxw45a1imjdX6tezTXwVKS2iYd3GFBKacJLKz6xFMkmd+6WxznO/6HgYi75/tqh b7w8hOZ1/du1KIfc/3I6v26CSw0BeMOtQinVVrGdrMgvj/sKMe9ylA+VLcWyluiD/YiXNznbx kzc+SSqTaflv+DUV1wJqpkgAbPKhj+SCJSWkWNPLEAxpb+IZL/jqI8xjB595w144pomJMXkfM HIMshHUxjjt/s/6VPXwp0GZmfxznKJd3o1Zch113ZsQR2Y0SxZEDElNJQnK9+6Z7W5Nk/O65C BU8on6ciPdVo/5vOnTZvJiqT39kQLG5pb7DKNklq3fUNk2+6rYPXDOa+7MKfqntv/fxn7laSZ cJ3drUZBnSLx2c4GYA5NHJseVIBzRvAQRaOsd9AzuxpoGcSDfAlcxkYMlEpEgHAzZTyLvjcrK eKc8OZJWqFubbXfkjRacgzI68TLtluC4Kk6L7gak8lATU7DnNzErDGzDL0U+gUs37JEBbvBwf nRF4zusAhGHJ+kFPBHZH+i3xpJvRebSOeVq5WSZjr7hZVlnFqELk3Mn5s6S48kFadw2hGXAlx jgheOIR1XoHNKufIuVABlyEFjPH9KcnUrL0KnfhG+stpdZbhwkQWPE3N7F6/l4l3ZBAhJREYV 8OHAPER6aROpXMIuW9T5m7W7dJ3gzA9M+XR2wtcYBQnVVlN0kKwt36/3MSP50Jt8mA== X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.23 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Wed, 11 Jan 2017 08:28:21 -0000 Running recent CURRENT (FreeBSD 12.0-CURRENT #5 r311919: Wed Jan 11 08:24:28 CET 2017 amd64), the system freezes when doing a rsync over automounted (autofs) NFSv4 filesystem, mounted from another CURRENT server (same revision, but with BCM NICs). The host in question is a Fujitsu Celsius M740 equipted with an Intel NIC: [...] em0: port 0xf020-0xf03f mem 0xfb300000-0xfb31ffff,0xfb339000-0xfb339fff at device 25.0 numa-domain 0 on pci1 em0: attach_pre capping queues at 1 em0: using 1024 tx descriptors and 1024 rx descriptors em0: msix_init qsets capped at 1 em0: Unable to map MSIX table em0: Using an MSI interrupt em0: allocated for 1 tx_queues em0: allocated for 1 rx_queues em0: netmap queues/slots: TX 1/1024, RX 1/1024 [...] The pciconf output reveals: em0@pci0:0:25:0: class=0x020000 card=0x11ed1734 chip=0x153a8086 rev=0x05 hdr=0x00 vendor = 'Intel Corporation' device = 'Ethernet Connection I217-LM' class = network subclass = ethernet bar [10] = type Memory, range 32, base 0xfb300000, size 131072, enabled bar [14] = type Memory, range 32, base 0xfb339000, size 4096, enabled bar [18] = type I/O Port, range 32, base 0xf020, size 32, enabled cap 01[c8] = powerspec 2 supports D0 D3 current D0 cap 05[d0] = MSI supports 1 message, 64 bit enabled with 1 message cap 13[e0] = PCI Advanced Features: FLR TP I have a customized kernel. The NIC has revealed itself all the time as an "emX" device (never as igbX). The kernel contains device netmap (if relevevant). The phenomenon: Syncing a poudriere repository between to remote hosts, I use rsync on a NGSv4 exported filesystem, mounted via AUTOFS. So far, this work two days ago perfectly. Since yesterday, syncing brings down the network connection - the connection is simply dead. Terminating the rsync, bringing em0 down and up again doesn't help much, for short moments, the connection is established, but dies within seconds. Restarting via "service netif restart" all network services have the same effect: after the desaster, it is impossible for me to bring back the NIC/connection to normal, I have to reboot. The same happens when having heavy network load, but it takes a time and even rsync isn't "deadly" within the same timeframe - it takes sometimes a couple of seconds, another takes only one or two seconds to make the connection die. I checked with dd'ing a large file over that connection, it takes several seconds then to make the connection freezing (so, someone could reproduce iy not ncessarily using rsync). Kind regards, oh