From owner-freebsd-current@freebsd.org Mon Oct 5 05:24:00 2015 Return-Path: Delivered-To: freebsd-current@mailman.ysv.freebsd.org Received: from mx1.freebsd.org (mx1.freebsd.org [IPv6:2001:1900:2254:206a::19:1]) by mailman.ysv.freebsd.org (Postfix) with ESMTP id 1A164A10545 for ; Mon, 5 Oct 2015 05:24:00 +0000 (UTC) (envelope-from ohartman@zedat.fu-berlin.de) Received: from outpost1.zedat.fu-berlin.de (outpost1.zedat.fu-berlin.de [130.133.4.66]) (using TLSv1.2 with cipher ECDHE-RSA-AES256-GCM-SHA384 (256/256 bits)) (Client did not present a certificate) by mx1.freebsd.org (Postfix) with ESMTPS id C4B401623; Mon, 5 Oct 2015 05:23:59 +0000 (UTC) (envelope-from ohartman@zedat.fu-berlin.de) Received: from inpost2.zedat.fu-berlin.de ([130.133.4.69]) by outpost.zedat.fu-berlin.de (Exim 4.85) with esmtp (envelope-from ) id <1ZiyFg-002Dhq-HN>; Mon, 05 Oct 2015 07:23:56 +0200 Received: from p578a69f9.dip0.t-ipconnect.de ([87.138.105.249] helo=freyja.zeit4.iv.bundesimmobilien.de) by inpost2.zedat.fu-berlin.de (Exim 4.85) with esmtpsa (envelope-from ) id <1ZiyFg-001kke-9g>; Mon, 05 Oct 2015 07:23:56 +0200 Date: Mon, 5 Oct 2015 07:23:55 +0200 From: "O. Hartmann" To: Sean Bruno Cc: Eric Joyner , "Pieper, Jeffrey E" , Hans Petter Selasky , freebsd-current Subject: Re: CURRENT: net/igb broken Message-ID: <20151005072355.5bbadc0b@freyja.zeit4.iv.bundesimmobilien.de> In-Reply-To: <560EA859.2030500@freebsd.org> References: <20150915125618.0f752596@freyja.zeit4.iv.bundesimmobilien.de> <55F7FCB5.5090809@selasky.org> <2A35EA60C3C77D438915767F458D65688077F87B@ORSMSX111.amr.corp.intel.com> <20150917133509.6c87f5b6@freyja.zeit4.iv.bundesimmobilien.de> <20150917133712.0ffd01ee@freyja.zeit4.iv.bundesimmobilien.de> <55FC40CB.7090504@freebsd.org> <55FDA830.7030307@freebsd.org> <20150921105848.386d6ae0@freyja.zeit4.iv.bundesimmobilien.de> <20150922082323.6f861448@freyja.zeit4.iv.bundesimmobilien.de> <5601B591.6000005@freebsd.org> <20151002094721.1f890e19@freyja.zeit4.iv.bundesimmobilien.de> <560EA859.2030500@freebsd.org> Organization: FU Berlin X-Mailer: Claws Mail 3.12.0 (GTK+ 2.24.28; amd64-portbld-freebsd11.0) MIME-Version: 1.0 Content-Type: text/plain; charset=US-ASCII Content-Transfer-Encoding: 7bit X-Originating-IP: 87.138.105.249 X-BeenThere: freebsd-current@freebsd.org X-Mailman-Version: 2.1.20 Precedence: list List-Id: Discussions about the use of FreeBSD-current List-Unsubscribe: , List-Archive: List-Post: List-Help: List-Subscribe: , X-List-Received-Date: Mon, 05 Oct 2015 05:24:00 -0000 On Fri, 2 Oct 2015 08:52:57 -0700 Sean Bruno wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA512 > > > > On 10/02/15 00:47, O. Hartmann wrote: > > On Thu, 01 Oct 2015 15:39:11 +0000 Eric Joyner > > wrote: > > > >> Oliver, > >> > >> did you try Sean's suggestion? > >> > >> - Eric > >> > >> On Tue, Sep 22, 2015 at 1:10 PM Sean Bruno > >> wrote: > >> > > > > > > On 09/21/15 23:23, O. Hartmann wrote: > >>>>> On Mon, 21 Sep 2015 21:13:18 +0000 Eric Joyner > >>>>> wrote: > >>>>> > >>>>>> If you do a diff between r288057 and r287761, there are > >>>>>> no differences between the sys/dev/e1000, sys/modules/em, > >>>>>> and sys/modules/igb directories. Are you sure r287761 > >>>>>> actually works? > >>>>> > >>>>> I'm quite sure r287761 works (and r287762 doesn't), double > >>>>> checked this this morning again. I also checked r288093 and > >>>>> it is still not working. > >>>>> > >>>>> The ensure that I'm not the culprit and stupid here: > >>>>> > >>>>> I use a NanoBSD environment and the only thing that gets > >>>>> exchanged, is the underlying OS/OS revision. The > >>>>> configuration always stays the same. The base system for > >>>>> all of my tests is built from a clean source - (deleted > >>>>> obj/ dir, clean, fresh build into obj/ for every test I > >>>>> ran). > >>>>> > >>>>> I realised a funny thing. Playing around with > >>>>> enabling/disabling TSO (I have been told that could be the > >>>>> culprit in an earlier Email from this list) with the > >>>>> commend sequence: > >>>>> > >>>>> ifconfig igb1 down ifconfig igb1 -tso ifconfig igb1 up > >>>>> ifconfig igb1 down ifconfig igb1 tso ifconfig igb1 up . . > >>>>> . > >>>>> > >>>>> while a ping is pinging in the background a remote host > >>>>> connected to that specific interface, the ping does work > >>>>> for a while and dies then after a round trip of roughly 10 > >>>>> - 20. I can reproduce this. > >>>>> > >>>>> is that observation of any help? > >>>>> > >>>>> Regards, > >>>>> > >>>>> oh > >>>>> > >>>>>> > >>>>>> On Mon, Sep 21, 2015 at 1:58 AM O. Hartmann > >>>>>> wrote: > >>>>>> > >>>>>>> On Sat, 19 Sep 2015 11:23:44 -0700 Sean Bruno > >>>>>>> wrote: > >>>>>>> > >>>>> > >>>>> > >>>>> On 09/18/15 10:20, Eric Joyner wrote: > >>>>>>>>>> He has an i210 -- he would want to revert > >>>>>>>>>> e1000_i210.[ch], too. > >>>>>>>>>> > >>>>>>>>>> Sorry for the thrash Sean -- it sounds like it > >>>>>>>>>> would be a good idea for you should revert this > >>>>>>>>>> patch, and Jeff and I can go look at trying these > >>>>>>>>>> shared code updates and igb changes internally > >>>>>>>>>> again. We at Intel really could've done a better > >>>>>>>>>> job of making sure these changes worked across a > >>>>>>>>>> wider variety of devices. > >>>>>>>>>> > >>>>>>>>>> - Eric > >>>>> > >>>>> I've reverted the changes to head. I'll reopen the reviews > >>>>> and we can proceed from there. > >>>>> > >>>>> sean > >>>>> > >>>>> > >>>>>>>>>> > >>>>>>>>>> On Fri, Sep 18, 2015 at 9:50 AM Sean Bruno > >>>>>>>>>> > > >>>>>>>>>> wrote: > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>>> > >>>>>>>>>>> r287762 broke the system > >>>>>>>>>> > >>>>>>>>>> > >>>>>>>>>> Before I revert this changeset *again* can you > >>>>>>>>>> test revert r287762 from if_igb.c, e1000_82575.c > >>>>>>>>>> and e1000_82575.h *only* > >>>>>>>>>> > >>>>>>>>>> That narrows down the change quite a bit. > >>>>>>>>>> > >>>>>>>>>> sean [...] > >>>>>>> > >>>>>>>>>> > I'm now on r288057 on that specific machine, supposedly > >>>>>>> reverted changes that seemingly has been identified as > >>>>>>> the culprit. Still NO change in behaviour! > >>>>>>> > >>>>>>> r287761 works with the same configuration on igb > >>>>>>> (i210), any further does not. Not ping/connect from the > >>>>>>> outside, no ping/connect from the inside. Tried > >>>>>>> different protocols (SAMBA, ssh, LDAP, DNS). Affected > >>>>>>> is/are only boxes with the igb driver and i210 chipset > >>>>>>> (we do not have other chips covered by igb). > >>>>>>> > >>>>>>> Regards, Oliver [...] > > > > For my entertainment (and HPS's), can you run HEAD and revert > > r287775? > > > > sean [...] > > I did as suggested: > > > > checking out the most recent HEAD of CURRENT this morning, which > > is/was for me r288474. I applied then "svn merge -c -287775 .", > > which reverted(?) only r287775, which is something with > > tcp_output.c or so. I did not remember. > > > > Thanks. This is what I intended. > > > > I recompiled a fresh world (cleaning up /usr/obj completely by > > deleting the folder) and try running the target system with the > > created image. > > > > Result: the same as >r287761, it doesn't work. I reverted back to > > r287761, which works for me on the specific target hardware > > (Fujitsu Primergy RX 1330 M1). > > > > What's really confusing me is that I've reverted r287762 and you are > still having problems. It is confusing me also. I'm about to walk through the commits to check whether there is another possibility of influence - say: changes in the way things work due to configuration et cetera. Due to the fact I use a NanoBSD image on that very specific system, the configuration always is the very same but the underlying OS changes with the revision. An observation I made is also very strange: on most recent CURRENT flapping the state of the igb network interface by bringing it up and down repeatedly, I get sometimes, not always and reproducable, a connection - pings go through for a couple of pakets, but not more than 10 in the tests I ran so far. > > Can you set bootverbose (boot_verbose="YES" in loader.conf) with the > current version of -CURRENT and post the dmesg somewhere for me to > look at? Yes, of course, but in worst case I can do this not before Wednesday since we have to perform some tests on that specific system today and Tuesday and I'm now with the working revision r287761. It's a bit complicated, die to the fact the system is isolated from the internet so far and I have to pull the dmesg and save it to a flash drive and this I have to do on-site, and I'm not on-site at the moment. > > sean Oliver